Skip to content
This repository has been archived by the owner on Oct 16, 2021. It is now read-only.

Latest commit

 

History

History

docs

CPy2Py

PyPI Package Code Health Test Health Code Coverage

Multi-intepreter execution environment

cpy2py allows multiple interpreters to act as one application. In parallel to the main interpreter, other interpreters are run to execute parts of the application.

To connect two interpreters using cpy2py, it must be installed in both of them. For example, to use CPython and PyPy in a single application, make sure the package is available to them:

python -m pip install cpy2py --user
pypy -m pip install cpy2py --user

A twinterpreter is simply another interpreter running as a subprocess - with some glue and magic sprinkled on it. You can control and create them using a :py:class:`cpy2py.TwinMaster`.

You should only ever worry about two methods: :py:meth:`TwinMaster.start` launches the twinterpreter. :py:meth:`TwinMaster.execute` executes an arbitrary callable in the twinterpreter.

from cpy2py import TwinMaster
from my_module import my_function
twinterpreter = TwinMaster('pypy')
twinterpreter.start()

if __name__ == "__main__":
    twinterpreter.execute(my_function, 1, 2, 3, 'ka-pow!', doctor="who?")

The real power of :py:mod:`cpy2py` are Twins - objects living in one twinterpreter and being represented by proxies in any other interpeter. Using twins, you can seamlessly split your application across multiple twinterpreters.

You create twins by inheriting from :py:class:`cpy2py.TwinObject` instead of :py:class:`object` and setting a __twin_id__. That's it.

from cpy2py import TwinObject
class SuperComputer(TwinObject):
    __twin_id__ = 'pypy'  # makes class native to pypy twinterpeter

    def megaloop(self, x, y):
        return sum(a+b for a in range(x) for b in range(y))

class CWrapper(TwinObject):
    __twin_id__ = 'python'  # makes class native to python twinterpeter

    def callme(self, who, what="buy milk"):
        return some_clib.c_fcn_cll_cplx_xmpl(who, what)

If you don't set __twin_id__ on a child of :py:class:`cpy2py.TwinObject`, the class will always be native to the main interpreter. Handy for all the stuff that's needed everywhere but really doesn't belong anywhere.

note:At the moment, you have to explicitly start a class's native twinterpreter before instantiating the class. Only the main interpreter is always available, of course.

Instead of full-fletched objects, you can also define functions as twins. These are automatically called in their native twinterpreter.

from cpy2py import twinfunction

@twinfunction('pypy')
def superlooper(count=1000, add=3, start=0):
    for _ in range(count):
        start += add
    return add

print(superlooper(int(1E6), 1))
note:A :py:func:`cpy2py.twinfunction` is a regular function wrapping a callable. Unlike a :py:class:`cpy2py.TwinObject`, it will not pass attribute assignments.

The core of :py:mod:`cpy2py` supports some :py:mod:`logging` facilities. All such loggers are children of the __cpy2py__ logger. By default, no active handlers are attached and propagation is disabled. If needed, you reconfigure them like any other :py:mod:`logging` logger to suit your needs. Note that if python is run with the -O flag, several logging calls are skipped entirely to improve performance.

For small scale debugging, one can set the environment variable :envvar:`CPY2PY_DEBUG`. If it is defined and not empty, logging output is written to stderr. In addition, if it names a valid :py:mod:`logging` level, that logging level is used.

Note that loggers are meant for development and only address the internal state. Your application should not depend on this information. Unless :py:mod:`cpy2py` misbehaves (or you suspect it to), ignore its logging.

CPy2Py is stable at its core, but still has some features missing. What's there is more than sufficient to significantly enhance your applications.

  • Seamlessly integrates into python code.
    • All internals are wrapped away behind the plain python interfaces. No eval, exec or code strings required.
    • Lightweight hooks optimize objects and functions for use with :py:mod:`cpy2py`.
    • If needed, any pickle'able callable can be dispatched to another interpreter.
  • Objects natively integrate with twinterpreters.
    • Objects can live in a specific interpreter, with proxies replacing them in others. Classes and instances transparently interact with :py:mod:`cpy2py` in the background.
    • Both class and instance attributes work as expected. Methods, classmethods, staticmethods and descriptors are fully supported.
    • Inheritance is fully supported, including multiple inheritance. Affiliation to interpreters can be changed freely.
  • A wide range of interpeters is supported.
    • Pure python, no dependencies means perfect portability.
    • Any interpreter compatible with python 2.6 to 3.7 is supported.
    • Virtual Environments work out of the box.
    • Tested with cpython and pypy, on Linux and Mac OSX.
  • Importing functions and classes from __main__ may fail if the module can only be imported via its path.
  • By default, calls across interpreters are blocking and not threadsafe. If recursion switches between twinterpreters, :py:class:`cpy2py.TwinMaster` must use the 'async' kernel.
  • Module level settings are not synchronized. For example, configuration of :py:mod:`logging` is not applied to twinterpreters. Use :py:class:`~cpy2py.twinterpreter.group_state.TwinGroupState` for initialisation, write modules aware of twinterpreters, or use immutable module-level initializers.
  • A :py:mod:`weakref` to objects only takes local references into account, not cross-interpreter references.

Dispatching to another twinterpreter adds about 200 - 300 us of overhead. This is mainly due to serialization for the IPC between the interpreters. Using the asynchronous kernel, there is an additional overhead for creating threads.

In general, twinterpreters get faster the shorter they have to wait between requests. pypy twinterpreters benefit from a high number of requests, allowing their JIT to warm up. Python3 connections are the fastest, provided that both twinterpreters support pickle protocol 4.

A notable fraction of time is spent on debugging output via :py:mod:`logging`. Even if no output is produced, :py:mod:`cpy2py` is optimized to a point where the logging call is noticeable. If needed, any per-call logging can be disabled by running python in optimized mode. See the python documentation on the :option:`-O` option and :envvar:`PYTHONOPTIMIZE` environment variable.

You can benchmark the overhead yourself using the :py:mod:`cpy2py_benchmark` tools.

pypy2 15x15k 30x5k 300x1
pypy2 187 ± 1.5 us 228 ± 2.5 us 505 ± 51.8 us
pypy3 165 ± 1.3 us 209 ± 2.4 us 402 ± 8.0 us
python2.7 178 ± 0.6 us 139 ± 0.3 us 239 ± 7.6 us
python3.4 149 ± 0.4 us 118 ± 0.2 us 258 ± 8.0 us