tractor
tractor copied to clipboard
Spec out the async-ipc protocol
tractor utilizes a simple multiplexed protocol for conducting inter-process-task-communication (IPTC)?
Each per-process trio task can invoke tasks in other processes and received responses depending on the type of the remote callable. All packets are encoded as msgpack serialized dictionaries which I'll refer to as messages.
How it works
When an actor wants to invoke a remote routine it sends a cmd packet:
{'cmd': (ns, func, kwargs, uid, cid)} of type Dict[str, Tuple[str, str, Dict[str, Any], Tuple[str, str], str]]
Where:
nsis the remote module namefuncis the remote function namekwargsis adictof keyword arguments to call the function withuidis the unique id of the calling actorcidis the unique id of the call by a specific task
The first response is a function type notifier msg:
{'functype': functype, 'cid': cid} of type Dict[str, str].
Where functype can take one of:
'asyncfunc'for an asynchronous function'asyncgenfor a single direction stream either implemented using an asyn generator function or a@streamdecorated async func'context'for a inter actor, task linked, context. For now see #209.
Depending on the value of functype then the following message(s) are sent back to the caller:
'asyncfunc':- a single packet with the remote routine's result
{'return', result, 'cid', cid}of typeDict[str, Any]
- a single packet with the remote routine's result
'asyncgen':- a stream of packets with the remote async generator's sequence of results
{'yield', value, 'cid', cid}of typeDict[str, Any].
- a stream of packets with the remote async generator's sequence of results
'context':- a single
'started'message containing a first value returned from theContext.started()call in the remote task followed by a possible stream of{'yield', value, 'cid', cid}messages if a bidir stream is opened on each side. Again see #209.
- a single
A remote task which is streaming over a channel can indicate completion using a ''stop'` message:
{'stop': True}- an example of completion from a remote async generator
- upon receiving this message the stream (and possibly its caller task in the single direction case) should complete and teardown (eg. such as closing the local streaming interface)
If a remote task errors it should capture it's error output (still working on what output) and send it in a message back to its caller:
{'error': {tb_str': traceback.format_exc(), 'type_str': type(exc).__name__,}}- in
tractorthis is done by thepack_error()func
- in
A remote actor must have a system in place to cancel tasks spawned from the caller. The system to do this should be invoke-able using the existing protocol defined above and thus no extra "cancel" message should be required (I think).
- an example is when a local caller cancels its currently consuming stream by calling a
Actor._cancel_task()routine in the remote actor. This routine should have knowledge of the rpc system and be capable of looking up the caller's task-id to conduct cancellation. - any remote task should should in theory be cancel-able in this way but there is not yet a "cross-actor cancel scope" system in place for generic tasks (this is maybe a todo)
- likely we'll need our own
tractor.CancelScopearound calls toPortal.run()see #122
- likely we'll need our own
What should be done
- spec out this protocol and define it somewhere in the code base with proper generic type definitions that can be used to type check related internal ipc apis (see @vodik's comment in #35).
- work towards full bi-directional async generator support i.e. using
.asend()also mentioned in an internal todo. - deeply consider how far to move forward with async generators considering the outstanding problems with garbage collection and cleanup as per discussion, Pep 533, and the recommended use of
async_generator.aclosing()- maybe an alternative reactive programming approach may end up being better?
Pretty interested in using msgspec for the first draft of this.
I've made https://github.com/jcrist/msgspec/issues/25 to see what we can do about getting nested msgpec.Structs which would be most ideal for defining our internal message set.
This also ties in heavily with #196 since we'll likely want to offer a system for automatically inferring schema from target remote task functions and/or allowing for an explicit declaration system of some sort.
Also of note is how we could also possibly tie this in with capnproto schemas