Node WorkerChannel native addon
Substitute spawning a process with mediasoup for running mediasoup in a thread created by this addon.
The communication is now done via message passing rather than using unixsocket, avoiding the kernel networking stack for each and every message passed from Node to C++ and viceversa.
Performance gains are enhanced by approximately 8 times. Tested by sending a request from Node to C++ and waiting for the response N times in a loop.
Caveats:
- Since we don't use a separate process, a crash in mediasoup would crash Node, meaning it must be handled by node process signal handling.
- At the same time, the coredumps will be named something-node-something
- In order to debug coredumps the worker-channel.node binary can be used along with the coredump file and gdb/lldb
TODO:
- [ ] Document removed 'died' and 'subprocessclose' Worker.ts events.
- [ ] Document removed 'died()' and 'subprocessClosed()' Worker.ts methods.
- [ ] Missing all the prebuilt binary upload/download stuff.
.prettierignore and .eslintignore should include /node/workerChannel/lib.
Please merge v3 and check prettier and eslint path arrays in npm-scripts.mjs
Update:
This PR is almost ready to be shipped if we want to.
The are two main concerns already defined in the PR description:
- Since we don't use a separate process for the application and the worker, a crash in mediasoup would crash Node, meaning it must be handled by node process signal handling. This is true for Rust too.
- The coredumps will be named something-node-something
- In order to debug coredumps the worker-channel.node binary can be used along with the coredump file and gdb/lldb
- Since the worker is now used in multi-thread fashion, the very same concerns apply to Node as to Rust, such us https://github.com/versatica/mediasoup/issues/1352
Having said this, do we still want to go in this direction?. As indicated in the description:
Performance gains are enhanced by approximately 8 times. Tested by sending a request from Node to C++ and waiting for the response N times in a loop.
But of course it comes with the mentioned caveats too.
What do you think @ibc, @nazar-pc?
If there use cases where the performance gap is that large, I think it is worth it. The proper solution to crashes is largely to write code that doesn't crash rather than running it in a separate process.
I think that keeping the current design just to keep the "benefit" of not crashing the node process if the worker fails is not a good argument so I am ok but:
How does this change affect prebuilt worker binaries? Now they should not be "binaries" but "libraries" so more changes are needed specially in the naming of those assets and the function in npm-script that fetches them.
How does this change affect prebuilt worker binaries? Now they should not be "binaries" but "libraries" so more changes are needed specially in the naming of those assets and the function in npm-script that fetches them.
Yes, there is a TODO for this.
:+1:,