Using vectorized IO (scatter/gather)

Open jakirkham opened this issue 3 years ago • 3 comments

Operations like writelines in the Stream and the Transport APIs provide library authors the opportunity to send collections of buffers that they would like written, sent, etc. in one go. This can be really handy as it takes only one pass (as opposed to multiple passes) through layers of code to prepare buffers before they go out.
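For context, here is a minimal sketch of what this looks like from the caller's side, using the standard asyncio streams API over a connected socket pair. The `roundtrip` helper is made up for illustration and is not part of asyncio or uvloop; the same `writelines` call applies when uvloop is installed as the event loop.

```python
import asyncio
import socket


async def roundtrip(chunks):
    """Send several buffers with one writelines() call and read them back."""
    a, b = socket.socketpair()  # two already-connected sockets
    reader, peer_writer = await asyncio.open_connection(sock=b)
    _, writer = await asyncio.open_connection(sock=a)

    # One call hands the whole collection of buffers to the transport.
    writer.writelines(chunks)
    await writer.drain()
    writer.close()
    await writer.wait_closed()

    data = await reader.read()  # read until EOF from the closed peer
    peer_writer.close()
    await peer_writer.wait_closed()
    return data


if __name__ == "__main__":
    print(asyncio.run(roundtrip([b"header\r\n", b"body-1", b"body-2"])))
```

The receiving side just sees a byte stream; whether the buffers went out in one system call or several is up to the event loop implementation, which is what this issue is about.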

Many OSes supply similar C-level operations for file descriptors, such as writev on POSIX-compatible systems and a similar call on Windows. For sockets, sendmsg on Linux and other Unix systems and WSASend on Windows fill the same role.
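Python exposes thin wrappers over two of these calls, which makes the idea easy to try out. A small sketch, assuming a Unix-like platform (`os.writev` and `socket.sendmsg` are not available everywhere); the two helper names are made up for illustration:

```python
import os
import socket


def gather_write_pipe(buffers):
    """Write several buffers to a pipe with a single writev() call."""
    r, w = os.pipe()
    try:
        nwritten = os.writev(w, buffers)  # one system call, many buffers
        return nwritten, os.read(r, nwritten)
    finally:
        os.close(r)
        os.close(w)


def gather_send_socket(buffers):
    """Send several buffers over a socket with a single sendmsg() call."""
    a, b = socket.socketpair()
    try:
        nsent = a.sendmsg(buffers)  # one system call, many buffers
        # For a small payload like this, one recv() is expected to return it all.
        return nsent, b.recv(nsent)
    finally:
        a.close()
        b.close()


if __name__ == "__main__":
    print(gather_write_pipe([b"he", b"llo"]))
    print(gather_send_socket([b"he", b"llo"]))
```

In both cases the kernel sees the separate buffers as one contiguous payload, so nothing has to be concatenated in user space first.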

Admittedly, I am not very familiar with libuv's API (so maybe devs here can comment on this), but it appears that some libuv APIs, like uv_write, can take multiple buffers and can internally redirect to sendmsg or WSASend. This also appears to be true for files with the uv_fs_write API.

AFAICT (and I could be wrong about this) uvloop's writelines for Streams calls an internal _write function in a loop, which could write one entire buffer (if it is sufficiently large, etc.) or at least queue a write. Please correct me if I'm misunderstanding anything here.

However given libuv's own propensity to use scatter/gather IO under-the-hood, it might be worth holding off on queuing write operations until all of the buffers in writelines are collected and prepped. This would allow one larger send, write, etc. to occur and if it is above the high watermark for any buffer (likely?), no additional buffer prepping would be necessary either.
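To make the proposal concrete, here is a toy model of the two strategies. This is not uvloop's code: `ToyStream` and its methods are hypothetical, and the "eager" variant is a simplification of the per-buffer behavior described above. Each entry in `flushes` stands for one gather-write handed to the OS.

```python
class ToyStream:
    """Toy model of a stream's write path (hypothetical, not uvloop's)."""

    def __init__(self):
        self.pending = []  # buffers collected but not yet handed off
        self.flushes = []  # one entry per simulated gather-write

    def _buffer(self, data):
        self.pending.append(bytes(data))

    def _flush(self):
        # Hand every pending buffer to the OS in one simulated call.
        if self.pending:
            self.flushes.append(self.pending)
            self.pending = []

    def writelines_eager(self, bufs):
        # Per-buffer behavior: each buffer triggers its own write.
        for buf in bufs:
            self._buffer(buf)
            self._flush()

    def writelines_deferred(self, bufs):
        # Proposed behavior: collect everything first, then write once.
        for buf in bufs:
            self._buffer(buf)
        self._flush()


if __name__ == "__main__":
    eager, deferred = ToyStream(), ToyStream()
    eager.writelines_eager([b"a", b"b", b"c"])
    deferred.writelines_deferred([b"a", b"b", b"c"])
    print(len(eager.flushes), len(deferred.flushes))
```

With three buffers the eager variant records three separate writes while the deferred one records a single write carrying all three, which is the saving being asked for.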


Side note: A separate interesting question would be doing something similar for reading. Not sure there is an API that could leverage this currently (may be wrong about this though). Maybe through pausing and resuming reading one could get close (though likely still leaves something on the table)?
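At the OS level the reading counterpart does exist. A small sketch using Python's `os.readv` wrapper, assuming a Unix-like platform; the `scatter_read` helper is made up for illustration and says nothing about what asyncio or uvloop expose:

```python
import os


def scatter_read(payload, sizes):
    """Read one payload from a pipe into several preallocated buffers."""
    r, w = os.pipe()
    try:
        os.write(w, payload)
        buffers = [bytearray(n) for n in sizes]
        nread = os.readv(r, buffers)  # one system call fills the buffers in order
        return nread, [bytes(b) for b in buffers]
    finally:
        os.close(r)
        os.close(w)


if __name__ == "__main__":
    print(scatter_read(b"hello world", [5, 6]))
```

The open part is how an event loop would know which buffers to offer ahead of time, which is why the API question above matters more than the system call itself.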

Note: There may be similar optimizations possible in asyncio ( https://github.com/python/asyncio/pull/339 ) ( https://github.com/python/cpython/pull/19062 )

jakirkham, Feb 28 '21 00:02

After digging into this more, I think uv__try_write, which uvloop already calls in _exec_write, uses sendmsg and so does handle scatter/gather. Plus, uvloop can already collect multiple buffers to send. So uvloop is already doing most of the right things.

We would just want to hold off on calling _queue_write until we have finished processing buffers in writelines.

jakirkham, Sep 08 '21 22:09

Submitted PR ( https://github.com/MagicStack/uvloop/pull/445 ) to make the change proposed in the last comment.

jakirkham, Oct 01 '21 21:10

PR ( https://github.com/MagicStack/uvloop/pull/445 ) has since been merged and was included in the 0.17.0 release.

An open question remains around how to do this for reading.

jakirkham, Sep 26 '22 08:09