wasi-filesystem directory-entry-stream read batching

directory-entry-stream read batching

Open guybedford opened this issue 1 year ago • 3 comments

This was originally brought up by @dbaeumer - having to have a function call per directory entry to read through the directory entry stream might introduce unnecessary performance costs for iterating large directories.

It could be useful to have the ability to fetch a chunked list by specifying a read variant that takes some kind of upper chunk limit of directory counts at a time to batch reads together.

Nov 30 '23 01:11 guybedford

Yes, this makes sense to look into. The current iterator API is inspired by POSIX readdir which iterates over entries one at a time, with typical implementations reading in batches under the covers. But if benchmarks show that batching would be faster, we should consider it.

Dec 19 '23 22:12 sunfishcode

When this was originally raised it was based on performance benchmarking, which was later clarified to not be using the Wasmtime compliation cache when running the component benchmarks - when enabled the benchmarks demonstrated equality with preview1 and less call overhead.

@dbaeumer do you think this is something that should still be a priority for consideration to provide a noticeable performance improvement?

Dec 20 '23 07:12 guybedford

@guybedford for VS Code this is not necessary anymore. The performance problem raised from the fact that we add to do a context switch for every read. I decided to but the result in shared memory to avoid such a context switch.

However other implementation might hit the same problem. But from the VS Code side this can be closed

Jan 09 '24 13:01 dbaeumer

wasi-filesystem wasi-filesystem copied to clipboard

directory-entry-stream read batching

wasi-filesystem
wasi-filesystem copied to clipboard