superduper Optimize model writing and loading during prediction

Optimize model writing and loading during prediction

Open blythed opened this issue 6 months ago • 0 comments

We should have separate processes writing model outputs, while other processes are loading data and predicting. This is particularly obvious when the inputs and outputs are large (order of magnitude of work in I/O comparable to prediction).

Aug 02 '24 10:08 blythed

superduper superduper copied to clipboard

Optimize model writing and loading during prediction

superduper
superduper copied to clipboard