tpie
Improve input/output buffer efficiency in parallel
Right now, the main thread in the parallel implementation blocks when no worker is available, even though it could use that time to fill more input buffers for future processing. Also, we should swap the input/output buffers instead of copying them with std::copy. Both are minor issues, but fixing them might give a slight speedup.
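To illustrate the second point, here is a minimal sketch of the difference between copying and swapping a buffer. It assumes the buffers are `std::vector`-like; `buffer_t`, `hand_off_by_copy` and `hand_off_by_swap` are hypothetical names for illustration, not TPIE's actual types or functions.

```cpp
#include <algorithm>
#include <utility>
#include <vector>

// Hypothetical stand-in for a worker's input/output buffer (not TPIE's real type).
using buffer_t = std::vector<int>;

// Copy-based hand-off: every element is copied, so the cost is O(n).
inline void hand_off_by_copy(const buffer_t & staged, buffer_t & worker_input) {
    worker_input.resize(staged.size());
    std::copy(staged.begin(), staged.end(), worker_input.begin());
}

// Swap-based hand-off: the two vectors exchange their internal storage,
// so the cost is O(1) regardless of buffer size. Afterwards `staged`
// holds the worker's old buffer and can be refilled by the main thread.
inline void hand_off_by_swap(buffer_t & staged, buffer_t & worker_input) {
    std::swap(staged, worker_input);
}
```

With the swap variant, no element is moved: the worker ends up owning the storage the main thread just filled, and the main thread gets the worker's previous buffer back for reuse. Whether this is safe depends on the caller holding whatever lock protects the worker's buffer at the time of the swap.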