lance
CPU count is being used to throttle PQ loading parallelism
When searching an IVF/PQ index, we need to load partitions. Currently, we use the CPU count to determine how many partitions to load in parallel. However, partition loading is primarily an I/O-bound task, so the limit should be the I/O parallelism instead.
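
To illustrate the idea, here is a minimal Python sketch. It is not Lance's actual code. It bounds concurrent partition loads with a semaphore whose size is passed in by the caller, so the limit can be an I/O parallelism value rather than the CPU count. The names `load_partition`, `load_partitions`, and `io_parallelism`, and the value 16, are hypothetical and chosen only for this example. The simulated read is a sleep, standing in for a storage request that spends its time waiting rather than computing.

```python
import asyncio
import os


async def load_partition(part_id: int, tracker: dict) -> int:
    # Hypothetical stand-in for reading one IVF partition from storage.
    tracker["in_flight"] += 1
    tracker["peak"] = max(tracker["peak"], tracker["in_flight"])
    await asyncio.sleep(0.01)  # simulated I/O wait; the CPU is idle here
    tracker["in_flight"] -= 1
    return part_id


async def load_partitions(part_ids, limit: int):
    """Load partitions with at most `limit` loads in flight at once.

    Returns the loaded ids (in input order) and the peak concurrency seen.
    """
    tracker = {"in_flight": 0, "peak": 0}
    sem = asyncio.Semaphore(limit)

    async def bounded(part_id: int) -> int:
        async with sem:
            return await load_partition(part_id, tracker)

    results = await asyncio.gather(*(bounded(p) for p in part_ids))
    return results, tracker["peak"]


if __name__ == "__main__":
    cpu_limit = os.cpu_count() or 1  # the limit the issue describes as current
    io_parallelism = 16              # assumed value, for illustration only

    for name, limit in [("cpu count", cpu_limit), ("io parallelism", io_parallelism)]:
        _, peak = asyncio.run(load_partitions(range(32), limit))
        print(f"{name}: limit={limit}, peak concurrent loads={peak}")
```

On a machine with few cores, the CPU-based limit caps the number of outstanding reads well below what the storage layer could serve, even though the loads spend almost all of their time waiting.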