COIL
COIL copied to clipboard
Retrieval latency is very large with one thread
Hi, thank you for sharing this codes.
I tested the latency of COIL using the retriever-fast.py
with one thread and one shard. Batch size is set to one. The cpu info is Intel(R) Xeon(R) CPU E5-2630 v4 @ 2.20GHz. However, the query latency is roughly 4 seconds, which is substantially larger than 0.38s reported in the paper. I wonder why this happens. Does the paper use multi-threads to evaluate the latency?