rank_llm icon indicating copy to clipboard operation
rank_llm copied to clipboard

Time profiling of RankVicunna or RankZephyr zero-shot evaluation/ inference on BEIR datasets

Open cramraj8 opened this issue 10 months ago • 0 comments

Hi, I wonder the time profiling of each LLMs to run across queries for re-ranking. I am running RankVicunna and RankZephyr on the zero-shot setting across BEIR datasets. For FiQA (648 queries) to conduct re-ranking of BM25 top-100 documents, RankVicunna takes ~4.5 hrs on a powerful machine (H100 GPU). The calculation leads to 30-40 seconds per query to re-rank. I wonder if this is the ideal time profiling anyone observed, or the code can be optimized with different window size or strides. Thanks in advance!

cramraj8 avatar Apr 17 '24 21:04 cramraj8