Corpus encoding times for hotpotqa on A100 GPU

Open jeyendranbalakrishnan opened this issue 1 year ago • 0 comments

I'm trying to reproduce evaluate_sbert.py on the hotpotqa dataset on an A100 GPU (AWS ml.p4d.24xlarge instance), using msmarco-distilbert-base-tas-b model. According to the progress, it seems to be taking about 8 minutes for ~ 10,000 corpus passages, implying it will take about 69 hours for the entire 5,233,329 passages. Is this normal, or am I doing something really wrong? If the latter, could anybody share some expected times, or any tips? Thanks a lot!

Feb 27 '24 08:02 jeyendranbalakrishnan