hnswlib icon indicating copy to clipboard operation
hnswlib copied to clipboard

Question about recall performance

Open wuwenjunwwj opened this issue 4 years ago • 1 comments

I build an index with hnswlib in inner product space(normalized data),the data dimension is 128,m= 60, ef_construction = 400。I use random vec(normalized) to test recall performance,I get the result below: topK recall 1 40% 100 70% 500 88% 5000 97% Is this reasonable?how can I improve top1、top100 recall acc in this situasion?

wuwenjunwwj avatar Mar 10 '20 15:03 wuwenjunwwj

Hi @wuwenjunwwj, Yes, this might be reasonable. When you increase the number of neighbors (K) you also increase the ef search parameter (ef>K). You can try setting, e.g. ef=1000 and test at K=100.

yurymalkov avatar Mar 10 '20 21:03 yurymalkov