
How to use leaderboards?

Open sdspieg opened this issue 1 year ago • 2 comments

Can you recommend a way to find the most appropriate recent pretrained language model(s) that focus on semantic similarity and work with KeyBERT? For example, these seem to be appropriate models that should also work for Dutch, but how can we tell which ones would actually work? We'd also like to run KeyBERT with different models so that we can see the differences. Do you happen to have any Jupyter notebooks that show how to do this? Thanks!

sdspieg, Aug 07 '23 16:08

Personally, I would advise looking at the MTEB Leaderboard. The models listed there are optimized for sentence similarity tasks, and the top models in particular ("bge-" and "gte-") work incredibly well and can be used within KeyBERT.
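For comparing models side by side, a minimal sketch might look like the following. It assumes the `keybert` package is installed and that each model name can be loaded by sentence-transformers. The helper names `keywords_per_model` and `keyword_overlap` are hypothetical and not part of KeyBERT, and the Jaccard overlap is just one simple way to quantify how much two models disagree.

```python
from typing import Dict, List, Tuple

# KeyBERT returns keywords as a list of (keyword, score) tuples.
Keywords = List[Tuple[str, float]]


def keywords_per_model(doc: str, model_names: List[str], top_n: int = 5) -> Dict[str, Keywords]:
    """Run KeyBERT once per embedding model and collect the results.

    Hypothetical helper, not part of KeyBERT. Each model is downloaded
    on first use, so this needs network access and some disk space.
    """
    # Imported lazily so the overlap helper below works without keybert installed.
    from keybert import KeyBERT

    results: Dict[str, Keywords] = {}
    for name in model_names:
        kw_model = KeyBERT(model=name)
        results[name] = kw_model.extract_keywords(
            doc,
            keyphrase_ngram_range=(1, 2),
            stop_words=None,  # assumption: non-English text, so no English stop-word list
            top_n=top_n,
        )
    return results


def keyword_overlap(a: Keywords, b: Keywords) -> float:
    """Jaccard overlap between the keyword sets of two models (scores ignored)."""
    set_a = {kw for kw, _ in a}
    set_b = {kw for kw, _ in b}
    if not set_a and not set_b:
        return 1.0  # two empty results are treated as identical
    return len(set_a & set_b) / len(set_a | set_b)
```

One possible use is `results = keywords_per_model(doc, ["BAAI/bge-small-en-v1.5", "thenlper/gte-small", "paraphrase-multilingual-MiniLM-L12-v2"])`, followed by `keyword_overlap` on pairs of entries in `results`. Those model names are illustrative picks rather than recommendations from this thread. The first two are English-focused, so for Dutch text a multilingual model such as the third is probably the safer starting point.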

MaartenGr, Aug 09 '23 11:08

Great! Thanks much Maarten...

sdspieg, Aug 19 '23 03:08