scandinavian-embedding-benchmark
scandinavian-embedding-benchmark copied to clipboard
A Scandinavian Benchmark for sentence embeddings
https://aclanthology.org/2023.nodalida-1.61/
Add a naive baseline model for 7b model. E.g. one of the best performing models on ScandEval. Potentially take a look at: https://github.com/vllm-project/vllm/issues/1654
One way to do this is to add create a gradio app and embed it. This would allow for much more user customization in the averaging.
Seems like Scandisent is a valid cross-lingual dataset for the Scandinavian languages. https://github.com/timpal0l/ScandiSent?tab=readme-ov-file
Some models such as the "translate and embed" models can't be used for cross-lingual tasks, ideally their scores should just be nan. I am unsure what the best solution is....
Might be interesting to add author-style clustering based on: https://huggingface.co/datasets/MiMe-MeMo/Corpus-v1.1
Add the: sentence-transformers/use-cmlm-multilingual as it performs well on ScandEval
Add a time x performance plot to the website. This allows us to see how performance has developed over time. This requires us to add a date to each of...
The current implementation of the evaluators only gives a singular score. This makes it hard to see the uncertainty in the scores. A potential solution is bootstrapping on the document...