scandinavian-embedding-benchmark icon indicating copy to clipboard operation
scandinavian-embedding-benchmark copied to clipboard

Add dataset annotation for construction

Open KennethEnevoldsen opened this issue 1 year ago • 1 comments

E.g. for ScaLA it is natural text, but synthetically augmented (and humanly evaluated). Other construction methods could include translations. Others could be found or expert-generated. It is probably reasonable to compare this with the HF metadata as well.

KennethEnevoldsen avatar Mar 03 '24 17:03 KennethEnevoldsen

This has been added to MTEB and can be added to SEB in the future

KennethEnevoldsen avatar Jul 18 '24 12:07 KennethEnevoldsen