langtest Preparing Embeddings Benchmarks (LangTest)

Preparing Embeddings Benchmarks (LangTest)

Open ArshaanNazir opened this issue 1 year ago • 9 comments

Dec 27 '23 17:12 ArshaanNazir

@ArshaanNazir please add your self as an assignee.

Dec 28 '23 08:12 JustHeroo

@ArshaanNazir could you share your updates?

Feb 27 '24 10:02 JustHeroo

@chakravarthik27 please share the latest updates.

Mar 05 '24 10:03 JustHeroo

Hi @JustHeroo,

We have finished benchmarking the paul_graham dataset and now I am working on creating curated datasets for retrieval evaluation to do the benchmarking embedding models. I plan to generate question-answer pairs for each dataset and implement metrics to evaluate embedding models.

Mar 05 '24 17:03 chakravarthik27

@chakravarthik27 please update the latest status here.

Mar 11 '24 19:03 JustHeroo

@ArshaanNazir could you share your update

Mar 12 '24 10:03 Cabir40

@Cabir40 Kalyan is working on it. He will be enhancing embedding benchmarks for other retrieval tasks. He is working on FinBERT-QA right now.

Mar 12 '24 10:03 ArshaanNazir

@chakravarthik27 is there any update?

Mar 19 '24 09:03 Cabir40

Hi @Cabir40,

Still, it is in progress, I am currently working alone on the langtest project and implementing a high priority feature similar to Open LLM Leaderboard by Hugging Face(Eleuther). It may take some time to complete the embedding benchmarks.

Mar 19 '24 10:03 chakravarthik27

langtest langtest copied to clipboard

Preparing Embeddings Benchmarks (LangTest)

langtest
langtest copied to clipboard