llm2vec after training, the model with mmlu performace would downgrade?

after training, the model with mmlu performace would downgrade?

Open trouble-maker007 opened this issue 1 year ago • 1 comments

trafficstars

May 07 '24 03:05 trouble-maker007

As the models are trained to be encoders and not be generative, it is very likely the MMLU performance of LLM2Vec models will be lower than their causal LM counterparts.

May 07 '24 19:05 vaibhavad

Feel free to re-open if you have any more questions about this issue.

May 09 '24 14:05 vaibhavad

@vaibhavad thanks for the response, I just wonder after llm2vec, llm model a better text encoder than t5 in sd training

May 28 '24 07:05 trouble-maker007

llm2vec llm2vec copied to clipboard

after training, the model with mmlu performace would downgrade?

llm2vec
llm2vec copied to clipboard