llm2vec icon indicating copy to clipboard operation
llm2vec copied to clipboard

after training, the model with mmlu performace would downgrade?

Open trouble-maker007 opened this issue 1 year ago • 1 comments
trafficstars

trouble-maker007 avatar May 07 '24 03:05 trouble-maker007

As the models are trained to be encoders and not be generative, it is very likely the MMLU performance of LLM2Vec models will be lower than their causal LM counterparts.

vaibhavad avatar May 07 '24 19:05 vaibhavad

Feel free to re-open if you have any more questions about this issue.

vaibhavad avatar May 09 '24 14:05 vaibhavad

@vaibhavad thanks for the response, I just wonder after llm2vec, llm model a better text encoder than t5 in sd training

trouble-maker007 avatar May 28 '24 07:05 trouble-maker007