unsloth
Model inference - performance drop when using unsloth
Hi, I fine-tuned a model (yam-peleg/Experiment26-7B) using unsloth. During inference, the model's correctness drops when I load it with unsloth's FastLanguageModel. I see that some modules are replaced. It looks a bit odd that LlamaRotaryEmbedding is used for a Mistral-type model. Any idea whether this could cause the performance drop?
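For anyone who wants to check which modules get replaced, here is a minimal sketch of one way to compare the two loaded models by submodule class name. It only assumes both objects expose `named_modules()` as any `torch.nn.Module` does. The names `module_types`, `diff_module_types`, `hf_model` and `unsloth_model` are hypothetical and are not part of unsloth or transformers.

```python
def module_types(model):
    """Map each submodule path to its class name.

    Relies only on named_modules(), which every torch.nn.Module provides.
    """
    return {name: type(mod).__name__ for name, mod in model.named_modules()}


def diff_module_types(old, new):
    """Return {path: (old_class, new_class)} for every path whose class differs.

    A path that exists on only one side is reported with None on the other.
    """
    diff = {}
    for path in sorted(set(old) | set(new)):
        a, b = old.get(path), new.get(path)
        if a != b:
            diff[path] = (a, b)
    return diff


# Hypothetical usage, assuming hf_model and unsloth_model are already loaded:
# changes = diff_module_types(module_types(hf_model), module_types(unsloth_model))
# for path, (a, b) in changes.items():
#     print(path, a, "->", b)
```

A difference in class name alone does not show that the outputs differ, so it is worth also comparing the logits of both models on the same prompt.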
Old inference:
Unsloth way:
When comparing model files I see the following differences:
and this: