Daniel Han
Hmm probably not - I would just increase grad accum
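A minimal sketch of what raising gradient accumulation instead of batch size looks like, assuming an HF `TrainingArguments`-style setup (the exact sizes and step counts here are placeholders, not the values from this conversation):

```python
from transformers import TrainingArguments

# Lower the per-device micro-batch and raise gradient accumulation so the
# effective batch size (2 * 8 = 16) stays the same while using less VRAM.
args = TrainingArguments(
    output_dir = "outputs",
    per_device_train_batch_size = 2,
    gradient_accumulation_steps = 8,
    learning_rate = 2e-4,
    max_steps = 60,
)
```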
@acsankar Did you use a chat template with the merged model? Alpaca style?
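For reference, an Alpaca-style prompt follows the layout below; if the model was fine-tuned on this format, inference prompts should match it. The instruction/input text is purely illustrative:

```python
# Standard Alpaca-style prompt layout; the merged model expects the same
# format at inference that it saw during fine-tuning.
alpaca_prompt = """Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
{instruction}

### Input:
{input}

### Response:
"""

prompt = alpaca_prompt.format(
    instruction = "Summarize the text.",
    input = "Unsloth makes LoRA fine-tuning faster.",
)
```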
@Shuaib11-Github Oh yes you asked in Discord! 1. Unsloth inference makes LoRA / QLoRA 2x faster. You benchmarked HF without any adapters. Best to merge then benchmark. 2. Your HF...
@Shuaib11-Github Oh yes I checked and responded on Discord: Unsloth 16bit is 2x faster than HF inference. 4bit is ~1.42x faster than HF using your exact notebook, and also...
@Shuaib11-Github I made 2 reproducible notebooks using your exact example. 1. Fast Unsloth 16bit version 2x faster takes 5.94s / 3.33s / 2.6s https://colab.research.google.com/drive/1C9DDEtZD1zKVSh3zG1dIflP5GXoT8s-e?usp=sharing 2. Slow HF 16bit version takes...
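For the "merge then benchmark" step above, a minimal sketch with PEFT, assuming a LoRA adapter saved on disk (the paths are placeholders):

```python
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

# Load the base model with the LoRA adapter attached, fold the adapter
# weights into the base weights, and save the merged checkpoint so the
# benchmark compares the same merged model on both sides.
model = AutoPeftModelForCausalLM.from_pretrained("path/to/lora_adapter")
model = model.merge_and_unload()
model.save_pretrained("merged_model")

tokenizer = AutoTokenizer.from_pretrained("path/to/lora_adapter")
tokenizer.save_pretrained("merged_model")
```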
Wait is this a vision model?
Maybe https://stackoverflow.com/questions/72367324/calculate-precision-recall-f1-score-for-custom-dataset-for-multiclass-classifi?
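Along the lines of that StackOverflow answer, a small sketch with scikit-learn's `classification_report` for multiclass precision / recall / F1 (the labels here are made up for illustration):

```python
from sklearn.metrics import classification_report, precision_recall_fscore_support

y_true = [0, 1, 2, 2, 1, 0]   # illustrative gold labels
y_pred = [0, 2, 2, 2, 1, 0]   # illustrative predictions

# Per-class precision / recall / F1, plus macro and weighted averages.
print(classification_report(y_true, y_pred, digits = 3))

# Or a single macro-averaged triple:
p, r, f1, _ = precision_recall_fscore_support(y_true, y_pred, average = "macro")
print(p, r, f1)
```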
@GBrochado11 When did you install Unsloth? Can you check your xformers and CUDA versions?
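A quick way to check those versions, assuming a standard PyTorch + xformers install:

```python
import torch, xformers

print("torch    :", torch.__version__)
print("CUDA     :", torch.version.cuda)
print("xformers :", xformers.__version__)
print("GPU      :", torch.cuda.get_device_name(0) if torch.cuda.is_available() else "none")
```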
Oh my, that's a very, very weird problem - that seems like an Xformers issue itself, hmm
@mrheinen Is this via Conda as well?