Daniel Han
@wsp317 I fixed it just then! Sorry on the delay! If you're on a local machine, please update Unsloth via ``` pip uninstall unsloth -y pip install --upgrade --force-reinstall...
Sadly, full finetuning isn't yet supported - some Unsloth community members have tried it, and it does converge, albeit the layernorms are not trained
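If you want to see which norm layers end up frozen, here's a minimal sketch - it assumes a standard PyTorch / Hugging Face `model` object, and the helper name plus the name-matching keys are just placeholders:

```python
# Hedged sketch: list which layernorm weights would actually receive
# gradients during an attempted full finetune. Assumes `model` is a
# PyTorch / Hugging Face model; the function name is made up.
def report_layernorm_trainability(model):
    for name, param in model.named_parameters():
        # Match common layernorm naming conventions (layernorm / layer_norm / ln_ / .norm)
        if any(key in name.lower() for key in ("layernorm", "layer_norm", "ln_", ".norm")):
            print(f"{name}: requires_grad={param.requires_grad}")

report_layernorm_trainability(model)
```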
Apologies this slipped by me! Extremely sorry! Ye unfortunately Windows is a bit of an issue to support (due to Triton). See https://github.com/unslothai/unsloth/issues/210, which might be helpful
Very cool @Jiar !! Will check that out!
Yes, overly long contexts will cause OOMs. According to our blog (https://unsloth.ai/blog/llama3), the max context length on a Tesla T4 (16GB) is around 10K tokens
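One way to stay under that ceiling is to cap the sequence length at load time. A rough sketch, assuming the Unsloth `FastLanguageModel` API - the model name and the 8192 cap are just example values, not a recommendation from the blog:

```python
# Hedged sketch: cap the context length when loading so a 16GB T4 doesn't OOM.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "unsloth/llama-3-8b-bnb-4bit",  # example model name
    max_seq_length = 8192,   # keep below the ~10K ceiling for a 16GB T4
    load_in_4bit = True,
)
```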
You need to change `merged_4bit_forced` to `merged_16bit`
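A quick sketch of the corrected call, assuming a finetuned `model` / `tokenizer` pair and a placeholder output path:

```python
# Instead of save_method = "merged_4bit_forced", merge the weights to 16-bit.
model.save_pretrained_merged(
    "output_dir",            # placeholder path
    tokenizer,
    save_method = "merged_16bit",
)
```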
Ye AWQ is nice :) We might be adding an AWQ option for exporting!
@subhamiitk Use `model.save_pretrained_merged("location", tokenizer, save_method = "merged_16bit",)` then use vLLM
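After the merged 16-bit export, the saved folder can be loaded directly by vLLM. A minimal sketch - "location" is whatever path was passed to `save_pretrained_merged`, and the prompt / sampling values are arbitrary:

```python
# Hedged sketch: serve the merged 16-bit checkpoint with vLLM.
from vllm import LLM, SamplingParams

llm = LLM(model = "location")  # same directory used in save_pretrained_merged
outputs = llm.generate(["Hello!"], SamplingParams(max_tokens = 64))
print(outputs[0].outputs[0].text)
```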
So sorry on the delay - just relocated to SF - exporting to AWQ is on the roadmap for now - directly finetuning AWQ could work as well, but will...
I'll see what I can do!