Daniel Han

Results: 983 comments of Daniel Han

@kiddyboots216 Ohh no, so what we're doing is correct. It seems like you're not using mixed precision for training (fp16 = True, bf16 = True)

@kiddyboots216 For training, dY is in bfloat16. LoRA A and B must be in float32. This is for mixed precision training. The code you provided will not run at all,...
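A minimal stdlib-only sketch of why the LoRA A and B matrices are kept in float32 under mixed precision (using IEEE float16 via `struct` to stand in for the 16-bit types, since pure Python has no bfloat16; this is an illustration of the underflow problem, not Unsloth's actual code):

```python
import struct

def round_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack('e', struct.pack('e', x))[0]

lr_step = 1e-4  # a typical small optimizer update

# Weight stored in half precision: near 1.0 the spacing between
# representable fp16 values is 2**-10 ~= 0.000977, so the step
# rounds away entirely and the weight never moves.
w_half = round_fp16(round_fp16(1.0) + lr_step)
print(w_half)    # 1.0 - the update was lost

# Float32 master copy (what the LoRA adapters use): the step survives.
w_master = 1.0 + lr_step
print(w_master)  # 1.0001
```

This is the standard mixed-precision recipe: activations and gradients (dY) flow in 16-bit, while the small trainable adapter weights keep a full-precision master copy so tiny updates are not rounded to zero.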

@BrunoBSM Wait so does normal Unsloth work on V100s? T4s work for now.

@world2vec Apologies on the delay - this got lost! When dropout = 0, Unsloth will call the...

Ye so dropout = 0 is optimized, but anything else is not - it still runs correctly.

@world2vec sadly I'm unsure why your RTX 4090 isn't working sorry...
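A toy sketch of why dropout = 0 is the special, optimizable case (Unsloth's real fast path fuses GPU kernels; this hypothetical `lora_dropout` just shows the reasoning):

```python
import random

def lora_dropout(x: list[float], p: float) -> list[float]:
    """Illustrative inverted dropout. When p == 0 the layer is exactly
    the identity, so mask generation and rescaling can be skipped and
    the surrounding matmuls can be fused into one kernel. Any p > 0
    forces the general path: a fresh random mask plus a 1/(1-p) rescale
    every forward pass, which blocks that fusion."""
    if p == 0.0:
        return x  # fast path: dropout is a no-op
    scale = 1.0 / (1.0 - p)
    return [xi * scale if random.random() >= p else 0.0 for xi in x]
```

With p > 0 the output is stochastic, so the result is still correct, just served by the slower unfused path.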

@gotzmann Unsloth itself can load adapters, but I'm unsure about Llama-Factory - @hiyouga might be able to help

@hellangleZ Apologies on the delay - I'm assuming removing `dataclasses` could help?

@erwe324 The current OSS here won't exactly work for multi-GPU, but Llama Factory's Unsloth integration does have a slow, somewhat functional multi-GPU offering. Accelerate only partially works, and...

I added this into nightly for now! Will do some final changes! Thanks @mahiatlinux !