Daniel Han
We shaved VRAM by a lot, so it's probably correct! Are you certain about QLoRA vs LoRA, i.e. `load_in_4bit = True / False`? That's a weird one - LoRA should use...
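For context, that flag is what switches Unsloth between QLoRA and plain LoRA; a minimal sketch of the two paths, assuming the standard `FastLanguageModel.from_pretrained` API (model names are placeholders):

```python
from unsloth import FastLanguageModel

# QLoRA: base weights quantized to 4-bit, lowest VRAM
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "unsloth/llama-3-8b-bnb-4bit",  # placeholder model
    max_seq_length = 2048,
    load_in_4bit = True,
)

# Plain LoRA: base weights kept in 16-bit - more VRAM, no quantization error
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "unsloth/llama-3-8b",  # placeholder model
    max_seq_length = 2048,
    load_in_4bit = False,
)
```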
Sorry for the delay!! @SauravMaheshkar That'll be very cool indeed! If you're interested in working on it - that would be awesome! Also https://github.com/unslothai/unsloth/pull/1035 Leo made a CLI v2 which...
Oh this looks fantastic, great work! Re using bigger batch sizes - does this mean that, if memory allows, imatrix should in fact be faster to process via PP (prompt processing)? I'll try...
Why are there deletions of dependencies?
Apologies, this is incorrect - sorry!
Yep, apologies - it's been much, much more complex than I initially thought. Some computers work, but some do not, so I'm trying to find a generic solution, so...
@davidjimenezphd Apologies for the delay! Our benchmarks are at https://huggingface.co/blog/unsloth-trl which might be helpful. Gemma 2 should enable Flash Attention 2 to speed things up (Unsloth should have provided a...
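For reference, a minimal sketch of enabling Flash Attention 2 when loading Gemma 2 through `transformers` (assumes the `flash-attn` package is installed; the model name is illustrative):

```python
import torch
from transformers import AutoModelForCausalLM

# Load Gemma 2 with Flash Attention 2 as the attention backend
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2-9b",
    torch_dtype=torch.bfloat16,               # FA2 requires fp16/bf16
    attn_implementation="flash_attention_2",  # standard transformers argument
)
```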
Oh, that's not good - hmm, I'll have to auto-check the memory usage before merging.
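A minimal sketch of what such an automated VRAM check might look like, using PyTorch's peak-memory counters (the helper name and the 5% threshold are hypothetical):

```python
import torch

def peak_vram_mb(fn, *args, **kwargs):
    """Run fn on the GPU and return (result, peak allocated VRAM in MB)."""
    torch.cuda.empty_cache()
    torch.cuda.reset_peak_memory_stats()
    result = fn(*args, **kwargs)
    torch.cuda.synchronize()
    peak = torch.cuda.max_memory_allocated() / 1024**2
    return result, peak

# Hypothetical CI-style usage: fail if a change regresses peak VRAM
# result, peak = peak_vram_mb(trainer.train)
# assert peak <= baseline_mb * 1.05, f"VRAM regression: {peak:.0f} MB"
```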
@vhiwase Apologies for the delay! Would you happen to know which dataset you were using? It's possible there are some weird out-of-bounds tokens causing errors.
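A minimal sketch of the kind of check that could surface this, assuming a Hugging Face-style tokenized dataset with an "input_ids" column (the function and variable names are assumptions):

```python
def find_out_of_bounds_tokens(dataset, vocab_size):
    """Report token IDs outside [0, vocab_size) in a tokenized dataset."""
    for i, example in enumerate(dataset):
        bad = [t for t in example["input_ids"] if t < 0 or t >= vocab_size]
        if bad:
            print(f"Row {i}: out-of-bounds token IDs {bad[:10]}")

# Hypothetical usage:
# find_out_of_bounds_tokens(tokenized_ds, model.config.vocab_size)
```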
@vhiwase No worries! Does this happen on other machines? Like in a Colab?