Daniel Han comments

Results 781 comments of


                                            Daniel Han

Please include a tutorial on tinyllama for chat conversation with custom dataset

@gracehubai I just added chat templates! It supports Zephyr (the one TInyLlama uses), ChatML, Vicuna etc - https://colab.research.google.com/drive/1Aau3lgPzeZKQ-98h69CCu1UJcvIBLmy2?usp=sharing

error when installing unsloth in a docker image

@abstrcode You might need to restart the Runpod instance, and just run `pip install "unsloth[cu121] @ git+https://github.com/unslothai/unsloth.git"` without doing anything else. If you're using A100s or Ampere GPUs, `pip install...

error when installing unsloth in a docker image

@abstrcode I don' think the GPU matters - V100 is supported. It's mainly bitsandbytes being unable to be installed. If you have conda, try using the conda method to install...

DoRA Support

@RonanKMcGovern Yes! We had a chat on our Discord server about this! It looks very promising, and it removes lora_alpha (finally!!!) 1 less hyperparameter! Love how it looks very similar...

Mistral inputs_embeds without causal mask raises AttributeError

@namednil Oh nice catch on the bug - will solve this

Support for Command-R model from Cohere

@gottlike Will take a look!

Support for Command-R model from Cohere

Hmmm Ill see what I can do but hmmm

please add support for deberta model

Hmm an issue is Deberta is an encoder decoder BERT type model right? I might add support for other BERT type models in a future release :)

Anyone wanna attempt tweaking unsloth for Mamba-2.8b?

More than happy to welcome contributions for Mamba!!

Faster Inference & Training Roadmap

@jeromeku I will get to reviewing GPTQ - sorry on the delay!! * The VRAM reductions are from Unsloth's optims :) Ie Triton kernels, making memory copies go away, FA2,...