Vincent Min
Vincent Min
@BugReporterZ Could you explain the reasoning for why group_by_length may be causing this issue?
@ritabratamaiti Yes, QLoRA is supported by SFTTrainer. You can use this example script and set `load_in_4bit=True` and `use_peft=True`. https://github.com/lvwerra/trl/blob/main/examples/scripts/sft_trainer.py See this blog for more details: https://huggingface.co/blog/4bit-transformers-bitsandbytes @BugReporterZ thanks for the...
Hi @pacman100 , could you explain why the code is structured such that you must provide the `base_model`? It seems to me that the `base_model` is already present in the...