Benjamin Bossan comments

Results 1181 comments of


                                            Benjamin Bossan

[FEAT] Integrate LoRA-One into PEFT

> I think your understanding of the gradient approximation is right. Since LoRA-One needs to use the first-step gradients from full fine-tuning, we need the efficient approach from LoRA-GA to...

Fix autocast_adapter_dtype=False for quantized models

Thanks for proposing this fix @Aznix07. However, to apply this broadly requires a lot more changes. I have worked on those in #2893. I think this PR can be closed....

Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel

Just to be sure, this will be part of transformers v5?

Accelerate test fails: Exception: Could not find the transformer layer class to wrap in the model

Thanks for this detailed report. Debugging this type of issue can be really difficult, props for trying out a bunch of different things. At a first glance, I can't spot...

Accelerate test fails: Exception: Could not find the transformer layer class to wrap in the model

> This script seems to work (and wow it is much better than mine haha). I use LLama3-8B and everything can be saved locally, which is where my script fails,...

Accelerate test fails: Exception: Could not find the transformer layer class to wrap in the model

> I used the [bits and bytes guide](https://huggingface.co/docs/bitsandbytes/main/en/fsdp_qlora#training) which actually uses the [PEFT example repo](https://github.com/huggingface/peft/tree/main/examples/sft). > It seems that both guides work as they reference the same example in the...

Benjamin Bossan

[FEAT] Integrate LoRA-One into PEFT

Fix autocast_adapter_dtype=False for quantized models

Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel

Accelerate test fails: Exception: Could not find the transformer layer class to wrap in the model

Accelerate test fails: Exception: Could not find the transformer layer class to wrap in the model

Accelerate test fails: Exception: Could not find the transformer layer class to wrap in the model

GPU Memory Imbalance and OOM Errors During Training

GPU Memory Imbalance and OOM Errors During Training

GPU Memory Imbalance and OOM Errors During Training

GPU Memory Imbalance and OOM Errors During Training