Benjamin Bossan comments

Results 1181 comments of


                                            Benjamin Bossan

[Question/Bug] How to safely continue LoRA fine-tuning under DeepSpeed ZeRO-3 (multi-stage training with modules_to_save)

Thanks for investigating further. `modules_to_save` is intended to work with DeepSpeed, so if it doesn't, we should try to fix it. Again, if you could provide more details as mentioned...

[Question/Bug] How to safely continue LoRA fine-tuning under DeepSpeed ZeRO-3 (multi-stage training with modules_to_save)

> Yes, that’s right. Stage 1 trained successfully (the main issue there was with resize_token_embeddings, which I fixed manually). > However, I still face GPU OOM issues whenever modules_to_save is...

[FEAT] Add LoReTTA

@mbaddar1 Just asking if you still plan on working on this.

[FEAT] Add LoReTTA

Hi, no worries, there is no time pressure. I just wanted to ping you, in case you simply forgot about this. Take all the time you need.

Weight LoRA

Thanks for this PR that proposes to add Weight LoRA to PEFT. Do you have a link to the full paper? I only skimmed the implementation, but from what I...

Weight LoRA

> These constraints should not be taken into account in the WeightLoRA method, but in the implementation of the optimizer step (e.g. SGD with projection). In our paper, we provide...

Weight LoRA

_not stale_

Comparison of Different Fine-Tuning Techniques for Conversational AI

Thanks for coming up with this proposal. Indeed, this is something we have on our backlog for a long time. As you can imagine, providing objective and useful information on...

Comparison of Different Fine-Tuning Techniques for Conversational AI

> I would be interested to contribute as well Thanks for the offer. As mentioned, as a first step, we could use some help with updating the "blurbs" of the...

Comparison of Different Fine-Tuning Techniques for Conversational AI

> How about having a sample fine-tuning script for each method and comparing different approaches for different tasks? I'm not 100% sure what you mean, but let's start with a...