Benjamin Bossan comments

Results 1183 comments of


                                            Benjamin Bossan

[Feature request] Mechanism to Disable Accerate in runtime

Thanks for elaborating Thiago. I agree that there can be scenarios where a flag to turn off accelerate would be useful. My feeling is, however, that in the vast majority...

Model accuracy drops when upgrading from accelerate==0.25.0 to 0.26.0 or 0.27.2

This should not happen, thanks for reporting this issue. > but noticed that the accuracy drops by ~4.5% when doing so Is this train or validation/test accuracy? Are those absolute...

Different versions seem to have an impact on the results

Thanks for the details. Unfortunately, this is not enough for us to pinpoint the exact issue could be. I have 2 suggestions: - Take the usual steps to mitigate NAN...

Different versions seem to have an impact on the results

@skylooop Without the full code and data, we cannot start debugging this issue. If you cannot share, can you identify the exact version of PEFT at which training starts to...

Different versions seem to have an impact on the results

Thanks for the pointer. I tried replicating your issue by using the [fine-tuning script](https://github.com/microsoft/TransformerCompression/tree/main?tab=readme-ov-file#recovery-fine-tuning) but ran out of memory. I tried lowering some hyper-params but still no luck, and when...

Different versions seem to have an impact on the results

> I found the same problem when trying to peft fine-tune CodeLLama-7B (using LlamaForSequenceClassification), the Loss is always 0 during the fine-tuning. Is that also with the MS compression library?...

Different versions seem to have an impact on the results

Thanks for providing more details. I did a quick check on the diff between v0.6.0 and v0.7.0 but at first glance, nothing came up that could explain the difference that...

Different versions seem to have an impact on the results

@sssszh Thanks for providing the script. I didn't have access to your data, so I requested it. Would this also work with another dataset, like one of the datasets on...

Different versions seem to have an impact on the results

> This dataset is also available on huggingface: `claudios/code_x_glue_devign`, but you may need to modify the code in the Dataset section, because the format of the data is not the...

Different versions seem to have an impact on the results

Thanks for providing access, I could finally run the example. First of all, I had to make a few small modifications to the script because of OOM errors, but I...