Bilal Ghanem

Results 2 issues of Bilal Ghanem

Hi I need your help with loading the model. I see how you're doing that in the "converting..." file. But this is only for LORA models. What about full_shard models?...

Hi Thanks for your efforts folks! While I was testing the code on my own dataset, I found that when the length of the input is large (~4000), the loss...