Shrey Gupta

Results 2 comments of Shrey Gupta

No, I just leave the strategy to "auto" which essentially means that I am not using "FSDP". Also, I tried using both 1 and multiple devices but the model weights...

I tried using FSDP and using 8 80GB A100 GPUs but still, it gets stuck while using the generate/lora.py script. The commands I am using are: 1. python generate/lora.py --lora_path...