Shrey Gupta
Results: 2 comments of Shrey Gupta
No, I just leave the strategy set to "auto", which essentially means that I am not using FSDP. Also, I tried using both a single device and multiple devices, but the model weights...
I tried using FSDP with eight 80GB A100 GPUs, but it still gets stuck when running the generate/lora.py script. The commands I am using are: 1. python generate/lora.py --lora_path...