Zeeshan Patel issues

Repositories
Issues
Comments

Results 4 issues of


                                            Zeeshan Patel

Training loss extremely noisy during fine-tuning and randomly goes to 0

I'm trying to fine-tune the 6.7B model on my own code dataset. I am running a multinode training with fp32 precision on NVIDIA Tesla V100 GPUs with DeepSpeed ZeRO Stage...

FSDP + PEFT

# What does this PR do ? Enables FSDP when using PEFT during training/fine-tuning. **Collection**: [NLP] # Changelog - Add specific line by line info of high level changes in...

NLP

fixed typo in GPT training explanation

# What does this PR do ? Fixed a small typo in the GPT model training documentation. **Collection**: docs # Changelog - Fixed spelling mistake on line 38. # Jenkins...

NLP

dit training diagrams

# What does this PR do ? Adds diagrams explaining DiT training pipeline. **Collection**: [Note which collection this PR will affect] # Changelog - adds diagrams to explain mixed image-video...

Run CICD