Zeeshan Patel

Results 4 issues of Zeeshan Patel

I'm trying to fine-tune the 6.7B model on my own code dataset. I am running a multinode training with fp32 precision on NVIDIA Tesla V100 GPUs with DeepSpeed ZeRO Stage...

# What does this PR do ? Enables FSDP when using PEFT during training/fine-tuning. **Collection**: [NLP] # Changelog - Add specific line by line info of high level changes in...

NLP

# What does this PR do ? Fixed a small typo in the GPT model training documentation. **Collection**: docs # Changelog - Fixed spelling mistake on line 38. # Jenkins...

NLP

# What does this PR do ? Adds diagrams explaining DiT training pipeline. **Collection**: [Note which collection this PR will affect] # Changelog - adds diagrams to explain mixed image-video...

Run CICD