Guangyao Zhang issues

Results: 7 issues by Guangyao Zhang

[Draft] Support PyTorch 2.2.2

## 📌 Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...

[ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM

## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A concise...

[FEATURE]: Support SP+PP in Llama etc.

### Describe the feature Currently, most models, such as Llama, do not support SP together with PP. Please add support for this.

enhancement

shardformer

[FEATURE]: Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM

### Describe the feature Please add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM

enhancement

shardformer

[BUG]: Pytest with a specific config failed after PR #5868

### Is there an existing issue for this bug? - [X] I have searched the existing issues ### 🐛 Describe the bug Main repo test_shard_llama fails for these configs: ```...

bug

shardformer

[BUG]: Pipeline Parallelism fails when input shape varies

### Is there an existing issue for this bug? - [X] I have searched the existing issues ### 🐛 Describe the bug Pipeline parallelism fails when input size is different....

bug

shardformer

[BUG]: Shardformer FP8 communication training accuracy degradation

### Is there an existing issue for this bug? - [X] I have searched the existing issues ### 🐛 Describe the bug TP+Split Gather(Acc) 4GPU Original FP16 Model: 0.755 FP8...

bug