eric8607242
eric8607242
Create One shot class Add some new NAS paper published in CVPR2019, ICCV 2019 and Arxiv
#2361 Hi, I try to dig into the root cause of this issue. After digging, I found the reason to raise the checkpoint key missing issue for GPT2 is that...
### 🐛 Describe the bug Hello, I currently fine-tune the Huggingface GPT2 with ColossalAI. I follow the example with GeminiDPP and ZeroOptimizer. However, I found that there are some keys...
### 🐛 Describe the bug Hello there, Thanks for this awesome project. I am currently training a GPT2 model for contrastive learning InfoNCE loss using tensor parallelism. To implement the...
Hello, Thanks for the amazing repo! I'm wondering what is the difference between "finetune" and "final-finetune" in `/example`. Do we train the model and the mask score in the finetune...