InternEvo
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Thanks for your contribution; we appreciate it a lot. The following instructions will help keep your pull request healthy and make it easier to get feedback. If you do not understand...
When tp size > kv head_number, copy the kv heads
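The idea in this title can be illustrated with a minimal sketch. It is not InternEvo's implementation: the function name `kv_heads_for_rank` and the contiguous rank-to-head mapping are assumptions made for illustration. The sketch only shows that when the tensor-parallel (tp) size exceeds the number of kv heads, several ranks end up holding a copy of the same kv head.

```python
from typing import List


def kv_heads_for_rank(tp_rank: int, tp_size: int, num_kv_heads: int) -> List[int]:
    """Return the kv head indices held by one tensor-parallel rank.

    Hypothetical helper for illustration only; the mapping is an assumption,
    not taken from the InternEvo source.
    """
    if not 0 <= tp_rank < tp_size:
        raise ValueError("tp_rank must be in [0, tp_size)")

    if tp_size <= num_kv_heads:
        # Usual case: kv heads are split evenly, no copying needed.
        if num_kv_heads % tp_size != 0:
            raise ValueError("num_kv_heads must be divisible by tp_size")
        per_rank = num_kv_heads // tp_size
        return list(range(tp_rank * per_rank, (tp_rank + 1) * per_rank))

    # tp size > kv head number: each kv head is copied to a group of ranks.
    if tp_size % num_kv_heads != 0:
        raise ValueError("tp_size must be divisible by num_kv_heads")
    ranks_per_head = tp_size // num_kv_heads
    return [tp_rank // ranks_per_head]


if __name__ == "__main__":
    # With 8 ranks and 2 kv heads, ranks 0-3 share head 0 and ranks 4-7 share head 1.
    for rank in range(8):
        print(rank, kv_heads_for_rank(rank, tp_size=8, num_kv_heads=2))
```

In this sketch each rank still holds at least one kv head, so attention can be computed locally, at the cost of storing duplicated kv weights on the ranks that share a head.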
Add docs for 2d-attention
## Motivation

1. Fix CI timeout for https://github.com/InternLM/InternEvo/issues/342 **(Completed)**
2. Refine implementation of megatron and mocked dataloader **(Completed)**

## Modification

- `internlm/train/pipeline.py`
- `internlm/data/*`

## BC-breaking (Optional)

None

## Use...