InternEvo issues

[Bug] the usage "x.is_cuda" is not recommended, since we have both GPU and NPU

1

### Describe the bug ### Environment Torch2.1 ### Other information _No response_

sunpengsdu

bug

[Bug] do not use torch.cuda.current_device() as device, since it only retures an int

3

### Describe the bug we have a lot of cases like following: ` data = torch.empty(partition_size, dtype=tensor.dtype, device=torch.cuda.current_device(), requires_grad=False) ` where we directly use device=torch.cuda.current_device(). However, it is not recommended...

sunpengsdu

bug

[Feature] CPU synchronization Problem

### Describe the feature Some CPU synchronizations block the GPU kernel, leading to bubbles between GPU kernels. It should be optimized in the future. 1. item() in rotary embedding. 2....

yingtongxiong

enhancement

[Feature] update readme with new version of dependency.

### Describe the feature update readme with new version of dependency. ### Will you implement it? - [ ] I would like to implement this feature and create a PR!

sunpengsdu

enhancement

[Feature] supporting hugging-face modeling python file

### Describe the feature supporting hugging-face modeling python file ### Will you implement it? - [ ] I would like to implement this feature and create a PR!

sunpengsdu

enhancement

[Feature] support sequence parallel in head layer and embedding layer

1

### Describe the feature they should not in separated parameter group ### Will you implement it? - [X] I would like to implement this feature and create a PR!

sunpengsdu

enhancement

在安装docker环境时，总是爆出这个错误，无法解决

1

### 描述该错误 make -f docker.Makefile BASE_OS=ubuntu20.04 时，总是会出一个错误，无法解决。发生在[intrenlm-dev 3/3] RUN git submodule update --init --recursive 这一步 ### 环境信息 ERROR: failed to solve: process "/bin/sh -c git submodule update --init --recursive &&...

zjtggssg

bug

add gradient sharding

Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily get feedback. If you do not understand...

ChenQiaoling00

[Feature] add internlm2-1.8b finetuning config

### Describe the feature internlm2-1.8b finetuning config is missing ### Will you implement it? - [X] I would like to implement this feature and create a PR!

00INDEX

enhancement

[Feature] rotary代码规范化

### Describe the feature 目前 rotary_embedding类的实现有大量历史遗留的代码，建议和https://github.com/Dao-AILab/flash-attention/blob/v2.2.1/flash_attn/layers/rotary.py 对齐，并且支持triton算子 ### Will you implement it? - [ ] I would like to implement this feature and create a PR!

sunpengsdu

enhancement

InternEvo
InternEvo copied to clipboard

Metadata

[Bug] the usage "x.is_cuda" is not recommended, since we have both GPU and NPU

[Bug] do not use torch.cuda.current_device() as device, since it only retures an int

[Feature] CPU synchronization Problem

[Feature] update readme with new version of dependency.

[Feature] supporting hugging-face modeling python file

[Feature] support sequence parallel in head layer and embedding layer

在安装docker环境时，总是爆出这个错误，无法解决

add gradient sharding

[Feature] add internlm2-1.8b finetuning config

[Feature] rotary代码规范化

← Metadata

Owner

Metadata

InternEvo InternEvo copied to clipboard

Metadata

← Metadata

Owner

Metadata

InternEvo
InternEvo copied to clipboard