kevin

Results 7 issues of kevin

### 需求描述 Feature Description tensor.numpy()执行大量数据从GPU拷贝到CPU速度缓慢,5M数据执行tensor.numpy()耗费了1.4s,完全不可接受!啥原因呢? ### 替代实现 Alternatives _No response_

status/new-issue
type/feature-request

### 🐛 Describe the bug ChatGPT: why total reward is reward = r - kl_coef * kl, not total_reward=r + gamma * critic(next_states)? ![image](https://user-images.githubusercontent.com/11996971/220237581-94ddb213-576f-431a-a5da-671285992a58.png) ### Environment _No response_

bug

where is the train file? I want to learn how to train.

when I run pretrain.py of llama-7b model, it has exception below (not user zero3): ExceptionException: : Current loss scale already at minimum - cannot decrease scale anymore. Exiting run.Current loss...

### Is your feature request related to a problem? Please describe. 为啥我的4090显卡跑chatglm-6b的GPU只有20%? 如何提升? ### Solutions who can help me? ### Additional context _No response_

chatglm-6b(chatglm2-6b) is a very popular Chinese LLM. Do you have a plan?

new model

when I run pretrain.py of llama-7b model, it has exception below: ExceptionException: : Current loss scale already at minimum - cannot decrease scale anymore. Exiting run.Current loss scale already at...