Jiarui Fang(方佳瑞)

Results: 220 comments of Jiarui Fang (方佳瑞)

> Haha, competitors, don't badmouth us. There are already many teams doing inference for decoder-only LLMs now, and the challenges are quite different from the encoder case. Two years ago, when this issue was posted, "large model" did not yet mean decoder-only.

Currently, a config file is not necessary. See the latest ColossalAI GPT example: https://github.com/hpcaitech/ColossalAI/blob/main/examples/language/gpt/README.md

> Please run pre-commit on your changed files.

I have already rerun it.

You have to map your local directory into the container with Docker's `-v` flag to do volume mapping. For Docker volume mapping usage, please refer to https://www.geeksforgeeks.org/mounting-a-volume-inside-docker-container/
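A minimal sketch of the `-v` usage described above; the host path, container path, and image tag below are placeholders for illustration, not taken from the original thread:

```shell
# Mount a local directory into the container so scripts inside Docker
# can read and write files on the host.
# Format: -v <host_path>:<container_path>
docker run -it \
  -v /home/user/workspace:/workspace \
  hpcaitech/colossalai:latest \
  /bin/bash
```

Anything written to `/workspace` inside the container then appears in `/home/user/workspace` on the host, and vice versa.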

Hi, can you tell me how you **use a GPT model trained with ColossalAI in huggingface/transformers**? Pointing out which example your implementation is based on would be helpful.

Hello, how long did you train the model? How many epochs or iterations? Keep in mind that training a diffusion model from scratch converges very slowly.

Actually, I am not an expert on SP and PP. I can help you contact the author of the SP paper. @FrankLeeeee, can you help with this project?

You can fix the bug by installing the correct PL version. Closing the issue.

Thanks for trying our new feature. This is a known bug and we are working on a fix; I will reply in this issue once it is fixed.

That's right, the `cpu` placement policy is more stable than `auto`. I suggest using the `cpu` policy for large models; we will investigate the problem in the `auto` implementation. You can refer to the benchmark...
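A hedged config fragment showing where the placement policy is typically set; this assumes the ZeRO/Gemini config style used in early ColossalAI releases (around v0.1.x), and the exact field names may differ in your version, so treat it as a sketch rather than the definitive API:

```python
# Hypothetical ColossalAI config fragment (field names assumed, not verified
# against the issue's ColossalAI version): switch the tensor placement policy
# from 'auto' to 'cpu' to keep model states on the CPU, which the comment
# above reports as more stable for large models.
zero = dict(
    model_config=dict(
        tensor_placement_policy='cpu',  # instead of 'auto'
    ),
    optimizer_config=dict(),
)
```

The trade-off is that the `cpu` policy always offloads to host memory, which is slower but avoids the instability seen with the adaptive `auto` placement.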