Results 5 comments of charger

> I don't understand the issue. Do you just need to run inference? If that is the case, DS-inference is compatible with all Huggingface models. Hello, I have the same...

By the way: model structure: gpt model link: https://huggingface.co/TsinghuaAI/CPM-Generate I want to train the model with 4 pipeline parallel and deepspeed.

> @AnShengqiang Its non-trivial to convert models for training. People are actively exploring this as far as I know. This repository saves something called a universal checkpoint which can be...

这么大的模型,用ollama 4bit可能都得120G内存以上。 有没有勇士尝试过?