MiniCPM
MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving a 5x+ speedup on typical end-side chips
### Feature request Hi, I was trying to get this up and running on macOS, and found [this](https://github.com/facebookresearch/xformers/issues/828). Since there is a dependency on `xformers`, I think MiniCPM...
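### Feature request I tried LoRA fine-tuning on a Colab T4 using the official example files, with the DPO-FP16 model, but it has been stuck at the initial memory stage. After 2 hours there is still no progress. Is the dataset too large? How long does it take before training starts?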
Mathmetics -> Mathematics
### Feature request / 功能建议 In many cases of deploying software, especially AI software, resolving potential errors or technical dependencies can be very time-consuming. It would be good if you...
### Feature request I am looking forward to the model's performance, and hope you can provide a way to deploy it with Ollama.
Add Redmi K50 test
### Feature request Your nice work has helped me a lot! I ran into some bugs when fine-tuning `openbmb/MiniCPM-2B-sft-bf16`; I guess they are caused by a version inconsistency of...
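### Feature request An out-of-the-box Docker environment would let more people try the model, and would make it easier to build on the model or combine it with other models. - [ ] HF - [ ] vLLM (use the latest CUDA, Torch, etc.; faster and faster) I made a simple wrapper based on NVIDIA's latest monthly container release (code merge...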