MiniCPM issues

Question about the tech report

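### Description

The tech report includes this experiment. Have you also compared the following settings?

- A0: annealing on pre-training data
- B0: annealing on pre-training data + SFT data
- A1: annealing on pre-training data + 4B SFT
- B1: annealing on pre-training data + SFT data -> 4B SFT

### Case Explanation

_No response_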

airlsyn

badcase

Error when running inference with vLLM

When I ran inference with the inference_vllm.py script you provide, the following error occurred:

ERROR 08-19 09:33:57 pynccl.py:53] Failed to load NCCL library from libnccl.so.2 .It is expected if you are not running on NVIDIA/AMD GPUs.Otherwise please set the environment variable VLLM_NCCL_SO_PATH to...

lifelsl
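
The log above names the environment variable `VLLM_NCCL_SO_PATH`, but the message is truncated. A minimal sketch, assuming the variable is meant to hold the full path of the NCCL shared library: it looks for `libnccl.so.2` in a list of candidate directories and exports the first match. The helper names and the directories in the usage note are hypothetical and are not part of vLLM or MiniCPM.

```python
import os
from pathlib import Path


def find_nccl_library(search_dirs, filename="libnccl.so.2"):
    """Return the first existing path to `filename` in `search_dirs`, or None."""
    for directory in search_dirs:
        candidate = Path(directory) / filename
        if candidate.is_file():
            return str(candidate)
    return None


def export_nccl_path(search_dirs, env=os.environ):
    """Set VLLM_NCCL_SO_PATH in `env` if the library is found; return the path."""
    path = find_nccl_library(search_dirs)
    if path is not None:
        # Assumption: the variable should point at the library file itself.
        env["VLLM_NCCL_SO_PATH"] = path
    return path
```

It could be called before the inference script starts, for example `export_nccl_path(["/usr/lib/x86_64-linux-gnu", "/usr/local/cuda/lib64"])`, where both directories are only guesses at common install locations.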

[Bad Case]: Multimodal MiniCPM-V 2.0 inference with transformers raises an error

5

### Description

```python
import torch
from PIL import Image
from transformers import AutoModel, AutoTokenizer

model = AutoModel.from_pretrained('./model/OpenBMB/MiniCPM-V-2', trust_remote_code=True)
model = model.to(device='cuda')
tokenizer = AutoTokenizer.from_pretrained('./model/OpenBMB/MiniCPM-V-2', trust_remote_code=True)
model.eval()

image = Image.open('./img/tmp.jpg').convert('RGB')
...
```

wangyao123456a

badcase

[Bad Case]: Multimodal MiniCPM-V inference raises an error

2

### Description

**Code:**

```python
import torch
from PIL import Image
from transformers import AutoModel, AutoTokenizer

model = AutoModel.from_pretrained('openbmb/MiniCPM-V', trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained('openbmb/MiniCPM-V', trust_remote_code=True)
model.eval().cuda()

image = Image.open('xx.jpg').convert('RGB')
question =...
```

c122-ode

badcase

vLLM raises an error when using a LoRA fine-tuned model

1

### Is there an existing issue?

- [X] I have searched, and there is no existing issue.

### Describe the bug /...

ngz-sun

bug

triage

[Bug]: Error raised: _pickle.UnpicklingError: Weights only load failed. Re-running `torch.load` with `weights_only` set to `False` will likely succeed, but it can result in arbitrary code execution. Do it only if you got the file from a trusted source.

1

### Is there an existing issue?

- [X] I have searched, and there is no existing issue.

### Describe the bug /...

weiruijinglu

bug

triage

[Bad Case]: Android deployment problem

1

### Description

When building llama.cpp for Android, the compiler reports that the header file 'execinfo.h' is missing.

### Case Explanation

_No response_

No3cat

badcase

[Bug]: Why is repeat_kv not needed in Flash Attention 2?

### Is there an existing issue?

- [X] I have searched, and there is no existing issue.

### Describe the bug /...

huyiwen

bug

triage
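
For context on the repeat_kv question above, here is a minimal sketch of what such a helper usually does in grouped-query attention, using plain Python lists in place of tensors. It assumes the common convention that each key/value head is copied `n_rep` times with the copies kept adjacent; `kv_head_for_query` is a hypothetical name for the equivalent index mapping. The flash-attn documentation says its kernels accept K/V with fewer heads than Q, which would make the explicit copy unnecessary, but whether that is the reason in MiniCPM's code is an assumption.

```python
def repeat_kv(kv_heads, n_rep):
    """Copy each key/value head n_rep times, keeping the copies adjacent."""
    return [head for head in kv_heads for _ in range(n_rep)]


def kv_head_for_query(query_head, n_rep):
    """Index of the shared key/value head that a query head attends with."""
    return query_head // n_rep


kv_heads = ["kv0", "kv1"]  # 2 key/value heads
n_rep = 3                  # 6 query heads share them, 3 per KV head
expanded = repeat_kv(kv_heads, n_rep)

# Looking up the shared head by index gives the same result as the copy.
same = all(
    expanded[h] == kv_heads[kv_head_for_query(h, n_rep)]
    for h in range(len(kv_heads) * n_rep)
)
```

A kernel that performs this lookup internally reads the same keys and values as one given the expanded copies, without materialising them.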

Entering long text on a phone reports a dimension mismatch error

1

The input was about 1134 characters. The error message is as follows:

MLCChat failed Stack trace: org.apache.tvm.Base$TVMError: TVMError: Check failed: value->shape[0] shape[0]

rudaoshi

[Bad Case]: Cannot reproduce the experiment showing a consistent optimal learning rate across scaled model architectures

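### Description

Incorrect result: I cannot reproduce the finding that the optimal learning rate stays consistent across model sizes when scaling the MiniCPM model architecture.

### Case Explanation

Hello authors, MiniCPM is excellent work. I ran model-size scaling experiments on top of the open-source MiniCPM model architecture, but I could not reproduce the conclusion from the blog, shown in the figure below, that the optimal learning rate is consistent across sizes.

![image](https://github.com/user-attachments/assets/91033fdf-25d1-404a-831e-47b8bec1e741)

In the technical report I found the overall scaling parameters of the model, and I located most of the corresponding scaling code, but I could not find the parts that handle parameter initialization and the learning rate. How are these implemented? Thank you very much for your reply.

![image](https://github.com/user-attachments/assets/8d7f57cf-d2f7-42ea-9086-ef3405e96e2f)

In the code, the initialization looks like the standard one.

![image](https://github.com/user-attachments/assets/5e89dcd0-7a15-4dc6-8d7c-6e53cc9d259a)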

xiaofengShi

badcase
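
The issue above asks how initialization and learning rate are scaled with model width. The following is a minimal sketch of one common muP-style convention, not MiniCPM's confirmed implementation: for 2-D weight matrices, the learning rate is divided by the width ratio and the initialization standard deviation by its square root, while other parameters keep the base values. The function name and the parameters `base_width`, `base_lr`, and `base_std` are hypothetical.

```python
import math


def scaled_hparams(width, base_width, base_lr, base_std):
    """Per-group learning rate and init std under a muP-style width rule."""
    ratio = width / base_width
    return {
        # 2-D weight matrices: shrink lr and init std as the model widens.
        "matrix_lr": base_lr / ratio,
        "matrix_init_std": base_std / math.sqrt(ratio),
        # Assumption: all other parameters keep the base settings.
        "other_lr": base_lr,
        "other_init_std": base_std,
    }


# A model 4x wider than the base model on which base_lr was tuned.
hparams = scaled_hparams(width=1024, base_width=256, base_lr=0.01, base_std=0.1)
```

Under this rule, a learning rate tuned at the base width is intended to transfer to wider models, which is the kind of consistency the reproduction attempt is checking.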
