MiniCPM-V issues

What's the learning rate scheduler when training the perceiver resampler from scratch?

1

Any suggestions for training steps and hyperparameter settings?

lucasjinreal

question

web_demo.py needs 10 GB of GPU memory. Why does vLLM need 23 GB when started in API mode?

1

web_demo.py needs 10 GB of GPU memory. Why does vLLM need 23 GB when started in API mode? The demo is run with:
python web_demo.py --device cuda --dtype bf16
The vLLM command is:
/services/srv/MiniCPM-vllm/venv/bin/python -m vllm.entrypoints.openai.api_server --model /services/srv/MiniCPM-V/openbmb/MiniCPM-V-2/ --trust-remote-code
![5fc1c797807f86d12cc37208620d731](https://github.com/OpenBMB/MiniCPM-V/assets/11950/5c79bb50-37d8-4b6a-8e72-2ec0cb19c088)

triumph
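
A possible explanation for the gap, offered as an assumption rather than a confirmed diagnosis of this issue: vLLM reserves a fixed fraction of total GPU memory up front for weights and KV cache, controlled by `--gpu-memory-utilization` (default 0.9), so its footprint follows the size of the card rather than the size of the model. The sketch below only does that arithmetic. The 24 GiB card size is hypothetical and the function name is illustrative, not part of vLLM.

```python
def vllm_reserved_gib(total_gib: float, gpu_memory_utilization: float = 0.9) -> float:
    """Estimate how much GPU memory vLLM budgets for itself.

    Assumption: the budget is simply total memory times the utilization
    fraction. This ignores CUDA context overhead and other processes.
    """
    if not 0.0 < gpu_memory_utilization <= 1.0:
        raise ValueError("gpu_memory_utilization must be in (0, 1]")
    if total_gib <= 0:
        raise ValueError("total_gib must be positive")
    return total_gib * gpu_memory_utilization


if __name__ == "__main__":
    card_gib = 24.0  # hypothetical card size, not stated in the issue
    for fraction in (0.9, 0.5):
        budget = vllm_reserved_gib(card_gib, fraction)
        print(f"utilization={fraction}: about {budget:.1f} GiB budgeted")
```

If this is the cause, passing a lower value such as `--gpu-memory-utilization 0.5` to the API server should shrink the reservation, at the cost of less room for the KV cache.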

Question

1

Why are so many fields missed when recognizing an ordinary invoice? Does the prompt need particular wording?

intjun

Any suggestions for a demo of text recognition and extraction with MiniCPM-v2?

1

ChenCong7375

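Streaming output

How can I get streaming output from the model in the test examples on the Git homepage? Or, for the inference example on the Hugging Face page, how can the output be streamed one character at a time instead of the whole sentence at once?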

sssssshf

Is Ollama supported?

36

catzqaz

feature

llamacpp

tech report or docs

Hi, thanks for your awesome work! When will you release the tech report or docs for the current work, such as MiniCPM-Llama3-V 2.5 and MiniCPM-V 2.0? Thanks.

bpwl0121

GGUF versions don't seem to run on llama.cpp (through LocalAI)

12

First of all, thank you for your impressive work! I've found that your model fares better than the latest LLaVA (13B) on some of my tasks. I've tried running the...

naifmeh

llamacpp

Inference time problem with int4 and bfloat16 (urgent)

2

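I tested the two models MiniCPM-2B-dpo-bf16 and MiniCPM-dpo-Int4 with the code below. Inference takes a little over 3 seconds with MiniCPM-2B-dpo-bf16 but more than 10 seconds with MiniCPM-dpo-Int4. What is the reason? ![image](https://github.com/OpenBMB/MiniCPM-V/assets/77612906/88b36241-c1b5-4826-b251-946161658f9d)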

githublsk

help wanted

inference

RuntimeError: shape mismatch: value tensor of shape [1037] cannot be broadcast to indexing result of shape [1036]

1

File "lib/python3.9/site-packages/transformers/models/idefics2/modeling_idefics2.py", line 190, in forward
    position_ids[batch_idx][p_attn_mask.view(-1).cpu()] = pos_ids
RuntimeError: shape mismatch: value tensor of shape [1037] cannot be broadcast to indexing result of shape [1036]

hujunchao
