MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Results 480 MiniCPM-V issues

I installed PyTorch with `conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=12.1 -c pytorch -c nvidia`, but an error message showed up: `RuntimeError: cutlassF: no kernel found to launch!`. It indicates that my...

![1713856321333](https://github.com/OpenBMB/MiniCPM-V/assets/26639421/64c9b538-1d49-4c94-bc77-6c81b53a611e) ![1713856452853](https://github.com/OpenBMB/MiniCPM-V/assets/26639421/b6bb0525-5d78-4960-8050-f1a81791c3de) PyTorch version: 2.1.2+cu121. Code that raises the error: ![image](https://github.com/OpenBMB/MiniCPM-V/assets/26639421/474ebef2-2532-47ee-a3be-d70d3bc00f5a)

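When I iterate over images, the model keeps repeating the output from earlier images. Is this because the earlier outputs are cached when running prediction on multiple images? And when I run inference on each image separately, the Chinese OCR quality is very poor. Is Chinese OCR not supported?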

Apologies for asking this in the issues tab. Is this model capable of chatting without image input? I see in the benchmarks you have some models that can take both...

How much GPU memory is needed to deploy this on a consumer-grade computer?

When will the code for the demo be open-sourced? In the meantime, could you describe the framework and the implementation approach?

After downloading the OmniLMM repo from Hugging Face, I changed the model path in the code to the local folder path, but it ends with an error

MLCChat failed Stack trace: org.apache.tvm.Base$TVMError: TVMError: OpenCL build error for device=0x78b138eaa8 Error: CL_OUT_OF_HOST_MEMORY Pass Stack trace: File "/Users/acha/Desktop/mlc-MiniCPM/3rdparty/tvm/src/runtime/library_module.cc", line 78 at org.apache.tvm.Base.checkCall(Base.java:173) at org.apache.tvm.Function.invoke(Function.java:130) at com.modelbest.mlcllm.ChatModule.image(ChatModule.java:64) at com.modelbest.minicpm.AppViewModel$ChatState$requestImage$f$1$1.invoke(AppViewModel.kt:637) at com.modelbest.minicpm.AppViewModel$ChatState$requestImage$f$1$1.invoke(AppViewModel.kt:637)...

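Could you provide an openai_api interface so that I can call the model as a service? vLLM support would be even better. I heard it might be available this week?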