zhudongwork comments

Results 36 comments of


                                            zhudongwork

SAM HQ?

Yes, you can refer to https://github.com/dinglufe/segment-anything-cpp-wrapper.

thanks your work

Yes, you can refer to https://github.com/dinglufe/segment-anything-cpp-wrapper.

Yes, the input size of the model is 1024*1024. The images were processed using ResizeLongestSide-1024 in the original SAM project. In order to avoid tensorrt's dynamic inference of orig_img_size, the...

TensorRT 8.6 change onnx to trt error

Please try TensorRT-8.5

TensorRT 8.6 change onnx to trt error

This is confusing, I tried it on both TensorRT-8.6.1.6 and 8.5.3.1 and it works fine. This may be an environmental problem or hardware issue with your local tensorrt.

Result not correct

In main.cpp, the upper left corner of cv::Rect is used as the prompt point of sam. In your case1, the upper left corner belongs to the ground, which leads to...

自定义数据微调MiniCPM-Llama3-V-2_5报错

> 微调的数据集不含response字段嘛有的，这是数据的样式： ![image](https://github.com/modelscope/swift/assets/32665466/c3563faa-a8cc-4d4f-bbb1-eafcc0ff6af6)

自定义数据微调MiniCPM-Llama3-V-2_5报错

微调Cogvlm2的时候也是一样的问题，是我数据集的格式不对吗。这是运行命令 `CUDA_VISIBLE_DEVICES=7 swift sft --model_id_or_path /node6/models/ZhipuAI/cogvlm2-llama3-chinese-chat-19B --model_type cogvlm2-19b-chat --dataset ../ocr_100.json --batch_size 4 --val_dataset_sample 10` 微调数据 [ocr_100.json](https://github.com/user-attachments/files/15510763/ocr_100.json)

[BUG] 在尝试单独使用PdfLoader出现问题

> > > Please download the pdf parser related checkpoints in modelscope [https://www.modelscope.cn/models/netease-youdao/QAnything-pdf-parser/files] > > > > > > 好的十分感谢，另外是不是Qanything无法处理没有文本元素的pdf啊，我截了一张图进行解析，发现有报错。如果是这样那它里面的ocr的意义是什么呢，是解析表格？ > > 报错信息如下: LOCAL DATA PATH: c:\Users\Administrator\Desktop\QAnything-1.4.1\QANY_DB\content LOCAL_RERANK_REPO: netease-youdao/bce-reranker-base_v1 LOCAL_EMBED_REPO:...

PdfLoader失败，显示ocr的模型权重加载失败

> from qanything_kernel.utils.loader.self_pdf_loader import PdfLoader pdf_loader = PdfLoader(filename='tables/table-03d9ec345317b0115180d7dbcf843ef6.pdf') markdown_directory = pdf_loader.load_to_markdown() print(f"Markdown文件在: {markdown_directory}") > > ➜ QAnything python QAnything_ocr.py LOCAL DATA PATH: /mnt/user/QAnything-qanything-python/QAnything/QANY_DB/content LOCAL_RERANK_REPO: netease-youdao/bce-reranker-base_v1 LOCAL_EMBED_REPO: netease-youdao/bce-embedding-base_v1 Traceback (most recent...