mPLUG
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
The [code](https://github.com/X-PLUG/mPLUG/blob/c666bfa1044bde5a6ce47fa1b4ae22d7bf9de633/caption_mplug_scst.py#L84-L86) in this repo shows that the baseline reward is calculated by averaging the rewards of the generated captions. However, the [original SCST implementation](https://github.com/ruotianluo/self-critical.pytorch), as well as some other SCST implementations...
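For reference, the two baseline choices this issue contrasts can be sketched as follows (a minimal illustration; the function names and list-based reward format are hypothetical, not taken from either repo):

```python
def advantages_mean_baseline(sample_rewards):
    """mPLUG-style SCST: baseline = mean reward over the k sampled captions.

    Each sampled caption's advantage is its reward minus the batch mean,
    so the advantages always sum to zero across the samples.
    """
    baseline = sum(sample_rewards) / len(sample_rewards)
    return [r - baseline for r in sample_rewards]


def advantages_greedy_baseline(sample_rewards, greedy_reward):
    """Original SCST (Rennie et al.): baseline = reward of the greedy caption.

    Sampled captions that beat the greedy decode get a positive advantage;
    those that fall short are pushed down.
    """
    return [r - greedy_reward for r in sample_rewards]
```

The practical difference: the mean-of-samples baseline needs no extra greedy decoding pass, while the greedy baseline directly rewards sampling above the model's own test-time (greedy) behavior.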
Hi. I don't understand how to use this pre-trained model for image captioning. Am I supposed to clone the GitHub repo and then somehow load the pre-trained model? It would...
And how to find it? Thanks.
Hi authors, thanks for your great work. Is there any chance you could release the finetuned COCO/Flickr30k checkpoints for the image-retrieval task? Thanks a lot.
Is the model published on ModelScope the SOTA model from the paper? Using the model from the model card and its caption-output method, I evaluated BLEU-4 and CIDEr on the COCO val 5k split, and both fall below the figures reported in the paper. Is this due to the model itself, an incorrect evaluation method, or a mismatch between the test set and evaluation toolkit?
Dear authors, I finetuned mPLUG Base on VQAv2 but only got around 75% accuracy instead of the roughly 80% reported in the README. Could you kindly upload the finetuned checkpoints...
Hi, when will the pretraining code be released? The README says "coming soon", and I'd like to try it on my own data. Thanks!
Hi, thanks for your work! I used your model with ModelScope, but I couldn't find which model size is the default setting in ModelScope; I only know it's named...