Haonan Zhang comments

Results: 13 comments of Haonan Zhang

Create dataset, generate Tsv file

Yes, the predicted object labels are not used for the captioning task.

Pre-extracted features link not working

Hello, I have the same issue. Could you send me a copy of the TGIF-QA visual and textual features? Thank you very much! My email is [email protected].

openai-caption-full.pkl

Thanks for your attention to our work. This [link](https://github.com/albanie/collaborative-experts/blob/master/misc/datasets/msvd/README.md) may provide some help. ![image](https://github.com/zchoi/PKOL/assets/30498168/0e496195-4a75-4f16-8ff1-ab810c929637)

📝 Documentation improvement - Everyone is welcome to contribute practical and fun character information!!

![image](https://user-images.githubusercontent.com/30498168/228862577-8259a1ec-03c3-462f-af41-f755b4b76c2f.png) After I click the tab I get an error and cannot select anything. Why is that?

📝 Documentation improvement - Everyone is welcome to contribute practical and fun character information!!

![19e1548e-665d-464d-8141-e0c6f6919652](https://user-images.githubusercontent.com/30498168/228862802-eed41b23-eb4d-4ee2-b974-8436c8227b7e.jpeg) Another problem: I cannot enter image creation mode in a group chat; it reports an error.

Error Count API Failed !

yes, doesn't work again

Error Count API Failed !

got it. thanks~

Error

Solved by removing this line: https://github.com/RayeRen/acad-homepage.github.io/blob/2cc1577eeaf2f74dede6d016a70722dbd409ea2f/_pages/about.md?plain=1#L4

Using my own dataset

Hello, you can refer to this paper [1]; its repo provides a feature extraction pipeline (hdf5 file): [Link](https://github.com/facebookresearch/grid-feats-vqa) [1] Jiang, H., Misra, I., Rohrbach, M., Learned-Miller, E., & Chen, X. (2020). In defense of grid features for visual question answering. In Proceedings of the IEEE/CVF...

Is there code for an image caption generator?

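Hello, this project trains and tests on offline visual features, so for actual deployment it needs to be changed to end-to-end inference. A few suggestions, including but not limited to: 1. Modify the dataloader so that it extracts features from the input images online, see #12. 2. The code below covers decoding the model output and evaluation; you can delete `gts` if you don't need it and keep only the model's current output `gen_i`. https://github.com/zchoi/S2-Transformer/blob/c2f56c5715fc44e4677f546e5bb7c36d57909374/test_transformer.py#L33-L37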
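
For inference without ground-truth captions, the second suggestion could look roughly like the sketch below. This is only an illustration, not the repository's code: `collect_captions`, `decoded_batch`, and `batch_index` are hypothetical names, and the model output is assumed to be already decoded into one list of words per image. Collapsing consecutive repeated words is likewise an assumption about the post-processing you may want.

```python
from itertools import groupby


def collect_captions(decoded_batch, batch_index=0):
    """Turn decoded word lists into caption strings, keyed per sample.

    decoded_batch is a hypothetical input: one list of words per image,
    standing in for the already-decoded model output. No ground-truth
    captions (gts) are needed here.
    """
    gen = {}
    for i, words in enumerate(decoded_batch):
        # Collapse consecutive repeated words, e.g. "a a dog" -> "a dog".
        caption = " ".join(word for word, _ in groupby(words))
        # Key each caption by batch index and position within the batch.
        gen["%d_%d" % (batch_index, i)] = [caption.strip()]
    return gen


captions = collect_captions([["a", "a", "dog", "runs"], ["two", "cats"]])
print(captions)
```

Only the generated captions are collected, so the evaluation step that compares against `gts` can be dropped entirely at deployment time.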