Yinan He comments

Results 176 comments of


                                            Yinan He

[PERFORMANCE_REPORT]+[OPTIMIZATION]/[SUGGESTION]

Thank you for your careful analysis and excellent suggestions! We added support for whisper in [long_video_support](https://github.com/OpenGVLab/Ask-Anything/tree/long_video_support), we will update our code with your suggestions!

Google drive dataset Error 404: File not found

Hi，Do you sign-in with your api

validation dataset

https://pan.quark.cn/s/576bf2bea36f

Access Permission of Google drive

It's already been shared with you. We will handle the mail for the first time on the Chinese workday.

How to get spatial_localize?

Yes, we used pixel difference.

clevr数据集的使用

> 原来如此，不过好像在CLEVR的metadata里没有看到image_index，代码是： > > ``` > ds = datasets.load_dataset("./datasets/M3IT/", "clevr", split="train", streaming=True) > ds.info > ``` 抱歉，看到这个问题，我们是通过直接下载huggingface dataset repo里的jsonl文件读取的 ![image](https://github.com/OpenGVLab/Ask-Anything/assets/43169235/a1669ee8-fa03-46eb-a0a4-fc8363f8ecb9)

Any instructions for fine-tuning on custom datasets?

You can do so by adding your meta file in the [instruction_data](https://github.com/OpenGVLab/Ask-Anything/blob/main/video_chat2/configs/instruction_data.py) file. The meta file is a json and follows the following schema: ``` [ { "video": "string", "QA":...

Error when run main.py

you can simplely remove `bn_var_mode=syncbnVarMode_t.L2` in line94

Bugs during training stage2

Thank you for your feedback. You can directly comment on them, and we will fix this issue as soon as possible.