DIV and FLT stand for diversity sampling and filtering, respectively. Specifically, for DIV (diversity sampling), we aim to sample video clips from all available long videos to maximize data diversity....
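As a rough illustration of the DIV idea, here is a minimal sketch in Python. The function name `diverse_sample` and the data layout (each long video carrying a list of pre-cut candidate clips) are assumptions for the example, not the actual pipeline:

```python
import random

def diverse_sample(long_videos, clips_per_video=4, seed=0):
    """Take a few clips from every long video so that no single
    source dominates the sampled clip pool (diversity sampling)."""
    rng = random.Random(seed)
    sampled = []
    for video in long_videos:
        clips = video["clips"]  # candidate clips cut from this long video
        k = min(clips_per_video, len(clips))
        sampled.extend(rng.sample(clips, k))
    return sampled
```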
Apologies for the delayed response.
1. You can access the full version of InternVid [here](https://huggingface.co/datasets/OpenGVLab/InternVid-Full).
2. No, the aesthetic dataset does not consider the CLIP score. When filtering by aesthetic scores,...
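For clarity, a minimal sketch of what point 2 means in code, assuming each record carries a precomputed `aesthetic_score` field (the field name and the threshold value are illustrative assumptions):

```python
def filter_by_aesthetics(records, threshold=4.5):
    """Keep clips by aesthetic score alone; the CLIP similarity
    score is deliberately not consulted, per the answer above.
    threshold=4.5 is an illustrative value only."""
    return [r for r in records if r["aesthetic_score"] >= threshold]
```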
It is a legacy issue. We will see what we can do.
Our team is preparing it. However, due to our tight schedule, the precise release date for the full version is currently uncertain.
Apologies for the delayed response. You can access the complete version of InternVid [here](https://huggingface.co/datasets/OpenGVLab/InternVid-Full).
Thank you for your feedback. We will address the issues you mentioned soon.
This is caused by the missing installation of some libraries shipped with flash attention. You need to get the flash attention source code and then install layer_norm as in...
If your machine does not support installing these libraries, you can change the settings in config.py so that half precision (fp16) and bf16 are not used. In that...
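As a rough sketch of that kind of change (the option names below are assumptions; check the actual keys in config.py):

```python
# config.py (illustrative excerpt; actual option names may differ)
use_flash_attn = False       # avoid the flash-attn kernels entirely
use_half_precision = False   # run in full fp32 instead of fp16
use_bf16 = False             # likewise disable bfloat16
```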
You can refer to [this instruction](https://github.com/OpenGVLab/InternVideo/blob/main/InternVideo2/multi_modality/INSTALL.md#key-dependencies-installation-for-flashattention2) to install the dependencies needed to run flash-attn with layernorm and other components. If your hardware does not support installing flash-attn and its dependencies, you can...
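After installation, a quick sanity check like the following can confirm the optional kernels are importable (a sketch; `dropout_layer_norm` is assumed to be the extension module built from flash-attention's `csrc/layer_norm`):

```python
# Check that flash-attn and its fused layer_norm extension built correctly.
try:
    import flash_attn
    import dropout_layer_norm  # installed from flash-attention/csrc/layer_norm
    print("flash-attn", flash_attn.__version__, "with fused layer_norm: OK")
except ImportError as e:
    print("missing optional dependency:", e)
```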
Sorry for the late reply. Please refer to [this branch](https://github.com/OpenGVLab/InternVideo/tree/grounding_evaluation) of the repo for grounding evaluations.