InternVideo issues

When finetuning, len(train_loader)==0, ZeroDivisionError: integer division or modulo by zero in tasks/pretrain.py

1

I want to finetune the InternVideo2-Stage2_1B-224p-f4 on activitynet. I adjust the data in data.py. My data is: `available_corpus["anet_ret_val"] = dict( anno_path=".../ActivityNet/anno_downstream/anet_ret_val.json", data_root=".../ActivityNet", media_type="video", is_paragraph_retrieval=True, max_txt_l = 150 ) available_corpus["anet_ret_train"] =...

KevinHuhr

How to use internvideo2-s2_6b-224p-f4

Thanks for your great work. I use your work for text-video retrieval and tried the internvideo2-s2_1b-224p-f4 from the demo and worked well. You recently released the 6b version. I noticed...

locys

Access to pooling ablations. (Section X)

Hi, Thanks for sharing this solid model. In the paper, there is mention of pooling ablations in section X. I believe thats in the appendix. But the arxiv version does...

rohun-tripathi

Request for InternVideo2-stage2 6b model

6

Hello, Is there any approximate time when the 6b model will be available that had been used to get the rank 1 result on the [MSR-VTT leaderboard](https://paperswithcode.com/sota/zero-shot-video-retrieval-on-msr-vtt)?

dipta007

Weights for InternVideo2 s 2 -6B

3

Hi Team, Could you please clarify the release timeline for the weights of `InternVideo2 S2-6B` for video-text retrieval ? If they have already been released, could you kindly share the...

KerolosAtef

Problem with dropout_layer_norm

1

Hi, thank you for your great work! I want to use InternVideo2 as backbone. I installed flash-attn 2.7.3 and dropout_layer_norm from source. But now i am struggling with this error...

anfortas337

Whether InternVideo2.5 will open source pre-training code

1

bobo0810

Open Source Plans for InternVideo2.5 SFT and Training Code

1

When will InternVideo2.5 open source the SFT code like InternVL2? Will the training code be open-sourced to facilitate following? 🙏

sotayang

Chinese Text2video retrieval support?

8

Thank you for contributing such outstanding work, I would like to ask InternVideo2 support Chinese text search video? What model do I need to replace the VisionEncoder and TextEncoder with?...

KeyaoZhao

在InternVideo2.5中Adaptive Temporal Sampling是怎么做的？

1

在论文3.1中看到了Adaptive Temporal Sampling相关介绍，但是好像没有提到技术上怎么实现的。

double-fire-0

InternVideo
InternVideo copied to clipboard

Metadata

When finetuning, len(train_loader)==0, ZeroDivisionError: integer division or modulo by zero in tasks/pretrain.py

How to use internvideo2-s2_6b-224p-f4

Access to pooling ablations. (Section X)

Request for InternVideo2-stage2 6b model

Weights for InternVideo2 s 2 -6B

Problem with dropout_layer_norm

Whether InternVideo2.5 will open source pre-training code

Open Source Plans for InternVideo2.5 SFT and Training Code

Chinese Text2video retrieval support?

在InternVideo2.5中Adaptive Temporal Sampling是怎么做的？

← Metadata

Owner

Metadata

InternVideo InternVideo copied to clipboard

Metadata

← Metadata

Owner

Metadata

InternVideo
InternVideo copied to clipboard