xeroqin issues

Results: 3 issues of xeroqin

extract multi-modal features using InternVideo2

Hi InternVideo2 team! Could you please share the code you use to extract the multi-modal features? I'd like to use the models to extract features from my own dataset. Thanks...

Video upload keeps failing when calling the API

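When I run the project with start.py, I can upload videos and generate subtitles normally, but when I use the API the video upload always fails with the error "Expecting value: line 1 column 1 (char 0)" ![mmexport1729094028950](https://github.com/user-attachments/assets/f5e765a5-8aeb-446a-b67e-3dc5ab92c819)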
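For context, the message quoted in this report matches what Python's standard `json` module raises when the text it is asked to parse is empty or is not JSON at all (for example an HTML error page). The sketch below only illustrates that behaviour. The helper name `parse_json_body` is hypothetical and is not part of the project's API, and whether this is the actual cause of the upload failure is an assumption.

```python
import json


def parse_json_body(body: str):
    """Return (data, None) on success, or (None, message) if body is not valid JSON.

    Hypothetical helper for illustration only; not taken from the project.
    """
    try:
        return json.loads(body), None
    except json.JSONDecodeError as exc:
        # str(exc) has the form "<reason>: line L column C (char N)"
        return None, str(exc)


# An empty response body cannot be parsed as JSON.
data, err = parse_json_body("")
print(err)
```

If this is what is happening, logging the raw response text and the HTTP status code before decoding would show what the server actually returned.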

Training reports that a video may be broken

Hello, after switching to a custom dataset, the following output appears during training: [train] Unique sentence is 3483 , all num is 3486 Video number: 3486 Total Pairs: 3486 [mov,mp4,m4a,3gp,3g2,mj2 @ 0x114200e40] moov atom not found data/biology/video_split/2_8_12_(P12. 鸟（2）)_6.mp4 data/biology/video_split/2_8_12_(P12. 鸟（2）)_6.mp4 data/biology/video_split/2_8_12_(P12. 鸟（2）)_6.mp4...
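The "moov atom not found" line in this log usually means the MP4 demuxer could not find the `moov` box that holds the file's index, which is common for truncated or incompletely written files. As a rough sketch, and assuming the dataset files are plain MP4 containers, the top-level boxes can be scanned with the standard library to list suspect files before training. The names `has_moov_atom` and `find_broken_videos` are hypothetical and do not come from the repository.

```python
import struct
from pathlib import Path


def has_moov_atom(path) -> bool:
    """Return True if a top-level 'moov' box is found in the MP4 file at path."""
    file_size = Path(path).stat().st_size
    with open(path, "rb") as f:
        offset = 0
        while offset + 8 <= file_size:
            f.seek(offset)
            header = f.read(8)
            if len(header) < 8:
                break
            # Each box starts with a 32-bit big-endian size and a 4-byte type.
            size, box_type = struct.unpack(">I4s", header)
            header_len = 8
            if size == 1:
                # A size of 1 means a 64-bit size follows the type field.
                ext = f.read(8)
                if len(ext) < 8:
                    break
                size = struct.unpack(">Q", ext)[0]
                header_len = 16
            elif size == 0:
                # A size of 0 means the box runs to the end of the file.
                size = file_size - offset
            if box_type == b"moov":
                return True
            if size < header_len:
                break  # malformed box, stop scanning
            offset += size
    return False


def find_broken_videos(paths):
    """Return the paths that lack a top-level 'moov' box."""
    return [p for p in paths if not has_moov_atom(p)]
```

Files returned by `find_broken_videos` are candidates for re-encoding or removal from the training list. This check only looks at the container structure and does not verify that the video streams decode correctly.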