VideoGPT-plus issues

Intermediate descriptions for vcg-plus_112k

Hi team, nice work! Could I request the intermediate descriptions for vcg-plus_112k generated by [this file](https://github.com/mbzuai-oryx/VideoGPT-plus/blob/main/annotation_pipeline/3_dense_video_description.py)? Thanks in advance!

Charleshhy

enhancement

Question about Training Time

1

Hello, Thank you for sharing your excellent research and code. I am currently pretraining an image encoder using 8 A100 GPUs. The estimated time of arrival (ETA) is about 6...

Backdrop9019

Detailed Video Descriptions

3

Do you plan to release the original "Detailed Video Descriptions"?

ShiYaya

eval/vcgbench/inference/run_ddp_inference.sh

1

[h264 @ 0x16543c00] Missing reference picture, default is 65562
[h264 @ 0x16543c00] mmco: unref short failure
[h264 @ 0x16543c00] mmco: unref short failure
[h264 @ 0x16543c00] Missing reference picture, default...

rixejzvdl649

The webm file from ssv2 cannot be loaded

3

raise DECORDError(err_str)
decord._ffi.base.DECORDError: [05:19:05] /github/workspace/src/video/ffmpeg/threaded_decoder.cc:292: [05:19:05] /github/workspace/src/video/ffmpeg/threaded_decoder.cc:218: Check failed: avcodec_send_packet(dec_ctx_.get(), pkt.get()) >= 0 (-11 vs. 0) Thread worker: Error sending packet.

MonolithFoundation

Question about training data

Hi, thanks for your awesome work! I want to know why you train two separate models for the two benchmarks (VCG and MV). Why not use all the data to train a single model? Looking...

vvirgooo2

Question about Dataset

I have a question about the construction of the dataset. After scene detection, does the keyframe extraction in the paper take only one frame per scene?

Nastu-Ho
