
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

58 Video-LLaMA issues, sorted by recently updated

Prevent variable "atts_img" referred before assignment error on training script on README page.

Added `pytorchvideo` under `pip` to resolve `ResolvePackageNotFound`
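
For context, `ResolvePackageNotFound` means conda cannot find the package in its configured channels; listing the package under the `pip:` section of the environment file sidesteps that. A minimal sketch of the idea; the file name, channel, and version pin are assumptions, not the repo's actual environment.yml:

```yaml
# environment.yml (illustrative excerpt)
name: videollama
channels:
  - defaults
dependencies:
  - python=3.9
  - pip
  - pip:
      # pytorchvideo is not resolvable as a conda package here,
      # so it is installed through pip instead.
      - pytorchvideo
```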

Great project! I would like to ask three questions: 1. Does your public checkpoint include the parameters of the 2-layer Q-Former and the linear projection layer? 2. Seeing that...

I'm also training this... I haven't downloaded webvid2.5m yet and then I found that you have done everything I want to do, hahahaha

Using pad_token, but it is not set yet. ![image](https://github.com/DAMO-NLP-SG/Video-LLaMA/assets/49881437/8e689d08-0605-4a32-b642-b36cccd01988)
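
The "Using pad_token, but it is not set yet" message is a Hugging Face tokenizer warning emitted when no padding token is configured. A common workaround, shown here as a sketch and not necessarily the project's intended fix, is to fall back to the EOS token; the weights path is illustrative:

```python
from transformers import LlamaTokenizer

# Sketch: reuse the EOS token as pad_token so batched inputs can be
# padded without the "pad_token ... not set" warning.
tokenizer = LlamaTokenizer.from_pretrained("path/to/llama-weights")  # illustrative path
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
```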

Hi authors, I want to use Video-LLaMA to run inference on my own dataset. I find that the current framework supports a maximum of 32 input frames; if I change...
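
For context, raising the frame budget past 32 interacts with how frames are sampled and with the number of visual tokens produced downstream. A minimal sketch of uniform frame sampling with a configurable frame count; the suggestion that GPU memory and any positional table sized for 32 frames become the real limit is an assumption, not something confirmed by the repo:

```python
import numpy as np

def sample_frame_indices(num_video_frames, n_frms=32):
    # Uniformly pick `n_frms` indices across the whole video.
    # Increasing n_frms beyond 32 also increases the number of visual
    # tokens per video, so memory use (and any fixed positional
    # embedding table sized for 32 frames) may need to grow with it.
    indices = np.linspace(0, max(num_video_frames - 1, 0), num=n_frms)
    return indices.astype(int).tolist()
```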

Hi, I'm wondering what the input **sample** of the forward function in videollama.py is. It seems to be a dict() containing **image** and **text_input** as its keys, but I can't find...
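
For context, a minimal sketch of what such a `samples` dict might look like; the key names follow the issue text, while the tensor shapes and the commented call are assumptions rather than the model's documented interface:

```python
import torch

samples = {
    # A batch of 2 videos: (batch, channels, time, height, width) -- shapes are illustrative.
    "image": torch.randn(2, 3, 8, 224, 224),
    # One instruction string per video in the batch.
    "text_input": [
        "Describe what happens in the video.",
        "What sound can be heard in the clip?",
    ],
}
# loss = model(samples)  # hypothetical call into the forward() in videollama.py
```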

Hi, do you think the following could be bugs in the lr scheduler? 1. https://github.com/DAMO-NLP-SG/Video-LLaMA/blob/ae12557b8510a7cc94baa3d3aea58ea07f6de76a/video_llama/common/optims.py#L83 should be `step=total_cur_step,`? 2. https://github.com/DAMO-NLP-SG/Video-LLaMA/blob/ae12557b8510a7cc94baa3d3aea58ea07f6de76a/video_llama/common/optims.py#L91 should be `epoch=total_cur_step - self.warmup_steps,`? 3. https://github.com/DAMO-NLP-SG/Video-LLaMA/blob/ae12557b8510a7cc94baa3d3aea58ea07f6de76a/video_llama/common/optims.py#L93 should be `max_epoch=self.max_epoch...
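
For reference, the fix this issue suggests amounts to driving both the warmup and the cosine decay with the global step count rather than the step index within the current epoch. A minimal sketch of that schedule; the function name and parameters here are illustrative, not the actual API in optims.py:

```python
import math

def lr_at_step(total_cur_step, warmup_steps, max_steps,
               init_lr, min_lr, warmup_start_lr):
    # Linear warmup over the first `warmup_steps` global steps.
    if total_cur_step < warmup_steps:
        frac = total_cur_step / max(1, warmup_steps)
        return warmup_start_lr + frac * (init_lr - warmup_start_lr)
    # Cosine decay from init_lr to min_lr over the remaining global steps.
    progress = (total_cur_step - warmup_steps) / max(1, max_steps - warmup_steps)
    return min_lr + 0.5 * (init_lr - min_lr) * (1.0 + math.cos(math.pi * progress))
```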