Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Hi. What is the full text that the model sees for the demo running in huggingface? Does it include any special tags/sys messages etc.? Thanks.
Hi, I am running the demo with only the VL branch. I set the checkpoint paths like this:

```yaml
llama_model: "model_weights/vicuna_final/"
ckpt: '/home/ubuntu/Documents/Video-LLaMA/model_weights/Pre-trained_Visual_Encoder/pretrained_minigpt4.pth'  # you can use our pretrained ckpt from https://huggingface.co/DAMO-NLP-SG/Video-LLaMA-2-13B-Pretrained/...
```
Hi, thanks a lot for your great work! I am wondering whether we can change the number of sampled frames and the query_tokens size.
Hi guys, I can now fine-tune with 'visionbranch_stage2_finetune.yaml' on **four** A100 80GB GPUs using gradient accumulation. I'd like to know at what point the loss is considered to have converged? For...
Before repair:

```text
TypeError: Caught TypeError in DataLoader worker process 6.
  File "/video_llama/datasets/datasets/webvid_datasets.py", line 70, in __getitem__
    video_path = self._get_video_path(sample_dict)
  File "/video_llama/datasets/datasets/webvid_datasets.py", line 50, in _get_video_path
    rel_video_fp = os.path.join(sample['page_dir'], str(sample['videoid'])...
```
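A likely cause of the traceback above (an assumption, not confirmed upstream): a WebVid CSV row has an empty `page_dir` cell, which pandas loads as a float `NaN`, and passing that to `os.path.join` raises exactly this `TypeError`. A minimal defensive sketch; the `data_root` argument and `.mp4` suffix are illustrative, not the repo's actual values:

```python
import os

def get_video_path(sample, data_root="videos"):
    """Build a video path, guarding against a NaN/missing 'page_dir'.

    pandas reads empty CSV cells as float('nan'); feeding that to
    os.path.join raises "TypeError: join() argument must be str",
    matching the DataLoader worker crash above.
    """
    page_dir = sample.get("page_dir")
    if not isinstance(page_dir, str):
        # Fail with a readable message (or skip the sample) instead of
        # letting the TypeError bubble out of a DataLoader worker.
        raise ValueError(f"sample has no usable page_dir: {sample!r}")
    return os.path.join(data_root, page_dir, f"{sample['videoid']}.mp4")
```

Filtering such rows out of the annotation CSV before training avoids the crash entirely.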
The LLaMA-based Video-LLaMA was updated in August, but the demo does not seem to have been updated. How can I deploy it?
Add the following to the yaml file:

```yaml
evaluate: False
train_splits: ["train", "val"]
train_dataset_ratios: {"train": 0.7, "val": 0.3}
```
`data = self._data_queue.get(timeout=timeout)` never returns any data; training hangs indefinitely and cannot proceed.
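The line above is PyTorch's DataLoader pulling batches from a multiprocessing result queue; when a worker dies silently (e.g. OOM-killed), `get(timeout=...)` can time out repeatedly and the loop appears frozen. A torch-free sketch of that pattern showing how to surface the stall instead of waiting forever (names are illustrative, not the DataLoader internals; setting `num_workers=0` is the usual way to expose the real worker exception):

```python
import queue

def fetch_batch(data_queue, timeout=5.0, max_retries=3):
    """Pull one batch from a worker queue, failing loudly after timeouts.

    Mirrors the data_queue.get(timeout=timeout) call above: instead of
    retrying silently, raise after a few attempts so the hang becomes a
    diagnosable error.
    """
    for attempt in range(max_retries):
        try:
            return data_queue.get(timeout=timeout)
        except queue.Empty:
            print(f"no data after {timeout}s (attempt {attempt + 1}/{max_retries})")
    raise RuntimeError(
        "workers produced no data; try num_workers=0 to see the real exception"
    )
```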
Could you provide an inference script? Something other than the gradio demo.