Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Thanks for your open-source work! Could you tell us roughly how long pre-training and fine-tuning each take on 8x A100 GPUs?
In Video-LLaMA, we notice that you load LlamaForCausalLM from ./models/modelling_llama.py. I wonder why you don't load it directly with "from transformers import LlamaForCausalLM". Did you make any changes to the original...
My specs are:
```
GPU 0: NVIDIA A100-PCIE-40GB
MEM: 60 GB
```
My config file looks like:
```
model:
  arch: video_llama
  model_type: pretrain_vicuna
  freeze_vit: True
  freeze_qformer: True
  max_txt_len: 512
  end_sym:...
```
This is fixed by extracting the audio from the input video and saving it to a `wav` file with the `ffmpeg-python` package. Fixes #163
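A minimal sketch of that approach, assuming the standard `ffmpeg-python` API (the helper name and the exact codec/sampling settings are illustrative, not necessarily what the PR uses):

```
import ffmpeg  # the ffmpeg-python package

def extract_audio(video_path: str, wav_path: str) -> str:
    """Extract the audio track of a video into a standalone .wav file."""
    (
        ffmpeg
        .input(video_path)
        .output(wav_path, format="wav", acodec="pcm_s16le", ac=1, ar=16000)
        .overwrite_output()  # replace the .wav if it already exists
        .run(quiet=True)
    )
    return wav_path
```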
There seems to be a bug in the function `upload_video()` in the class `Chat` in file `video_llama/conversation/conversation_video.py`. On line 255 of `conversation_video.py`, you directly pass the `video_path` to the...
Can you tell me how to use LoRA or QLoRA to fine-tune this model? Moreover, how can I load the entire model from Hugging Face?
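LoRA is not wired into this repo out of the box, but one plausible route is to wrap the LLM backbone with the `peft` library after the full model is loaded (for QLoRA you would additionally load the base weights in 4-bit via bitsandbytes). A sketch under stated assumptions: the attribute name `llama_model` and the `target_modules` names follow typical LLaMA setups and should be checked against the repo's model code.

```
from peft import LoraConfig, get_peft_model

def add_lora(model):
    """Attach LoRA adapters to the LLM backbone of a loaded Video-LLaMA model.

    Assumption: the backbone is exposed as `model.llama_model`
    (a LlamaForCausalLM); only the adapter weights will be trainable.
    """
    lora_config = LoraConfig(
        r=8,
        lora_alpha=16,
        lora_dropout=0.05,
        target_modules=["q_proj", "v_proj"],  # LLaMA attention projections
        task_type="CAUSAL_LM",
    )
    model.llama_model = get_peft_model(model.llama_model, lora_config)
    model.llama_model.print_trainable_parameters()  # sanity check
    return model
```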
Hi, thank you very much for your great work. I encountered some problems while using the finetune-billa7b-zh model for inference. The configuration is as follows:
```
model:
  arch: video_llama
  model_type:...
```
I configured a local model path, but it still complains that it cannot connect for a remote download. The config file is as follows:
```
model:
  arch: video_llama
  model_type: pretrain_vicuna
  freeze_vit: True
  freeze_qformer: True
  max_txt_len: 512
  end_sym: "###"
  low_resource: False
  frozen_llama_proj: False
  # If you want to use LLaMA-2-chat,
  # some ckpts could be...
```
Thanks for your contributions! It would be nice if you could let me know whether you are going to release Video-LLaMA checkpoints with LLaMA 3.1 anytime soon. Thanks, Shraman
Hi, thank you so much for this work! I was wondering if there is any API to run inference on a custom video-audio question-answering dataset? Also, I wanted to confirm...
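There is no dedicated batch-inference API in the repo, but the `Chat` class behind the Gradio demo (in `video_llama/conversation/conversation_video.py`, also referenced in the bug report above) can be driven programmatically. A rough sketch for one video-question pair; the method names and signatures (`upload_video`, `ask`, `answer`) are assumptions drawn from the demo code and should be verified against the repo:

```
import copy

def answer_one(chat, conv_template, video_path: str, question: str) -> str:
    """Ask one question about one video through the demo's Chat interface.

    Assumptions: `chat` is an initialized Chat instance and `conv_template`
    is the repo's default conversation object.
    """
    conv = copy.deepcopy(conv_template)  # fresh dialogue state per video
    img_list = []                        # filled with encoded video features
    chat.upload_video(video_path, conv, img_list)
    chat.ask(question, conv)
    answer = chat.answer(conv, img_list, max_new_tokens=300)[0]
    return answer
```

Looping this over a custom dataset gives simple sequential inference; for anything large-scale you would want to batch the visual encoding yourself.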