Video-LLaMA issues

multi-cards training

Do you have code for single player multi card training? I couldn't find it in the train. py and dataset code, only the torch run instruction

gqsmmz

inferece如何使用多张V100代替一张A100？

4

你好，如果只有V100机器，加载llama13B版本，会OOM，但有多张V100，如何实现类似automap的功能，将模型映射到多张v100GPU上？

flying2023

关于environment.yml文件的问题

2

peft只支持1.13以上版本的torch，environment.yml会强行让torch升级，然后与torchaudio不兼容报错，似乎有这个问题？

balabanahei

demo运行

9

![9fb057dd904c82588aa15dbf6996a441](https://github.com/DAMO-NLP-SG/Video-LLaMA/assets/29787866/412f3db0-ff65-4c40-a0b6-c79c95757049) 跑了两个demo.py，用的给的例子，都会出现这样的错误

luyao-cv

example model deployment

Hi Do you have any example of Video-LLaMA deployment using docker?

nahidalam

CPU support

1

swapped out CUDA params for CPU support

esteininger

inf value occurs during forwarding process when fine-tuning VL branch with LLAVA-150K+MiniGPT4-3.5K+webvid-instruct

1

Great works! But I've met some problems and hope anyone has some ideas. When I fine-tune the VL branch only with LLaMA-2 on image/video instruction datas, inf values occurs and...

xuboshen

Dear author, How much time does it cost to train this model？ With what type of GPU cards?

zhangyuereal

RuntimeError: Error(s) in loading state_dict for LlamaForCausalLM: size mismatch for model.embed_tokens.weight: copying a param with shape torch.Size([32001, 4096]) from checkpoint, the shape in current model is torch.Size([32000, 4096]). size mismatch for lm_head.weight: copying a param with shape torch.Size([32001, 4096]) from checkpoint, the shape in current model is torch.Size([32000, 4096]).

Amber0913

How to finetune video-llama using deepspeed?

tangyipeng100

Video-LLaMA
Video-LLaMA copied to clipboard

Metadata

multi-cards training

inferece如何使用多张V100代替一张A100？

关于environment.yml文件的问题

demo运行

example model deployment

CPU support

inf value occurs during forwarding process when fine-tuning VL branch with LLAVA-150K+MiniGPT4-3.5K+webvid-instruct

Dear author, How much time does it cost to train this model？ With what type of GPU cards?

How to finetune video-llama using deepspeed?

← Metadata

Owner

Metadata

Video-LLaMA Video-LLaMA copied to clipboard

Metadata

← Metadata

Owner

Metadata

Video-LLaMA
Video-LLaMA copied to clipboard