Video-LLaMA icon indicating copy to clipboard operation
Video-LLaMA copied to clipboard

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Results 58 Video-LLaMA issues
Sort by recently updated
recently updated
newest added

Do you have code for single player multi card training? I couldn't find it in the train. py and dataset code, only the torch run instruction

你好,如果只有V100机器,加载llama13B版本,会OOM,但有多张V100,如何实现类似automap的功能,将模型映射到多张v100GPU上?

peft只支持1.13以上版本的torch,environment.yml会强行让torch升级,然后与torchaudio不兼容报错,似乎有这个问题?

![9fb057dd904c82588aa15dbf6996a441](https://github.com/DAMO-NLP-SG/Video-LLaMA/assets/29787866/412f3db0-ff65-4c40-a0b6-c79c95757049) 跑了两个demo.py,用的给的例子,都会出现这样的错误

Hi Do you have any example of Video-LLaMA deployment using docker?

swapped out CUDA params for CPU support

Great works! But I've met some problems and hope anyone has some ideas. When I fine-tune the VL branch only with LLaMA-2 on image/video instruction datas, inf values occurs and...

Dear author, How much time does it cost to train this model? With what type of GPU cards?