xincheng Ju comments

Results: 9 comments of xincheng Ju

Is there any way to speed up the inference

I have the same problem. On a single 4090, inference takes about 3 s for just one sample.

finetune with lora

![image](https://github.com/PKU-YuanGroup/Video-LLaVA/assets/38723604/78a03042-003d-4a2f-b324-b3343eeb3960)

After running finetune_lora.sh, I get some files in the checkpoint directory ![image](https://github.com/PKU-YuanGroup/Video-LLaVA/assets/38723604/c5c6c5ce-1dbd-4216-b8f2-4a7b14c7ff32) How can I use this checkpoint for inference or evaluation? I want to use this newly fine-tuned model to infer...

finetune with lora

![image](https://github.com/PKU-YuanGroup/Video-LLaVA/assets/38723604/eb666b74-3ff6-4be3-90a2-bc21924d8a33)

finetune with lora

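For the dataset, you can pick just one of the groups on Hugging Face. For valley, you can download only a part of it and then open it up for testing.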

finetune with lora

![image](https://github.com/PKU-YuanGroup/Video-LLaVA/assets/38723604/69aef432-6443-41ca-8255-b7de4fb00d83) ![image](https://github.com/PKU-YuanGroup/Video-LLaVA/assets/38723604/4578826f-b33a-45ac-a422-e3e0ea8ddf0c) You would download these two files, right? Then extract the valley.json file, and write some code to pick out the entries that actually have videos.
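The filtering step could look roughly like the sketch below. It is not code from the repo. It assumes valley.json is a JSON list of dicts and that each entry keeps a relative video path under a `"video"` key (I have not checked the exact schema, so adjust the key if yours differs). The function names and the paths in the usage line are made up for illustration.

```python
import json
from pathlib import Path


def filter_entries_with_video(entries, video_root):
    """Keep only annotation entries whose video file exists under video_root."""
    video_root = Path(video_root)
    kept = []
    for entry in entries:
        # Assumption: the relative video path is stored under the "video" key.
        rel_path = entry.get("video")
        if rel_path and (video_root / rel_path).is_file():
            kept.append(entry)
    return kept


def filter_annotation_file(src_json, dst_json, video_root):
    """Read src_json, drop entries without a local video, write dst_json.

    Returns (number of entries kept, number of entries read).
    """
    with open(src_json, "r", encoding="utf-8") as f:
        entries = json.load(f)
    kept = filter_entries_with_video(entries, video_root)
    with open(dst_json, "w", encoding="utf-8") as f:
        json.dump(kept, f, ensure_ascii=False, indent=2)
    return len(kept), len(entries)
```

Usage would be something like `filter_annotation_file("valley.json", "valley_subset.json", "valley/")`, where the output file name and the video folder are placeholders. Then point the training script at the filtered JSON instead of the full one.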