Video-LLaMA
Video-LLaMA copied to clipboard
What if no frame_position_embeddings?
Will the performance be worse?