Text2Video-Zero
Text2Video-Zero copied to clipboard
Cross Frame Attention vs Sparse-Causal Attention
Hi, your work is amazing!
After reading your paper, I have one question. What exactly is the difference between Cross Frame Attention and the Sparse-Causal Attention from the Tune-A-Video paper?
Thank you.
Hi @HyeonHo99,
we are attending only to the first frame, without updating any model weights. In fact, by attending to the first frame only, we obtain better results and at the same faster results compared to the SCA approach.