How consistent self-atttention fits into the semantic motion predictor？

Open woshipapa opened this issue 1 year ago • 0 comments

Great work by everyone! I'd like to ask you a little bit about how consistent self-atttention fits into the semantic motion predictor, I see that the input in the semantic motion predictor in the thesis is that there are only two images (one as the start frame and one as the end frame).

Dec 18 '24 09:12 woshipapa