stoneMo

Results 2 comments of stoneMo

Thanks for your interest in our FastConvMAE. Motivated by the information density in MAE, we fixed the group size to 4. Each group covers 25% of tokens and reconstructs the...

Hi, thanks for your interest in our work. We extracted the middle frame as a single image from the video.