BecauseImBatman0

Results 1 comments of BecauseImBatman0

I have the same question. In the paper the outputs from Ref_Net and UNet are concat in the width dimension, but the code: bank_fea = [ rearrange( d.unsqueeze(1).repeat(1, video_length, 1,...