Xiaodong Wang
Xiaodong Wang
Hi! lucidrains. Thanks for providing a great repo which is convenient to understand the NUWA paper. I have a question as follows: In the NUWA paper, we can see that...
add clip support to train_t2v.py. This PR does that.
update readme.md
Hi! Thanks for your great work! I have looked into the code about video processing, but I did not figure out **how many frames** of a video are encoded into...
if `image_idx not in video_idx_in_batch`, `image_feature` will be added into `new_image_features` repeatedly, which should be avoided. That's what this PR does.