DoubleXING
DoubleXING
Please see #99.
Hi. I will release the training code this month. Currently, I am too busy with other research projects. Thanks for your interest and patience.
We don't recommend finetuning the 256x256 model, since its performance is much, much, much worse than that of the 512x320 and 1024x576. models..
Hi. We will release it soon, as I am currently busy with other research projects.
Available now.
They are filtered as described in the paper.
Here I just use the ChatGPT to generate some keywords about the CGI/Graphics content and then filter those videos using caption only. A better way is to filter using image...
yeap, using CLIP following previous video generation works
For the image-to-video/image animation application, we just feed a single image to the i2v model, while for interpolation, we feed both the starting image and ending image to the model...