VGen
What is the corresponding paper for text-to-video model training?
I see that two papers are referenced for t2v: ModelScope T2V and HiGen. Which paper corresponds to the implementation of the class UNetSD_T2VBase?