YiqingShi

Results 1 comments of YiqingShi

> The current code use the GT image as conditional frame and generate the subsequent video frames for inference, so modifying the text prompt cannot modify the textual attributes well...