YiqingShi
Results
1
comments of
YiqingShi
> The current code use the GT image as conditional frame and generate the subsequent video frames for inference, so modifying the text prompt cannot modify the textual attributes well...