Fine-tuning results

Open xvjiarui opened this issue 5 months ago • 11 comments

Hi Team,

Thanks for providing the diffusers fine-tuning script. I just tried it out, but the results look strange.

https://github.com/user-attachments/assets/5e1764fd-27d6-466a-a7ce-fb130c80b9c6

https://github.com/user-attachments/assets/9faf714c-67bb-4e85-9300-4f26a7cfc91c

Prompts:

  1. A black and white animated scene unfolds with an anthropomorphic goat surrounded by musical notes and symbols, suggesting a playful environment. Mickey Mouse appears, leaning forward in curiosity as the goat remains still. The goat then engages with Mickey, who bends down to converse or react. The dynamics shift as Mickey grabs the goat, potentially in surprise or playfulness, amidst a minimalistic background. The scene captures the evolving relationship between the two characters in a whimsical, animated setting, emphasizing their interactions and emotions.
  2. A domestic scene unfolds indoors, with a parrot on a stand and a mouse-like character standing next to it, amidst a domestic setting. A lamp is knocked over, causing a sudden change in lighting and affecting the mood. The scene shifts to a maritime setting, where a sailor-like character is shown in dynamic poses near ship's wheel controls and a bell, with a view of waves and distant land through a window.

I didn't modify the training script; I ran https://github.com/THUDM/CogVideo/blob/main/finetune/finetune_multi_rank.sh directly. Am I missing something?

Btw, may I ask whether it is possible to share your training log and validation videos? I want to make sure I am getting correct results.

xvjiarui, Sep 22 '24 02:09