CogVideo icon indicating copy to clipboard operation
CogVideo copied to clipboard

How to finetune with multi resolution?

Open trouble-maker007 opened this issue 1 year ago • 2 comments

Is there any plan release the code of multi resolution finetune with sat? I found that the current sft.yaml file is not adjust with the v1.5 ckpt

trouble-maker007 avatar Nov 12 '24 08:11 trouble-maker007

In the fine-tuning section, we will provide a fine-tuning version of diffusers. The model version of diffusers will be released this week, and it is expected to be fine-tuned on cogvideox-factory

zRzRzRzRzRzRzR avatar Nov 12 '24 09:11 zRzRzRzRzRzRzR

@zRzRzRzRzRzRzR Could you add some comments to this code? What is the role of OFS embedding? https://github.com/THUDM/CogVideo/blob/main/sat/dit_video_concat.py#L721

trouble-maker007 avatar Nov 12 '24 16:11 trouble-maker007

ofs is a constant, mainly adding a constant to the embed of the model

zRzRzRzRzRzRzR avatar Jan 08 '25 01:01 zRzRzRzRzRzRzR