CogVideo icon indicating copy to clipboard operation
CogVideo copied to clipboard

I see portrait videos on the cogvideox page

Open powerspowers opened this issue 1 year ago • 9 comments

How were these portrait videos accomplished? Was it all post processing or can cogvideox I2V produce portrait now?

https://yzy-thu.github.io/CogVideoX-demo/

powerspowers avatar Oct 19 '24 00:10 powerspowers

This is naturally generated, but currently we do not have control functions such as controlnet.

zRzRzRzRzRzRzR avatar Oct 20 '24 07:10 zRzRzRzRzRzRzR

I get that the controlnet workflow is not available currently. When you say 'naturally generated' do you mean the codebase in this repository support portrait image input (I2V) and can generate output that is portrait ratio (9:16 for example)?

Thank you

powerspowers avatar Oct 21 '24 03:10 powerspowers

This is a model that we continued training based on 5b, capable of generating 16fps 720p resolution videos.

yzy-thu avatar Oct 21 '24 05:10 yzy-thu

This is a model that we continued training based on 5b, capable of generating 16fps 720p resolution videos.

@yzy-thu will this 16fps model be open-source?

Florenyci avatar Oct 21 '24 21:10 Florenyci

This is a model that we continued training based on 5b, capable of generating 16fps 720p resolution videos.

@yzy-thu will this 16fps model be open-source?

Stay tuned

yzy-thu avatar Oct 22 '24 06:10 yzy-thu

This is a model that we continued training based on 5b, capable of generating 16fps 720p resolution videos.

@yzy-thu will this 16fps model be open-source?

Stay tuned

@yzy-thu actually I already tried 16 fps fine-tuning, but the output is only 3s, I'm wondering how you get 6s video?

Florenyci avatar Oct 22 '24 18:10 Florenyci

This is a model that we continued training based on 5b, capable of generating 16fps 720p resolution videos.

@yzy-thu will this 16fps model be open-source?

Stay tuned

@yzy-thu actually I already tried 16 fps fine-tuning, but the output is only 3s, I'm wondering how you get 6s video?

we do a patch in T dimension

yzy-thu avatar Oct 23 '24 05:10 yzy-thu

This is a model that we continued training based on 5b, capable of generating 16fps 720p resolution videos.

@yzy-thu will this 16fps model be open-source?

Stay tuned

Woah, exciting, you are becoming number one in this space quickly!!! Following with great interest!

oliverban avatar Oct 26 '24 02:10 oliverban

This is a model that we continued training based on 5b, capable of generating 16fps 720p resolution videos.

@yzy-thu will this 16fps model be open-source?

Stay tuned

@yzy-thu actually I already tried 16 fps fine-tuning, but the output is only 3s, I'm wondering how you get 6s video?

we do a patch in T dimension

Can you explain it in detail? @yzy-thu

hanshumin001 avatar Oct 29 '24 10:10 hanshumin001