Open-Sora-Plan How to extract t5 or video ae feature to fit T2V_Feature

How to extract t5 or video ae feature to fit T2V_Feature_Dataset？

Open knightyxp opened this issue 9 months ago • 0 comments

Hi, I noticed that pre-extracting features for 512-frame videos can significantly reduce memory usage. Could you please share the code for pre-computing video and T5 features? Additionally, the memory savings from compress-kv are limited. Could this be due to the compress_kv factor being set to 1? Would higher compression factors like 2 or 4 further reduce memory usage? What are the potential fault?

Apr 30 '24 07:04 knightyxp

Open-Sora-Plan Open-Sora-Plan copied to clipboard

How to extract t5 or video ae feature to fit T2V_Feature_Dataset？

Open-Sora-Plan
Open-Sora-Plan copied to clipboard