Is there any support for visual generation?

Open dunbar12138 opened this issue 1 year ago • 2 comments

Hi, thanks for the great repo.

I'm wondering how I can use the Lit autoregressive models for image/video generation tasks.

Is there any doc / tutorial about how to prepare visual tokens as a training set, and how to make the LitGPT/llama model take in tokens instead of text?

Or even better, is there any tutorial / example about how to integrate LitGPT/llama into a normal PyTorch Lightning training recipe (potentially with other models being trained together), and how to customize the architecture and training rather than using it as a standalone model?

Jul 20 '24 18:07 dunbar12138

Hi there!

Image/video generation is not supported, but I can surely add a Trainer recipe at some point. Thanks for suggesting!

Jul 20 '24 19:07 rasbt

Thanks for the reply! That would be great. Let me know when the tutorial is added. I really appreciate it!

Jul 20 '24 19:07 dunbar12138