Curlypla
Curlypla
To begin, very good work I found the software very easy to use and the documentation is perfect. For the problem, when I am in the "Vocoder Fine-Tuning" phase the...
Can you add the possibility to train several subjects using a json like in https://github.com/ShivamShrirao/diffusers/commit/351f3b6206f0453706346fd34a337cdf6ac6ef07. The captioned images look unnecessarily complicated unlike this commit where you just have to specify...
https://github.com/THUDM/CogVLM CogVLM is one of the best models for describing images, much better than qwen vl in my experience. To make image subtitles faster would be a huge gain. Being...
Conditional dropout means the prompt or caption on the training image is dropped, and the caption is "blank". The theory is this can help with unconditional guidance, per the original...