Yuhao Xu
Perhaps, but we feel the current performance bottleneck may lie in the data rather than the base model.
Hi, thanks for asking. This is a common failure case for cross-category VTON, caused by the imperfect mask. We plan to solve this in our OOTDiffusion V2.
Please download the model checkpoints from https://huggingface.co/levihsu/OOTDiffusion/tree/main
The final output is saved in `./run/images_output`, and no such intermediate result should be saved. Please check the correctness of your code and files.
Please download the checkpoints from https://huggingface.co/levihsu/OOTDiffusion/tree/main/checkpoints
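If you prefer to script the download, here is a minimal sketch using `snapshot_download` from the `huggingface_hub` package. The helper name `download_checkpoints`, the `downloader` parameter, and the default `local_dir` are assumptions made for illustration; they are not part of the OOTDiffusion code.

```python
from typing import Callable, Optional


def download_checkpoints(
    local_dir: str = ".",
    downloader: Optional[Callable[..., str]] = None,
) -> str:
    """Fetch only the `checkpoints/` folder of the levihsu/OOTDiffusion repo.

    `local_dir` is an assumed destination; point it at wherever your copy
    of the code expects the checkpoints to live.
    """
    if downloader is None:
        # Imported lazily so this module loads even without huggingface_hub.
        from huggingface_hub import snapshot_download

        downloader = snapshot_download
    return downloader(
        repo_id="levihsu/OOTDiffusion",
        allow_patterns=["checkpoints/*"],  # skip files outside checkpoints/
        local_dir=local_dir,
    )
```

Since `snapshot_download` keeps the repository layout under `local_dir`, calling `download_checkpoints()` with the default argument should place the files under `./checkpoints`.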
Yep. Just save checkpoints and set `resume_from_checkpoint` in your training script to resume training from the saved state.
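As a rough sketch of how you could pick the value to pass to `resume_from_checkpoint`, the helper below finds the most recent checkpoint folder. It assumes checkpoints are saved as `checkpoint-<step>` directories inside one output folder, a convention common in Hugging Face example training scripts; your own script may name them differently, and `latest_checkpoint` is a hypothetical helper.

```python
import re
from pathlib import Path
from typing import Optional

# Assumed naming convention: checkpoint-<global step>, e.g. checkpoint-1000
_CKPT_RE = re.compile(r"^checkpoint-(\d+)$")


def latest_checkpoint(output_dir: str) -> Optional[Path]:
    """Return the checkpoint directory with the highest step, or None."""
    root = Path(output_dir)
    if not root.is_dir():
        return None
    best: Optional[Path] = None
    best_step = -1
    for child in root.iterdir():
        match = _CKPT_RE.match(child.name)
        # Compare steps numerically so checkpoint-1000 beats checkpoint-500.
        if child.is_dir() and match and int(match.group(1)) > best_step:
            best, best_step = child, int(match.group(1))
    return best
```

If the function returns `None`, there is nothing to resume from and training would start from scratch.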
Please refer to the [README](https://github.com/levihsu/OOTDiffusion/blob/main/README.md) for installation and inference.
Please check your environment configuration. Our own Gradio demos produce an image in a few seconds: https://huggingface.co/spaces/levihsu/OOTDiffusion https://ootd.ibot.cn/
Hi all, please check that your downloaded checkpoints (including the several different ***config.json*** files) are correct. Our online demo uses exactly the same checkpoints. See the files at https://huggingface.co/spaces/levihsu/OOTDiffusion
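A quick local sanity check is to confirm that every `config.json` under your checkpoint folder at least parses as JSON, since an interrupted or mis-saved download often leaves an empty or non-JSON file. This is only a sketch: `check_configs` is a hypothetical helper, and it cannot tell whether the values match the official files, so compare against the files in the Space for that.

```python
import json
from pathlib import Path
from typing import Dict


def check_configs(checkpoint_dir: str) -> Dict[str, str]:
    """Return {path: problem} for each config.json that is empty or invalid."""
    problems: Dict[str, str] = {}
    # rglob walks all subfolders, so every config.json is covered.
    for cfg in sorted(Path(checkpoint_dir).rglob("config.json")):
        try:
            text = cfg.read_text(encoding="utf-8")
            if not text.strip():
                problems[str(cfg)] = "file is empty"
                continue
            json.loads(text)
        except (OSError, ValueError) as exc:
            # JSONDecodeError and UnicodeDecodeError both subclass ValueError.
            problems[str(cfg)] = f"{type(exc).__name__}: {exc}"
    return problems
```

An empty result means all the `config.json` files found are readable JSON; any entry in the result names a file worth downloading again.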
Hi. It takes around 3 seconds and 6GB of memory for 1 sample with 20 steps on our RTX 4090 GPU. A 6GB 3060 may not be quite enough.