Yuhao Xu

Results 63 comments of Yuhao Xu

Perhaps, but we feel the current performance bottleneck is more likely the data than the base model.

Hi, thanks for asking. This is a common failure case for cross-category VTON, caused by the imperfect mask. We plan to solve it in OOTDiffusion V2.

Please download the model checkpoints from https://huggingface.co/levihsu/OOTDiffusion/tree/main

The final output is saved in `./run/images_output`, and no such intermediate result should be saved. Please check that your code and files are correct.

Please download the checkpoints from https://huggingface.co/levihsu/OOTDiffusion/tree/main/checkpoints

Yep. Just save checkpoints and set `resume_from_checkpoint` in your training script to resume training.
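As a rough illustration of the resume step, the sketch below picks the most recent saved checkpoint so its path can be passed as `resume_from_checkpoint`. It assumes checkpoints are saved as `checkpoint-<step>` directories inside an output folder, a naming borrowed from the Hugging Face diffusers example training scripts; your own training script may name them differently. The helper `latest_checkpoint` is hypothetical and not part of OOTDiffusion.

```python
import re
from pathlib import Path


def latest_checkpoint(output_dir):
    """Return the checkpoint-<step> directory with the highest step, or None.

    Assumption: checkpoints are directories named "checkpoint-<step>".
    """
    pattern = re.compile(r"^checkpoint-(\d+)$")
    out = Path(output_dir)
    if not out.is_dir():
        return None

    best_step, best_path = -1, None
    for child in out.iterdir():
        match = pattern.match(child.name)
        # Compare steps numerically so that 1000 ranks above 500.
        if child.is_dir() and match and int(match.group(1)) > best_step:
            best_step, best_path = int(match.group(1)), child
    return best_path


if __name__ == "__main__":
    # "./output" is an assumed location; use your script's output directory.
    print(latest_checkpoint("./output"))
```

Comparing the step numbers as integers matters here, because a plain string sort would place `checkpoint-500` after `checkpoint-1000`.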

Please refer to the [README](https://github.com/levihsu/OOTDiffusion/blob/main/README.md) for installation and inference instructions.

Please check your environment configuration. Our own Gradio demos produce an image within a few seconds: https://huggingface.co/spaces/levihsu/OOTDiffusion and https://ootd.ibot.cn/

Hi all, please check that your downloaded checkpoints (including the several different ***config.json*** files) are correct. Our online demo uses exactly the same checkpoints. See the files at https://huggingface.co/spaces/levihsu/OOTDiffusion
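A quick first check on the downloaded files could look like the sketch below. It walks a local checkpoint folder and reports every `config.json` that is empty or does not parse as a JSON object, which is what a truncated download or a saved HTML error page usually looks like. The function name `find_broken_configs` and the `checkpoints` path are assumptions for illustration. The check only confirms that the files parse; it cannot tell whether they match the ones used by the online demo.

```python
import json
from pathlib import Path


def find_broken_configs(root):
    """Return config.json paths under root that are empty or not a JSON object."""
    broken = []
    for path in sorted(Path(root).rglob("config.json")):
        try:
            with path.open("r", encoding="utf-8") as f:
                data = json.load(f)
        except (json.JSONDecodeError, UnicodeDecodeError):
            # Not valid JSON, e.g. a truncated file or an HTML page.
            broken.append(path)
            continue
        # A usable config is expected to be a non-empty JSON object.
        if not isinstance(data, dict) or not data:
            broken.append(path)
    return broken


if __name__ == "__main__":
    # "checkpoints" is an assumed local path; adjust to your download folder.
    for bad in find_broken_configs("checkpoints"):
        print(f"Suspicious config: {bad}")
```

If any file is reported, downloading it again from the Hugging Face repository and comparing it with the copy in the demo Space is the safest next step.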

Hi. It takes around 3 seconds and 6GB of memory for 1 sample at 20 steps on our RTX 4090 GPU, so a 6GB RTX 3060 may not be quite enough.