
Integrate with ColossalAI for memory/speed efficiency and improvements

0xdevalias opened this issue 2 years ago • 9 comments

WHAT

Integrate with ColossalAI for memory/speed efficiency and improvements.

WHERE

  • https://github.com/hpcaitech/ColossalAI
    • Colossal-AI: A Unified Deep Learning System for Big Model Era

  • https://medium.com/@yangyou_berkeley/diffusion-pretraining-and-hardware-fine-tuning-can-be-almost-7x-cheaper-85e970fe207b

HOW

  • https://github.com/hpcaitech/ColossalAI/tree/main/examples/images/diffusion

WHEN

Whenever I get to it. Hold your horses... 👌🏻

SEE ALSO

  • https://github.com/huggingface/diffusers/issues/1212
  • https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/4606
  • https://github.com/hpcaitech/ColossalAI/issues/1935
  • https://github.com/TheLastBen/fast-stable-diffusion/issues/493
  • https://github.com/ShivamShrirao/diffusers/issues/129

0xdevalias commented Nov 14 '22 03:11

It looks like the ColossalAI dev team is looking to work directly with Hugging Face's diffusers to simplify integration, per this issue:

  • https://github.com/huggingface/diffusers/issues/1212

This is very cool @binmakeswell ! Would be very happy to integrate diffusers in colossalAI examples!

Originally posted by @patil-suraj in https://github.com/huggingface/diffusers/issues/1212#issuecomment-1310167948

Thanks for your enthusiastic response and we look forward to working together!

How do you expect it to be carried out? Take our collaboration with Lightning for example, we communicated in slack for details, provided documentation and eventually did a joint release.

Originally posted by @binmakeswell in https://github.com/huggingface/diffusers/issues/1212#issuecomment-1311220560

And there seems to be at least some initial/cursory interest on the issue I opened on the ColossalAI repo RE: integrating with dreambooth:

  • https://github.com/hpcaitech/ColossalAI/issues/1935

Hi, thanks for reaching out! I think it’s pretty awesome. Can I check it with our team and get back to you ASAP?

Originally posted by @Sze-qq in https://github.com/hpcaitech/ColossalAI/issues/1935#issuecomment-1313053594

So hopefully things rapidly move towards the 'easier to integrate'/'better documentation' direction of things!

Originally posted by @0xdevalias in https://github.com/ShivamShrirao/diffusers/issues/129#issuecomment-1313059637

0xdevalias commented Nov 14 '22 04:11

So, while I'm always hesitant to say "no" to a feature request...this is a BIG one.

Like, porting Shivam's repo isn't too tedious, because I'm mostly just exposing the args to the UI, adding some stuff to update the text, and tying in extraction and conversion.

This sounds a bit more complex than that. I'd basically have to go through the existing "train_dreambooth.py" script, rip out some big components, and then replace them with the optimized stuff from ColossalAI.

Still, I definitely see how it could be very useful to other people, so I'll say "Let's put a pin in this one for now." :P

d8ahazard commented Nov 14 '22 14:11

From my study, current training scripts fall into two categories. The first uses the stable diffusion repo, ldm, and pytorch-lightning, including the original stable diffusion training, pokemon diffusion, the old waifu diffusion, and ColossalAI. The second uses diffusers + accelerate, including ShivamShrirao's diffusers fork (which this extension is based on), kohya_ss, the new waifu diffusion, etc. I find it difficult to adapt tools used by one to the other; the routines are just too different. One thing to note is that the webui itself uses ldm, not diffusers, and integrating webui routines into this extension is already not easy. Since the current ColossalAI example uses the first pipeline, I don't expect this to land very soon unless someone who understands both stacks very well takes it on.

sgsdxzy commented Nov 15 '22 06:11
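
For reference, here is a minimal, self-contained sketch of the loop shape used by the second family mentioned above (diffusers + accelerate). The tiny UNet configuration, random "latents", and dummy text embeddings are illustrative placeholders only; they are not taken from this extension or from ShivamShrirao's script.

```python
# Rough sketch of a diffusers + accelerate training step (not this extension's code).
import torch
import torch.nn.functional as F
from accelerate import Accelerator
from diffusers import DDPMScheduler, UNet2DConditionModel

accelerator = Accelerator()

# Deliberately tiny UNet so the sketch runs anywhere; a real script would load
# the pretrained Stable Diffusion UNet instead.
unet = UNet2DConditionModel(
    sample_size=8,
    in_channels=4,
    out_channels=4,
    layers_per_block=1,
    block_out_channels=(32, 64),
    down_block_types=("DownBlock2D", "CrossAttnDownBlock2D"),
    up_block_types=("CrossAttnUpBlock2D", "UpBlock2D"),
    cross_attention_dim=32,
)
noise_scheduler = DDPMScheduler()
optimizer = torch.optim.AdamW(unet.parameters(), lr=5e-6)

# accelerate handles device placement, mixed precision, and (optionally)
# DeepSpeed/FSDP behind prepare() and backward().
unet, optimizer = accelerator.prepare(unet, optimizer)

for step in range(2):
    # Stand-ins for VAE latents and text-encoder hidden states.
    latents = torch.randn(2, 4, 8, 8, device=accelerator.device)
    encoder_hidden_states = torch.randn(2, 77, 32, device=accelerator.device)

    # Standard DDPM training step: add noise, predict it, regress with MSE.
    noise = torch.randn_like(latents)
    timesteps = torch.randint(
        0, noise_scheduler.config.num_train_timesteps,
        (latents.shape[0],), device=latents.device,
    ).long()
    noisy_latents = noise_scheduler.add_noise(latents, noise, timesteps)

    model_pred = unet(noisy_latents, timesteps, encoder_hidden_states).sample
    loss = F.mse_loss(model_pred.float(), noise.float())

    accelerator.backward(loss)
    optimizer.step()
    optimizer.zero_grad()
```

The ldm + pytorch-lightning family instead drives training through a LightningModule and Trainer, which is why tooling from one pipeline does not transfer cleanly to the other.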

From what I can tell, it's like DeepSpeed and does more or less the same thing, but it doesn't have the same level of documentation (none, last I checked). So it would be a painful process.

78Alpha commented Nov 16 '22 03:11

This issue is stale because it has been open 5 days with no activity. Remove the stale label or comment, or this will be closed in 5 days.

github-actions[bot] commented Nov 22 '22 00:11

This feels relevant to keep open even if it's not something that would immediately be worked on?

0xdevalias commented Nov 22 '22 02:11

Update: I just found that ColossalAI does NOT support Windows, so I doubt it will be of much use (Linux can already use DeepSpeed).

sgsdxzy commented Nov 22 '22 03:11

I guess the big question, though, is whether this is actually better/faster than DeepSpeed, which is what it seemed to be pitching.

0xdevalias commented Nov 22 '22 07:11

This issue is stale because it has been open 5 days with no activity. Remove the stale label or comment, or this will be closed in 5 days.

github-actions[bot] commented Nov 28 '22 00:11

Yeah, so, we've had a lot of improvements to VRAM and speed, but I don't see ColossalAI in the cards. As such, I'm gonna close this.

d8ahazard commented Jan 21 '23 23:01

Just adding a note: it looks like this was added to a few things.

https://github.com/huggingface/diffusers/issues/1212#issuecomment-1373183443

78Alpha commented Jan 22 '23 18:01

Any update @binmakeswell? It would be great to speed up the training process.

I found train_dreambooth.py and train_dreambooth_colossalai.py, which are based on diffusers.

Thank you for your interest in and contribution to ColossalAI; we have merged it into the Hugging Face diffusers library. https://github.com/huggingface/diffusers/tree/main/examples/research_projects/colossalai

Originally posted by @Fazziekey in https://github.com/huggingface/diffusers/issues/1212#issuecomment-1373183443

0xdevalias commented Jan 30 '23 10:01