Sana icon indicating copy to clipboard operation
Sana copied to clipboard

Pre-emptive Feature anticipation

Open TomLucidor opened this issue 1 year ago • 11 comments

Probably gonna shortlist some wonky idea, but hey if this tool will be workable anywhere it better be feature-full

  • [ ] Finetuning and LoRA (or other PEFT type) training toolkit https://github.com/Nerogar/OneTrainer
  • [ ] LoRA (or other PEFT type) and model merging (or extraction) https://github.com/hako-mikan/sd-webui-supermerger
  • [ ] Getting with X-Adapter in case people have LoRAs of other base models https://github.com/showlab/X-Adapter
  • [ ] CFG manipulation cus it's a thing https://github.com/Extraltodeus/ComfyUI-AutomaticCFG/discussions/53
  • [ ] InvokeAI and other inpainting tools (e.g. Krita) that allows for creative exploration https://github.com/invoke-ai/InvokeAI
  • [ ] Hacks like LCM, Lightning, and Turbo (yes imagine the 100x get another zero on top cus why not?)
  • [ ] ControlNet and other related tooling for allowing more flexibility
    • [ ] OpenPose and facial landmarks for character generation
    • [ ] depth maps and normal map for 3D-aided generation
    • [ ] line art or segmentation for illustrations and cartoons
  • [ ] LLM tooling for getting Sana to talk with other models that can possibly improve UX or help with finetuning

TomLucidor avatar Oct 23 '24 00:10 TomLucidor

Nice one. We will work on the function supporting works including the ones you listed above during this whole year. Also we will keep improving the quality of DC-AE and Sana, planning to release like v1.5 in the futher. Trying to make the high-compression tech popular.

lawrence-cj avatar Oct 24 '24 13:10 lawrence-cj

I need to add some features too as I had been using PixArt-Sigma and found some success with it.

  1. Achieve a similar midjourney-style aesthetic quality, currently found in PixArt-Sigma.
  2. Include image datasets that were used for training PixArt-Sigma model.

doogyhatts avatar Oct 24 '24 14:10 doogyhatts

Definitely, I am willing to integrate these features.

lawrence-cj avatar Oct 24 '24 15:10 lawrence-cj

@doogyhatts please do not have aesthetic locking, since coming from the SDXL derivative side of things, people have really put in effort to make everything aesthetically flexible.

TomLucidor avatar Oct 26 '24 00:10 TomLucidor

@doogyhatts please do not have aesthetic locking, since coming from the SDXL derivative side of things, people have really put in effort to make everything aesthetically flexible.

Simply have different base models, including one that does not have any style-specific datasets applied to it.

doogyhatts avatar Oct 26 '24 00:10 doogyhatts

@doogyhatts here is the thing tho... if there is a way to create a style embedding for all major implicit styles (looking at the current and future methodology of Pony), then the problem of style bias becomes trivial, and finetuning (or pruning) would be easier. LoRA extractors kind of do this already.

TomLucidor avatar Oct 30 '24 01:10 TomLucidor

my boss that me use this model  but no lora train, I no idea finish work.

Deng-Xian-Sheng avatar Dec 03 '24 10:12 Deng-Xian-Sheng

help me please.

Deng-Xian-Sheng avatar Dec 03 '24 10:12 Deng-Xian-Sheng

@lawrence-cj Advent calendar or bust, bro XD 🕙 but seriously make a timeline of this with conservative estimates would be sweet

TomLucidor avatar Dec 06 '24 11:12 TomLucidor

I'm not resting. You tell me the priority or do something to help, pls? @TomLucidor

https://github.com/city96/ComfyUI_ExtraModels/pull/84 https://github.com/huggingface/diffusers/pull/9982 https://github.com/huggingface/diffusers/pull/9708 BF16 model fine-tuning https://github.com/bghira/SimpleTuner/pull/1187

lawrence-cj avatar Dec 06 '24 12:12 lawrence-cj

LoRA is supported in diffusers refer to: https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/README_sana.md and Official diffusers docs: https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/README_sana.md

lawrence-cj avatar Dec 18 '24 07:12 lawrence-cj