sd-webui-incantations icon indicating copy to clipboard operation
sd-webui-incantations copied to clipboard

Saliency-adaptive noise fusion for PAG

Open v0xie opened this issue 9 months ago • 0 comments

Adds a new method of combining the guidance from PAG and CFG.

Derives from "High-fidelity Person-centric Subject-to-Image Synthesis": https://arxiv.org/abs/2311.10329

In the paper they are combining the guidance from two different models, so I thought we could apply that to PAG since it's doing pretty much the same thing.

A couple of examples:

High CFG scales: xyz_grid-0012-1 xyz_grid-0014-1

Greater than 512px for SD1.5: xyz_grid-0002-1

Normal CFG scale, high PAG scale: xyz_grid-0000-1

v0xie avatar May 20 '24 15:05 v0xie