Could you upload the transparent sample images?
As with the apple, could you upload transparent versions of the rest of the sample images? I'm reproducing some of the "sanity check" images, and besides the apple you only uploaded the checkerboard-pattern examples. They would be useful for comparison's sake.
Mine:
Yours:
The one that I did was a bit off, but still perfectly good. No rush, by the way. You did great work, and I imagine this will be immensely useful to many people. Can't wait for the full release!
I don't get exactly the same apple either, but it is very close. I cannot tell the difference by eye.
Settings:
- Prompt: an apple, high quality
- Negative prompt: bad, ugly
- Steps: 20
- Sampler: DPM++ 2M SDE Karras
- CFG scale: 5
- Seed: 12345
- Size: 1024x1024
- Model: juggernautXL_version6Rundiffusion.safetensors
- sha256: 1fe6c7ec54c786040cdabc7b4e89720069d97096922e20d01f13e7764412b47f
- layerdiffusion_enabled: True
- layerdiffusion_method: Only Generate Transparent Image (Attention Injection)
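To rule out a different checkpoint file as the cause, the sha256 listed in the settings can be compared against a local copy of the model. This is only a minimal sketch using Python's standard library; the helper names `sha256_of_file` and `matches_expected` are made up for illustration, and the path to the checkpoint on your machine is something you would have to supply yourself.

```python
import hashlib
from pathlib import Path

# Digest copied from the settings list in this thread.
EXPECTED_SHA256 = "1fe6c7ec54c786040cdabc7b4e89720069d97096922e20d01f13e7764412b47f"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's digest equals the expected hex string (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_expected(...)` with the location of your local `juggernautXL_version6Rundiffusion.safetensors` returns `True` only if the file is byte-identical to the one used here. A match does not guarantee identical output, since library versions and hardware can still differ.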
Assuming that your settings are identical, there might be a difference in library versions or hardware. My configuration:
- GPU: V100 (32G)
- Driver Version: 525.147.05
- CUDA Version: 12.0
- https://github.com/layerdiffusion/sd-forge-layerdiffusion commit: 7d4c1d4defa91ee40332db02943282131995357d
- https://github.com/lllyasviel/stable-diffusion-webui-forge commit: b9705c58f66c6fd2c4a0168b26c5cf1fa6c0dde3
- Python 3.11.4
Library versions from venv:
$ pip freeze
absl-py==2.1.0
accelerate==0.21.0
addict==2.4.0
aenum==3.1.15
aiofiles==23.2.1
aiohttp==3.9.3
aiosignal==1.3.1
altair==5.2.0
antlr4-python3-runtime==4.9.3
anyio==3.7.1
attrs==23.2.0
basicsr==1.4.2
blendmodes==2022
certifi==2024.2.2
cffi==1.16.0
chardet==5.2.0
charset-normalizer==3.3.2
clean-fid==0.1.35
click==8.1.7
clip @ https://github.com/openai/CLIP/archive/d50d76daa670286dd6cacf3bcd80b5e4823fc8e1.zip#sha256=b5842c25da441d6c581b53a5c60e0c2127ebafe0f746f8e15561a006c6c3be6a
coloredlogs==15.0.1
colorlog==6.8.2
contourpy==1.2.0
cssselect2==0.7.0
cycler==0.12.1
deprecation==2.1.0
depth_anything @ https://github.com/huchenlei/Depth-Anything/releases/download/v1.0.0/depth_anything-2024.1.22.0-py2.py3-none-any.whl#sha256=26c1d38b8c3c306b4a2197d725a4b989ff65f7ebcf4fb5a96a1b6db7fbd56780
diffusers==0.25.0
einops==0.4.1
embreex==2.17.7.post4
facexlib==0.3.0
fastapi==0.94.0
ffmpy==0.3.2
filelock==3.13.1
filterpy==1.4.5
flatbuffers==23.5.26
fonttools==4.49.0
frozenlist==1.4.1
fsspec==2024.2.0
ftfy==6.1.3
future==1.0.0
fvcore==0.1.5.post20221221
gitdb==4.0.11
GitPython==3.1.32
gradio==3.41.2
gradio_client==0.5.0
grpcio==1.62.0
h11==0.12.0
handrefinerportable @ https://github.com/huchenlei/HandRefinerPortable/releases/download/v1.0.1/handrefinerportable-2024.2.12.0-py2.py3-none-any.whl#sha256=1e6c702905919f4c49bcb2db7b20d334e8458a7555cd57630600584ec38ca6a9
httpcore==0.15.0
httpx==0.24.1
huggingface-hub==0.21.3
humanfriendly==10.0
idna==3.6
imageio==2.34.0
importlib-metadata==7.0.1
importlib_resources==6.1.2
inflection==0.5.1
iopath==0.1.9
jax==0.4.25
Jinja2==3.1.3
jsonmerge==1.8.0
jsonschema==4.21.1
jsonschema-specifications==2023.12.1
kiwisolver==1.4.5
kornia==0.6.7
lark==1.1.2
lazy_loader==0.3
lightning-utilities==0.10.1
llvmlite==0.42.0
lmdb==1.4.1
lxml==5.1.0
mapbox-earcut==1.0.1
Markdown==3.5.2
MarkupSafe==2.1.5
matplotlib==3.8.3
mediapipe==0.10.10
ml-dtypes==0.3.2
mpmath==1.3.0
multidict==6.0.5
networkx==3.2.1
numba==0.59.0
numpy==1.26.2
omegaconf==2.2.3
onnxruntime==1.17.1
open-clip-torch==2.20.0
opencv-contrib-python==4.9.0.80
opencv-python==4.9.0.80
opt-einsum==3.3.0
orjson==3.9.15
packaging==23.2
pandas==2.2.1
piexif==1.1.3
Pillow==9.5.0
platformdirs==4.2.0
portalocker==2.8.2
protobuf==3.20.0
psutil==5.9.5
pycollada==0.8
pycparser==2.21
pydantic==1.10.14
pydub==0.25.1
pyparsing==3.1.1
python-dateutil==2.9.0.post0
python-multipart==0.0.9
pytorch-lightning==1.9.4
pytz==2024.1
PyWavelets==1.5.0
PyYAML==6.0.1
referencing==0.33.0
regex==2023.12.25
reportlab==4.1.0
requests==2.31.0
resize-right==0.0.2
rpds-py==0.18.0
Rtree==1.2.0
safetensors==0.4.2
scikit-image==0.21.0
scipy==1.12.0
semantic-version==2.10.0
sentencepiece==0.2.0
shapely==2.0.3
six==1.16.0
smmap==5.0.1
sniffio==1.3.1
sounddevice==0.4.6
spandrel==0.1.6
starlette==0.26.1
svg.path==6.3
svglib==1.5.1
sympy==1.12
tabulate==0.9.0
tb-nightly==2.17.0a20240303
tensorboard-data-server==0.7.2
termcolor==2.4.0
tifffile==2024.2.12
timm==0.9.16
tinycss2==1.2.1
tokenizers==0.13.3
tomesd==0.1.3
tomli==2.0.1
toolz==0.12.1
torch==2.1.2+cu121
torchdiffeq==0.2.3
torchmetrics==1.3.1
torchsde==0.2.6
torchvision==0.16.2+cu121
tqdm==4.66.2
trampoline==0.1.2
transformers==4.30.2
trimesh==4.1.7
triton==2.1.0
typing_extensions==4.10.0
tzdata==2024.1
urllib3==2.2.1
uvicorn==0.27.1
vhacdx==0.0.5
wcwidth==0.2.13
webencodings==0.5.1
websockets==11.0.3
Werkzeug==3.0.1
xatlas==0.0.9
xxhash==3.4.1
yacs==0.1.8
yapf==0.40.2
yarl==1.9.4
zipp==3.17.0
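If you want to check your environment against the list above, a rough way is to save both `pip freeze` outputs as text and diff them by package. The sketch below is an illustration only: `parse_freeze` and `diff_freeze` are hypothetical helper names, and lines that are neither `name==version` nor `name @ url` (for example editable installs) are simply skipped.

```python
import re


def parse_freeze(text):
    """Map normalized package names to their version (or URL) from `pip freeze` text."""
    packages = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, comments, and a pasted shell prompt such as "$ pip freeze".
        if not line or line.startswith(("#", "$")):
            continue
        if " @ " in line:
            name, spec = line.split(" @ ", 1)
        elif "==" in line:
            name, spec = line.split("==", 1)
        else:
            continue  # unsupported line format, ignored in this sketch
        # Normalize names the way package indexes do: lowercase, runs of -_. become "-".
        key = re.sub(r"[-_.]+", "-", name.strip()).lower()
        packages[key] = spec.strip()
    return packages


def diff_freeze(mine, yours):
    """Return {package: (my_spec, your_spec)} for every package that differs."""
    a, b = parse_freeze(mine), parse_freeze(yours)
    return {
        name: (a.get(name), b.get(name))
        for name in sorted(set(a) | set(b))
        if a.get(name) != b.get(name)
    }
```

A package missing from one side shows up with `None` in that position. Differences in `torch`, `torchvision`, `torchsde`, or `diffusers` would be the first places I would look, though that is a guess and not something verified in this thread.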
Sample images:
woman, messy hair, high quality
a cup made of glass, high quality
glowing effect, book of magic, high quality
(Note: CFG 7, not 5 as before)