ComfyUI_ExtraModels icon indicating copy to clipboard operation
ComfyUI_ExtraModels copied to clipboard

Problem when loading SANA's VAE

Open frutiemax92 opened this issue 1 year ago • 3 comments

Downloaded the VAE from here: https://huggingface.co/mit-han-lab/dc-ae-f32c32-sana-1.0-diffusers/tree/main

I'm getting a black screen for now, but there seems to be many missing layers... This is on Windows 11.

got prompt
Missing VAE keys ['encoder.project_in.weight', 'encoder.project_in.bias', 'encoder.stages.0.0.conv1.conv.weight', 'encoder.stages.0.0.conv1.conv.bias', 'encoder.stages.0.0.conv2.conv.weight', 'encoder.stages.0.0.conv2.norm.weight', 'encoder.stages.0.0.conv2.norm.bias', 'encoder.stages.0.1.conv1.conv.weight', 'encoder.stages.0.1.conv1.conv.bias', 'encoder.stages.0.1.conv2.conv.weight', 'encoder.stages.0.1.conv2.norm.weight', 'encoder.stages.0.1.conv2.norm.bias', 'encoder.stages.0.2.main.weight', 'encoder.stages.0.2.main.bias', 'encoder.stages.1.0.conv1.conv.weight', 'encoder.stages.1.0.conv1.conv.bias', 'encoder.stages.1.0.conv2.conv.weight', 'encoder.stages.1.0.conv2.norm.weight', 'encoder.stages.1.0.conv2.norm.bias', 'encoder.stages.1.1.conv1.conv.weight', 'encoder.stages.1.1.conv1.conv.bias', 'encoder.stages.1.1.conv2.conv.weight', 'encoder.stages.1.1.conv2.norm.weight', 'encoder.stages.1.1.conv2.norm.bias', 'encoder.stages.1.

frutiemax92 avatar Dec 09 '24 21:12 frutiemax92

Hmm, looks like the repo with the separate VAE was (re)moved, and the one you link to is in the reference format (which doesn't have any conversion logic in place at the moment).

Could you check if this one works? I'll update the link in the readme if yes. https://huggingface.co/Efficient-Large-Model/Sana_1600M_1024px_diffusers/blob/main/vae/diffusion_pytorch_model.safetensors

city96 avatar Dec 09 '24 22:12 city96

This works with the model you provided. image

frutiemax92 avatar Dec 09 '24 22:12 frutiemax92

Thanks, updated the link in the readme.

city96 avatar Dec 09 '24 22:12 city96