stable-diffusion.cpp icon indicating copy to clipboard operation
stable-diffusion.cpp copied to clipboard

[Bug] causvid Lora for wan 2.1 1.3B doesn’t work

Open KintCark opened this issue 1 month ago • 3 comments

Git commit

I’m trying to use causvid Lora with wan 2.1 1.3B version the Lora loaded but it’s not working so I found another causvid Lora that’s not for ComfyUI but it won’t load please fix wan 2.1 Lora’s support causvid is the only way I can generate videos on my 8gb ram I can’t even do the required resolution so I’m trying to get a video at 320x256 resolution but all I get is deformed burnt videos txt 2 Img with wan works fine tho

Operating System & Version

Android Samsung s20+ sd865 adreno 650

GGML backends

CPU

Command-line arguments used

./sd -M vid_gen --diffusion-model /root/stable-diffusion.cpp/build/models/Phantom_Wan_1.3B-Q3_K_S.gguf --vae /root/stable-diffusion.cpp/build/models/wan_2.1_vae_fp8_e4m3fn.safetensors --t5xxl /root/stable-diffusion.cpp/build/models/umt5xxl-encoder-q2_k.gguf --lora-model-dir /root/stable-diffusion.cpp/build/models/ -s -1 --steps 8 --sampling-method euler --scheduler simple -W 384 -H 216 --fps 16 --video-frames 17 --clip-on-cpu -p "A sensual cyber-geisha with reflective armor, A dark fantasy concept art, matte painting, submerged in clear shallow water on a white sand beach, a single softbox studio light, high-key exposure, complex gear details, smoke and particle effects, crisp focus lora:wan-causvid-1.3b:1.0" -n "anime, cartoon, drawing, 3d render, cgi render, ai generated, ugly face" -o /root/stable-diffusion.cpp/build/videos/wan.avi --diffusion-fa --offload-to-cpu --threads 8 --cfg-scale 5.0 --flow-shift 5.0 && ffmpeg -i /root/stable-diffusion.cpp/build/videos/wan.avi video.mp4

Steps to reproduce

Put Lora in a custom folder then put Lora path into the commands then paste commands into termux everything loads fine but after looking at video causvid isn’t working just deformed burnt video

What you expected to happen

Get quality video

What actually happened

Deformed video

Logs / error messages / stack trace

No response

Additional context / environment details

No response

KintCark avatar Nov 13 '25 19:11 KintCark

That's because causVid is not just like most LoRAs that add new knowledge to the model, it fundamentally changes the way the model is supposed to be sampled, kinda like the LCM and DMD LoRAs that require using different sampling methods to work properly, but with an even more drastic change: the model becomes causal, meaning it can't process all the video at once anymore.

Supporting CausVid would mean adding support for a new kind of models, which makes this not a bug, but a missing feature.

stduhpf avatar Nov 13 '25 19:11 stduhpf

That's because causVid is not just like most LoRAs that add new knowledge to the model, it fundamentally changes the way the model is supposed to be sampled, kinda like the LCM and DMD LoRAs that require using different sampling methods to work properly, but with an even more drastic change: the model becomes causal, meaning it can't process all the video at once anymore.

Ohhh so which sampler scheduler is supported with causvid lora in sd.cpp?

KintCark avatar Nov 13 '25 19:11 KintCark

Ohhh so which sampler scheduler is supported with causvid lora in sd.cpp?

I think LCM would be the best if autoregressive video generation was supported, but since it's not implemented yet, you're out of luck.

Wan generates videos all at once, and CausVid is trained to generate images frames by frame (kinda). So it won't work for now.

stduhpf avatar Nov 13 '25 19:11 stduhpf