ComfyUI icon indicating copy to clipboard operation
ComfyUI copied to clipboard

HunyuanVideo-1.5 120 frames 12G oom

Open zwukong opened this issue 1 month ago • 11 comments

81 frames is ok, while 120 frames can not run properly. Vram can sometimes over max vram and then speed becomes very slow in 121 frams. wan2.2 is ok,but hunyuan not . 3.12,2.8

Image

zwukong avatar Nov 24 '25 05:11 zwukong

Vram can sometimes over max vram and then speed becomes very slow in 121 frams. wan2.2 is ok,but hunyuan not . 3.12,2.8

Hi. Can we get you hardware specs, and the video resolution of what you are generating. If you can attach a workflow that's ideal.

rattus128 avatar Nov 24 '25 10:11 rattus128

4070 12G, 480p, your example workflow ,hunyuanvideo1.5_480p_i2v_cfg_distilled_fp8_scaled ,8 steps

zwukong avatar Nov 24 '25 10:11 zwukong

4070 12G, 480p, your example workflow ,hunyuanvideo1.5_480p_i2v_cfg_distilled_fp8_scaled ,8 steps

Thanks. I'll give it a go on my 3060.

rattus128 avatar Nov 24 '25 10:11 rattus128

@zwukong no issues for me at 480p121f on the 3060 12G. Few things to try from here though.

I notice the VAE uses more VRAM than the Model and I got forced to go back to the tiled VAE. Can you confirm whether the KSampler or VAE Decode is the problem?

If its a VAE problem, try manually tiling. The workflow has an example VAE Decode tiled node, hook that up instead of the default VAE Decode.

If you are on Windows, can you change this setting to "Prefer no Sysmem Fallback":

Image

Run again, and if it is OOMing the way you describe, this hopefully forces it to crash with OOM report instead of run really slow. Then paste me the log and we can see where the OOM is happening.

This setting can also help the automatically tiler figure out it needs to tile.

For any OOMing run, can we get your load statistics? In the UI, expand the console and paste the content. It should read something like this:

got prompt
Requested to load CLIPVisionModelProjection
loaded completely; 3196.40 MB usable, 787.72 MB loaded, full load: True
Requested to load AutoencodingEngine
loaded partially; 1351.83 MB usable, 1351.83 MB loaded, 1052.61 MB offloaded, lowvram patches: 0
Requested to load HunyuanVideo15
loaded partially; 7301.02 MB usable, 7301.02 MB loaded, 639.97 MB offloaded, lowvram patches: 0
  0%|          | 0/20 [00:00<?, ?it/s]Interrupting prompt 030cdc54-0921-4c6f-a7b2-9bf18528090b
  0%|          | 0/20 [01:41<?, ?it/s]
Processing interrupted
Prompt executed in 106.10 seconds
got prompt
0 models unloaded.
loaded partially: 7300.95 MB loaded, lowvram patches: 0
 75%|███████▌  | 6/8 [06:15<02:05, 62.66s/it]

rattus128 avatar Nov 24 '25 11:11 rattus128

Image in my experience,if the total size over the max vram ,it will become slow.

zwukong avatar Nov 24 '25 13:11 zwukong

Image 4 steps cost 2 minutes, very slow

Image

zwukong avatar Nov 24 '25 13:11 zwukong

Image 4 steps cost 2 minutes, very slow

Image

You have lowvram patches on load. Do you have Loras?

You might have to send me the complete workflow as I have similar load numbers to you but I don't go over 11GB of VRAM.

rattus128 avatar Nov 24 '25 15:11 rattus128

vae decode is the problem

loaded partially; 128.00 MB usable, 127.84 MB loaded, 2276.62 MB offloaded, lowvram patches: 0 Warning: Ran out of memory when regular VAE decoding, retrying with tiled VAE decoding.

kanarch66 avatar Nov 25 '25 03:11 kanarch66

hunyuan_video .json what i mean is shared vram ,11G vram + 2G shared vram >12G, speed will become slow

zwukong avatar Nov 25 '25 03:11 zwukong

@rattus128 i think it is the same problem as flux2

zwukong avatar Nov 28 '25 09:11 zwukong

I hit the same problem during VAE decode on my 5090 if I try to run the native 1080p upscale workflow from the default templates.

rwfsmith avatar Dec 02 '25 00:12 rwfsmith