ControlNet
Low VRAM tests (8GB now OK, starting to solve 6GB)
RuntimeError: CUDA out of memory. Tried to allocate 58.00 MiB (GPU 0; 8.00 GiB total capacity; 7.14 GiB already allocated; 0 bytes free; 7.26 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
same here
I will try to add sliced attention this weekend (and perhaps copy some code from Automatic1111). Right now it seems the model OOMs on 8GB GPUs. See also https://github.com/CompVis/stable-diffusion/issues/39
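For context, "sliced attention" just computes attention in chunks over the query dimension so the full attention matrix never materializes at once. A minimal sketch of the idea (a hypothetical helper, not the repo's actual code):

```python
import torch

def sliced_attention(q, k, v, slice_size=1024):
    """Compute softmax(q @ k^T / sqrt(d)) @ v in query chunks.

    q, k, v: (batch, seq_len, dim), assumed to share the same shape.
    Only `slice_size` rows of the attention matrix exist in memory at
    any time, trading a little speed for a much smaller peak VRAM use.
    """
    scale = q.shape[-1] ** -0.5
    out = torch.empty_like(q)
    for i in range(0, q.shape[1], slice_size):
        chunk = q[:, i:i + slice_size]                               # (b, s, d)
        attn = torch.softmax(chunk @ k.transpose(1, 2) * scale, dim=-1)
        out[:, i:i + slice_size] = attn @ v                          # (b, s, d)
    return out
```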
Amazing work, very inspiring, and I can't wait to try this. For now, I just spent all day downloading and troubleshooting on my slow internet, only to hit this OOM. I'm on a 6GB 1660 Ti, so I have to run 1111 with --medvram --precision full --no-half... Any hope at all of this ever working for me, or should I write this one off? Any chance of a Colab notebook for the rest of us?
Low VRAM mode added.
https://github.com/lllyasviel/ControlNet/blob/main/docs/low_vram.md
Tested on several 8GB cards. Let's see if it works on 6GB.
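Per the linked doc, enabling it is a one-line edit before launching any of the gradio scripts:

```python
# config.py, in the repo root
save_memory = True  # default is False
```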
Great, thanks! I have a 12GB VRAM GPU; does it also work for training? Currently I can't train: even with a batch size of 1 I get OOM errors.
Will try. But it should perhaps work already, since the attention layers are sliced.
I also tried using xformers, but I kept having incompatibility issues.
It's not working on a 6GB GeForce RTX 3060.
OK we know. Let us solve 6GB now.
Same. Besides batch size, I also tried `accumulate_grad_batches` and `save_memory`, but still no luck training with 12GB VRAM. I also tried xformers, but I keep getting `No operator found for memory_efficient_attention_backward`.
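For reference, the training tutorial uses a PyTorch Lightning trainer, so gradient accumulation is just a `Trainer` argument. A minimal sketch (argument values are arbitrary examples); note that accumulation shrinks optimizer-side pressure but the activations of a single batch still have to fit, which is why it often doesn't cure OOM on its own:

```python
import pytorch_lightning as pl

# Effective batch size = 4 x batch_size, but peak activation memory
# per step is unchanged, so a batch that OOMs alone still OOMs here.
trainer = pl.Trainer(
    gpus=1,
    precision=16,                # half precision also shrinks activations
    accumulate_grad_batches=4,   # sum gradients over 4 batches per step
)
```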
> I also tried xformers, but I keep getting `No operator found for memory_efficient_attention_backward`.
~~I've seen this too, and I suspect it was a faulty installation in my case. I had PyTorch 1.13.1 from conda in a conda env, installed xformers via pip, and `memory_efficient_attention_backward` was the error. After replacing xformers with its conda release, the error was gone and I was back to OOM at the beginning of training (also on 12GB VRAM).~~ nvm, my conda xformers install had issues.
Hello. Great work with ControlNet. Any updates on 6GB VRAM solutions?
I still have this error. `save_memory = True` didn't help much:

RuntimeError: CUDA out of memory. Tried to allocate 290.00 MiB (GPU 0; 8.00 GiB total capacity; 6.29 GiB already allocated; 0 bytes free; 7.08 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
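The `max_split_size_mb` hint from the traceback can be tried by setting the allocator config before CUDA initializes; 128 here is just an example value:

```python
import os

# Must be set before torch makes its first CUDA allocation.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"

import torch  # import (and everything CUDA) only after setting the env var
```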
@My12123 Maybe you can set `save_memory = True` in `config.py` if you have not already. That makes at least the Canny model run on an 8GB GPU; I tried it with an RTX 3070.
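For anyone wondering what the flag buys: the general idea behind this kind of memory saving is to keep each submodule (text encoder, UNet, VAE) on the CPU except while it is actually running. A generic sketch of that pattern (illustrative only, not the repo's exact code):

```python
import torch

def on_gpu_only(module, fn, *inputs):
    """Run fn(*inputs) with `module` temporarily on the GPU, then move it
    back to the CPU and release cached blocks so the next stage has VRAM.

    Example (hypothetical names): images = on_gpu_only(vae, vae.decode, latents)
    """
    module.to("cuda")
    try:
        with torch.no_grad():
            return fn(*inputs)
    finally:
        module.to("cpu")
        torch.cuda.empty_cache()
```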
Also, you can install xformers with `pip install xformers`. It seems to work pretty well, saving a gigabyte or so of memory, and inference is twice as fast. If you install xformers, just make sure you have PyTorch 1.13.1 (the latest as of now).
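If you want to confirm the install actually has working kernels, a quick smoke test (shapes are arbitrary; `memory_efficient_attention` expects `(batch, seq_len, heads, head_dim)`):

```python
import torch
from xformers.ops import memory_efficient_attention

# fp16 tensors on CUDA: the common training configuration.
q = torch.randn(2, 1024, 8, 40, device="cuda", dtype=torch.float16, requires_grad=True)
k = torch.randn_like(q, requires_grad=True)
v = torch.randn_like(q, requires_grad=True)

out = memory_efficient_attention(q, k, v)
out.sum().backward()  # the backward pass is what trips "No operator found for memory_efficient_attention_backward"
print("xformers forward/backward OK")
```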
The version mentioned in `environment.yaml` is 1.12.1, though.

https://github.com/lllyasviel/ControlNet/blob/3e1340d83250c23e972f98900c764c36e5d7bd69/environment.yaml#L9
Yes. But it also works if we use the latest version of PyTorch along with the latest version of PyTorch Lightning.
I am not able to train the network, even using the save memory option. @sovit-123 can you help me?
@engrmusawarali Hi, even with the save memory option you need at least 18 GB of VRAM to train the model. That's what I found out when it was initially released. Not sure if that requirement has changed since then.
Have you tried small-scale training? If yes, how do I proceed with it? Can you guide me, @sovit-123?
@engrmusawarali I have not tried it yet. But planning to do it soon.
Training doesn't work on a 12GB VRAM RTX 3060.