Why does my training process get killed?
I run the following command:
python main.py --train --base configs/stableSRNew/v2-finetune_text_T_512.yaml --gpus GPU_ID, --name NAME --scale_lr False
and the process gets killed.
At first, I reduced batch_size and num_workers to 1, but it still does not work.
One possibility is the checkpoint: I use the ckpt from https://huggingface.co/stabilityai/stable-diffusion-x4-upscaler
I am not sure about that. Help!
I am back. I still cannot get it to run with the ckpt from https://huggingface.co/stabilityai/stable-diffusion-2-1-base, and this confuses me.
The most likely reason is that your CPU memory (RAM) is not enough.
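One way to test the memory hypothesis is to log the peak resident memory of the training process at a few points (for example before and after the checkpoint is loaded). The following is only a minimal sketch using the Python standard library on Linux or macOS; `peak_rss_mib` and `log_memory` are hypothetical helper names and are not part of this repository.

```python
import resource
import sys


def peak_rss_mib() -> float:
    """Return the peak resident set size of the current process in MiB."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


def log_memory(stage: str) -> float:
    """Print and return the peak RSS, tagged with a stage name (hypothetical helper)."""
    mib = peak_rss_mib()
    print(f"[mem] {stage}: peak RSS = {mib:.1f} MiB")
    return mib


# Example: call this before and after each suspicious step,
# such as model construction or checkpoint loading.
log_memory("startup")
```

If the last value printed before the process dies is close to the total RAM of the machine, that supports the out-of-memory explanation.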
@IceClear Thanks for your reply! Maybe that is the problem, but if I cannot change my CPU memory, is there an alternative?
Sorry. Currently I have no idea which part leads to the high memory cost, so there is no solution from my side.
Maybe you can check whether the model ckpt is loaded twice. In my current setting, the diffusion model and VQGAN are initialized from the same large ckpt, and I suspect that two copies of this large ckpt may exist in CPU RAM. I am not sure about that.
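A rough sketch of how the double-load suspicion could be checked, and avoided, is to route every checkpoint read through one small cache that counts real disk loads. Everything here is an assumption for illustration: `load_once`, `release`, `LOAD_COUNTS` and `pickle_loader` are hypothetical names, not functions from this repository. In the real training code the loader would presumably be something like `torch.load(path, map_location="cpu")`; a pickle file stands in for it so the sketch runs without PyTorch.

```python
import pickle
import tempfile
from pathlib import Path
from typing import Any, Callable, Dict

_CACHE: Dict[str, Any] = {}       # resolved path -> loaded object
LOAD_COUNTS: Dict[str, int] = {}  # resolved path -> number of real disk loads


def load_once(path: str, loader: Callable[[str], Any]) -> Any:
    """Load a checkpoint file only once and reuse the same object afterwards."""
    key = str(Path(path).resolve())
    if key not in _CACHE:
        LOAD_COUNTS[key] = LOAD_COUNTS.get(key, 0) + 1
        _CACHE[key] = loader(key)
    return _CACHE[key]


def release(path: str) -> None:
    """Drop the cached object so its memory can be freed once all models are built."""
    _CACHE.pop(str(Path(path).resolve()), None)


def pickle_loader(path: str) -> Any:
    """Stand-in loader for this sketch; a real setup would use its own loader."""
    with open(path, "rb") as f:
        return pickle.load(f)


# Demo with a tiny fake checkpoint written to a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    ckpt_path = str(Path(tmp) / "fake.ckpt")
    with open(ckpt_path, "wb") as f:
        pickle.dump({"state_dict": {"w": [1.0, 2.0]}}, f)

    first = load_once(ckpt_path, pickle_loader)   # e.g. for the diffusion model
    second = load_once(ckpt_path, pickle_loader)  # e.g. for the VQGAN
    print("same object:", first is second)
    print("disk loads:", list(LOAD_COUNTS.values()))
    release(ckpt_path)
```

If the count for the checkpoint path would be higher than one without the cache, two copies were being held in RAM at some point. Calling `release` after both models are initialized lets the shared dictionary be garbage collected.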
Thanks for your patient reply! I will check that soon!
Looking forward to your reply.
Sad, it doesn't work, but thanks for your contribution to this issue. I hope I can use it in the future! Have a nice day!