DeepFaceLab
DeepFaceLab copied to clipboard
I can't run seahd on the gpu! it only runs on the cpu
Initializing models: 80%|######################################## ######4 | 4/5 [00:28<00:07, 7.18s/it] Error: OOM when allocating tensor of shape [3,3,512,2048] and type float [[node src_dst_opt/vs_inter_B/upscale1/conv1/weight_0/Initializer/Const (defined at D:\deepfake\DeepFaceLab\DeepFaceLab_NVIDIA_up_to_RTX2080Ti_internal\DeepFaceLab\core\leras\optimizers\AdaBelief.py:38) ]]
Caused by op 'src_dst_opt/vs_inter_B/upscale1/conv1/weight_0/Initializer/Const', defined at:
File "threading.py", line 884, in bootstrap
File "threading.py", line 916, in bootstrap_inner
File "threading.py", line 864, in run
File "D:\deepfake\DeepFaceLab\DeepFaceLab_NVIDIA_up_to_RTX2080Ti_internal\DeepFaceLab\mainscripts\Trainer.py", line 58, in trainerThread
debug=debug)
File "D:\deepfake\DeepFaceLab\DeepFaceLab_NVIDIA_up_to_RTX2080Ti_internal\DeepFaceLab\models\ModelBase.py", line 193, in init
self.on_initialize()
File "D:\deepfake\DeepFaceLab\DeepFaceLab_NVIDIA_up_to_RTX2080Ti_internal\DeepFaceLab\models\Model_SAEHD\Model.py", line 341, in on_initialize
self.src_dst_opt.initialize_variables (self.src_dst_saveable_weights, vars_on_cpu=optimizer_vars_on_cpu, lr_dropout_on_cpu=self.options['lr_dropout']=='cpu')
File "D:\deepfake\DeepFaceLab\DeepFaceLab_NVIDIA_up_to_RTX2080Ti_internal\DeepFaceLab\core\leras\optimizers\AdaBelief.py", line 38, in initialize_variables
vs = { v.name : tf.get_variable ( f'vs{v.name}'.replace(':',''), v.shape, dtype=v.dtype, initializer=tf.initializers.constant( 0.0), trainable=False) for v in trainable_weights }
File "D:\deepfake\DeepFaceLab\DeepFaceLab_NVIDIA_up_to_RTX2080Ti_internal\DeepFaceLab\core\leras\optimizers\AdaBelief.py", line 38, in
============ Model Summary ============ == == == Model name: new_SAEHD == == == == Current iteration: 564 == == == ==---------- Model Options ----------== == == == resolution: 320 == == face_type: f == == models_opt_on_gpu: True == == archi: liae-ud == == ae_dims: 256 == == e_dims: 64 == == d_dims: 64 == == d_mask_dims: 22 == == masked_training: True == == eyes_mouth_prio: False == == uniform_yaw: False == == blur_out_mask: False == == adabelief: True == == lr_dropout: n == == random_warp: True == == random_hsv_power: 0.0 == == true_face_power: 0.0 == == face_style_power: 0.0 == == bg_style_power: 0.0 == == ct_mode: none == == clipgrad: False == == pretrain: False == == autobackup_hour: 0 == == write_preview_history: False == == target_iter: 3000000000 == == random_src_flip: False == == random_dst_flip: True == == batch_size: 4 == == gan_power: 0.0 == == gan_patch_size: 40 == == gan_dims: 16 == == == ==----------- Running On ------------== == == == Using device: CPU == == ==
I have same error, here is my "Post"
I'm not a pro, but try use the stander values during training...?
Or reinstall DFL.
I find a solution, you have to use your CPU for the first few interactions and then save and switch to GPU
Did you ever find the answer? If so, would you mind sharing it and closing this issue?