SimpleTuner
SimpleTuner copied to clipboard
Error on saving Flux lora, Deepspeed missing CUDA_HOME
Currently the trainer crashes when saving Flux lora checkpoints due to cuda home missing for the newer deepspeed. I'm on the latest main branch, with all the updated dependencies afaik.
Error:
raise MissingCUDAException("CUDA_HOME does not exist, unable to compile CUDA op(s)") deepspeed.ops.op_builder.builder.MissingCUDAException: CUDA_HOME does not exist, unable to compile CUDA op(s)