SimpleTuner icon indicating copy to clipboard operation
SimpleTuner copied to clipboard

Error on saving Flux lora, Deepspeed missing CUDA_HOME

Open TheDuckingDuck opened this issue 6 months ago • 3 comments

Currently the trainer crashes when saving Flux lora checkpoints due to cuda home missing for the newer deepspeed. I'm on the latest main branch, with all the updated dependencies afaik.

Error:

raise MissingCUDAException("CUDA_HOME does not exist, unable to compile CUDA op(s)") deepspeed.ops.op_builder.builder.MissingCUDAException: CUDA_HOME does not exist, unable to compile CUDA op(s)

TheDuckingDuck avatar Aug 07 '24 16:08 TheDuckingDuck