ai-toolkit icon indicating copy to clipboard operation
ai-toolkit copied to clipboard

How to change the base model of Qwen - Image for training?

Open bigtigercat opened this issue 2 months ago • 1 comments

When conducting LORA training for Qwen - Image, the transformer in Qwen - Image is a 9 - sharded model. Is there any way for me to perform the training using a single integrated model? I tried copying the file "qwen_image_fp8_e4m3fn.safetensors" to the transformer directory, but the training process threw an error immediately.

My original intention is to use a third - party fine - tuned model as the base model. However, all these third - party models seem to be provided as a single file rather than sharded files, which makes it impossible for me to replace the base model.

I cannot understand the relevant code. Will I have to modify the code myself to achieve this, or is there no other way around it?

bigtigercat avatar Nov 10 '25 09:11 bigtigercat

I have the same problem. Try to fill in E:\Qwen-image\transformer\qwen_image_fp8_e4m3fn.safetensors

zuohaoda avatar Dec 01 '25 05:12 zuohaoda