DiffSynth-Studio icon indicating copy to clipboard operation
DiffSynth-Studio copied to clipboard

Question about optimal hyper parameters for training Qwen-Image-Edit-2509

Open dylanzonix opened this issue 2 months ago • 8 comments

DiffSynth is great. I am trying to train Qwen-Image-Edit-2509. I have a dataset of 5.4k examples. I'm curious if full parameter fine-tuning could be feasible with this dataset size, and if so at what hyperparameters? I've tried 1e-5 and 5e-6 at a 128 effective batch size and the model degrades when I evaluate it.

Base model: Image

Step 1000 after training (clearly degraded in quality): Image

The loss barely budges: Image

The next thing I'll try is LoRA, but I'm just confused how I couldn't get the training to have any positive impact on the model at any point across a variety of training runs. Curious if anyone, besides the Qwen team, knows how to successfully fine-tune this model. Thanks!

dylanzonix avatar Oct 15 '25 00:10 dylanzonix

@dylanzonix We have also observed similar issues: the Qwen-Image-Edit and Qwen-Image-Edit-2509 models are difficult to fine-tune, despite following exactly the same mathematical principles as Qwen-Image. If we find a better improvement approach, we will update the code accordingly.

Artiprocher avatar Oct 20 '25 03:10 Artiprocher

@dylanzonix We have also observed similar issues: the Qwen-Image-Edit and Qwen-Image-Edit-2509 models are difficult to fine-tune, despite following exactly the same mathematical principles as Qwen-Image. If we find a better improvement approach, we will update the code accordingly.

Thanks a lot. I am still actively working on this as well and can give updates from my end as well. I'm also happy to share with you the dataset I am using if you are interested. It's custom made exactly for Qwen-Image-Edit-2509

dylanzonix avatar Oct 20 '25 04:10 dylanzonix

I also encounter the same problem: the loss intend to oscillation instead of converging when fine tuning qwen-image-edit and qwen-image-edit-2509 with lora.

yangguoquan001 avatar Oct 23 '25 04:10 yangguoquan001

same question , is there any solutions?

clytze0216 avatar Nov 03 '25 06:11 clytze0216

same question

lanson07 avatar Nov 04 '25 02:11 lanson07

同样的问题

iva-jhsun avatar Nov 05 '25 03:11 iva-jhsun

Training the LoRa model for Qwen Image Edit 2509 using ModelScope significantly reduced the loss. However, training with DiffSyhtn resulted in a persistently oscillating loss.

DreamOneYou avatar Nov 05 '25 10:11 DreamOneYou

Training the LoRa model for Qwen Image Edit 2509 using ModelScope significantly reduced the loss. However, training with DiffSyhtn resulted in a persistently oscillating loss.

Interesting. You think the issue could be with the DiffSynth library's implementation?

dylanzonix avatar Nov 06 '25 18:11 dylanzonix

Training the LoRa model for Qwen Image Edit 2509 using ModelScope significantly reduced the loss. However, training with DiffSyhtn resulted in a persistently oscillating loss.

Interesting. You think the issue could be with the DiffSynth library's implementation?

Has the problem been solved?

Deng-Xian-Sheng avatar Dec 02 '25 09:12 Deng-Xian-Sheng