I have published a very detailed Qwen Image models training video for average technical people on Windows

Open FurkanGozukara opened this issue 3 months ago • 3 comments

I hope you find it useful and share the video. You can do LoRA training and full Fine-Tuning on Windows with GPUs that have as little as 6 GB of VRAM, with reasonable training times (we have configs for 50, 100, and 200 epochs).

Full tutorial link > https://www.youtube.com/watch?v=DPX3eBTuO_Y

Qwen Image Models Training - 0 to Hero Level Tutorial - LoRA & Fine Tuning - Base & Edit Model

This is a comprehensive step-by-step tutorial on how to train Qwen Image models. It covers both LoRA training and full Fine-Tuning / DreamBooth training, for both the Qwen Image base model and the Qwen Image Edit Plus 2509 model. This tutorial is the product of 21 days of full R&D, costing over $800 in cloud services to find the best configurations for training. Furthermore, we have developed an ultra-easy-to-use Gradio app for running the Kohya Musubi Tuner trainer. You will be able to train locally on your Windows computer with GPUs that have as little as 6 GB of VRAM, for both LoRA and Fine-Tuning.

A few example images of character training, style training (GTA5 artworks), and product training (perfume):

[11 example images]

FurkanGozukara avatar Nov 03 '25 08:11 FurkanGozukara

@FurkanGozukara Hello. We have also released videos about Qwen-Image (https://www.bilibili.com/video/BV1ApYrzsEMT). The video features three presenters: the first is the original author of the Qwen-Image model, the second is me, and the third is the author of the Qinglong trainer. We will continue advancing technologies related to diffusion models.

Artiprocher avatar Nov 04 '25 10:11 Artiprocher

I would like to watch it.

Could you add manual English subtitles?

FurkanGozukara avatar Nov 04 '25 14:11 FurkanGozukara

I shared it on Bilibili too, @Artiprocher:

https://www.bilibili.com/video/BV1YC1YBwEse/

FurkanGozukara avatar Nov 04 '25 14:11 FurkanGozukara