
Enjoy the magic of Diffusion models!

Results: 380 DiffSynth-Studio issues
Sort by: recently updated

I saw in the Qwen-Image-Edit-2509 report that it supports ControlNet-style conditioning images as input for control. Can I set `extra_inputs` in your LoRA training script to the masks we provide, so that training gives a ControlNet + LoRA effect?

```
accelerate launch examples/qwen_image/model_training/train.py \
  --dataset_base_path data/example_image_dataset \
  --dataset_metadata_path data/example_image_dataset/metadata_qwen_imgae_edit_multi.json \
  --data_file_keys "image,mask_image,edit_image" \
  --extra_inputs "mask_image,edit_image" \
  --max_pixels 1048576 \
  --dataset_repeat 50 \
  --model_id_with_origin_paths "Qwen/Qwen-Image-Edit-2509:transformer/diffusion_pytorch_model*.safetensors,Qwen/Qwen-Image:text_encoder/model*.safetensors,Qwen/Qwen-Image:vae/diffusion_pytorch_model.safetensors" \
  --learning_rate 1e-4 \
  ...
```

I use the following code for single-image inference with Qwen-Image:

```
import glob
from diffsynth.pipelines.qwen_image import QwenImagePipeline, ModelConfig
from PIL import Image
import torch
import os

os.environ["PYTHONBREAKPOINT"] = "0"

pipe = QwenImagePipeline.from_pretrained(
    torch_dtype=torch.bfloat16,
    device="cuda",
    model_configs=[
        ModelConfig(path=[
            '/mnt/dolphinfs/hdd_pool/docker/user/hadoop-automaterials/yongzhao41/insert-anything/checkpoints/Qwen-Image/text_encoder/model-00001-of-00004.safetensors',
            '/mnt/dolphinfs/hdd_pool/docker/user/hadoop-automaterials/yongzhao41/insert-anything/checkpoints/Qwen-Image/text_encoder/model-00002-of-00004.safetensors',
            '/mnt/dolphinfs/hdd_pool/docker/user/hadoop-automaterials/yongzhao41/insert-anything/checkpoints/Qwen-Image/text_encoder/model-00003-of-00004.safetensors',
            '/mnt/dolphinfs/hdd_pool/docker/user/hadoop-automaterials/yongzhao41/insert-anything/checkpoints/Qwen-Image/text_encoder/model-00004-of-00004.safetensors']),
...
```

Hi @Artiprocher, thanks for your awesome project. I am trying to run video-generation inference from control-video and reference-image inputs with [Wan2.1-Fun-1.3B-Control](https://github.com/modelscope/DiffSynth-Studio/blob/main/examples/wanvideo/model_inference/Wan2.1-Fun-1.3B-Control.py). However, I hit this error...

Hi, Krea Realtime is an autoregressive model with causal attention. Does DiffSynth implement causal attention?
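For context on what the question is asking for: causal attention restricts each token to attend only to itself and earlier positions. A minimal NumPy sketch of the mechanism (illustrative only, not DiffSynth's or Krea's implementation; the function name `causal_attention` is made up for this example):

```python
import numpy as np

def causal_attention(q, k, v):
    """Scaled dot-product attention with a causal (lower-triangular) mask.

    Token i may only attend to tokens j <= i, as in autoregressive models.
    q, k, v: arrays of shape (seq_len, dim).
    """
    t, d = q.shape
    scores = q @ k.T / np.sqrt(d)
    # Mask out future positions (strictly above the diagonal).
    future = np.triu(np.ones((t, t), dtype=bool), k=1)
    scores[future] = -np.inf
    # Row-wise softmax; exp(-inf) = 0, so masked positions get zero weight.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v
```

Because position 0 can only attend to itself, its output is exactly `v[0]`; that is the property an autoregressive decoder relies on at generation time.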

Hi, when I use DiffSynth to run inference with Wan2.2 Animate, if the input video shape is not 1280 (h) × 720 (w) I get the error below:

```
File "./DiffSynth-Studio/diffsynth/models/wan_video_animate_adapter.py", line 643,...
```
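As a possible workaround (an assumption on my part, not an official fix), you could resize each input frame to the expected 720 (w) × 1280 (h) before passing the video to the pipeline. A Pillow sketch that scales to cover the target and center-crops, so frames are not distorted (the helper name `fit_frame` is hypothetical):

```python
from PIL import Image

def fit_frame(frame: Image.Image, width: int = 720, height: int = 1280) -> Image.Image:
    """Scale a frame to cover width x height, then center-crop to that size."""
    # Scale factor that makes the frame at least as large as the target in both axes.
    scale = max(width / frame.width, height / frame.height)
    resized = frame.resize(
        (round(frame.width * scale), round(frame.height * scale)),
        Image.LANCZOS,
    )
    # Center-crop to the exact target resolution.
    left = (resized.width - width) // 2
    top = (resized.height - height) // 2
    return resized.crop((left, top, left + width, top + height))
```

Applying this to every frame before inference should give the 1280 × 720 (h × w) shape the adapter apparently expects.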

I hope you find it useful and share the video. You can do LoRA training and full fine-tuning with as little as 6 GB of GPU memory on Windows, with reasonable...