DiffSynth-Studio issues

OOM when LORA training Wan2.2-I2V-A14B with 2-H20

Hi all, I’m encountering OOM issues when using LoRA to finetune Wan2.2-I2V-A14B with the script `lora/Wan2.2-I2V-A14B.sh` on 2×80 GB H20 GPUs. Is this expected? I noticed in the documentation that...

ruolinsss

inquiry for details of wan2.2 i2v lora training?

2

Hi, DiffSynth-Studio team. I am recently playing with wan2.2 i2v training and found that the `token replace` style i2v training is newly adopted in wan2.2. The `input_latents`' is first initialized...

shinxg

Will qwen_image_edit Support Batch Inference and DPO Training?

4

I tried to implement DPO training using LoRA with qwen_image_edit , but it seems that the current code does not support batch inference and also has high GPU memory usage,...

huxian0402

nunchaku可以无缝集成吗

1

如题，请问我可以直接用diffsynth-studio训练出来的模型与nunchaku量化的基模进行集成吗

hnsywangxin

Run Wan2.1-VACE-14B failed when set vace_video_mask

3

Run Wan2.1-VACE-14B failed when have `vace_video_mask`. error log: ```python3 Downloading Model from https://www.modelscope.cn to directory: DiffSynth-Studio-main/models/Wan-AI/Wan2.1-T2V-1.3B 2025-06-27 09:18:01,016 - modelscope - INFO - Target directory already exists, skipping creation. VAE...

LiuXiaolong19920720

fsdp with wan2.2

2

[rank1]: File "/home/lcq/lijiarui/DiffSynth-Studio/examples/wanvideo/model_training/train.py", line 165, in [rank1]: launch_training_task( [rank1]: File "/home/lcq/lijiarui/DiffSynth-Studio/diffsynth/trainers/utils.py", line 491, in launch_training_task [rank1]: loss = model(data=None, inputs=data) [rank1]: File "/root/miniconda3/envs/py310/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl [rank1]: return self._call_impl(*args,...

mountain-lee1

为什么max_timestep_boundary和switch_DiT_boundary不同？

您好，我想请问一下，在训练时high noise和low noise模型使用不同的采样时间边界，为什么推理时switch_DiT_boundary不沿用0.358这个边界而是新值0.875呢？

Kkkkkz21

关于 Wan2.1-Fun-V1.1-14B-Control-Camera 的 LoRA 训练

2

您好，非常感谢你们的杰出工作我是一名初学者，基于 Wan2.1-Fun-V1.1-14B-Control-Camera 训练了一个 LoRA ，但lora_base_model是dit，请问 LoRA 是否包含 Camera Adapter 和 Camera Encoder 这两个模块？没包含的话，对lora的相机控制效果是否会有影响？我是否需要将这两个新增模块的参数纳入 LoRA 的优化列表？

CrescentVelvet

qwen-image-edit-2509训练lora有测试用的example数据集吗

2

example/qwen_image里的lora训练脚本，数据集是data/example_image_dataset，这个数据集只能自己准备吗，还是官方有稳定可以复现的数据集供下载

huzjkevin

wan-animated推理训练数据长度问题

3

1. 为什么Wan-animated推理代码和训练代码中的conds的长度都为生成视频长度-4？ https://github.com/modelscope/DiffSynth-Studio/blob/0a1c172a00fb2dd76abedd3b066ddbf62bd4a60d/examples/wanvideo/model_training/validate_lora/Wan2.2-Animate-14B.py#L21 https://github.com/modelscope/DiffSynth-Studio/blob/0a1c172a00fb2dd76abedd3b066ddbf62bd4a60d/diffsynth/pipelines/wan_video_new.py#L1072 2. Wan-animated中训练和推理均没有实现论文中的overlap策略？

Guan-chen-lu

DiffSynth-Studio
DiffSynth-Studio copied to clipboard

Metadata

OOM when LORA training Wan2.2-I2V-A14B with 2-H20

inquiry for details of wan2.2 i2v lora training?

Will qwen_image_edit Support Batch Inference and DPO Training?

nunchaku可以无缝集成吗

Run Wan2.1-VACE-14B failed when set vace_video_mask

fsdp with wan2.2

为什么max_timestep_boundary和switch_DiT_boundary不同？

关于 Wan2.1-Fun-V1.1-14B-Control-Camera 的 LoRA 训练

qwen-image-edit-2509训练lora有测试用的example数据集吗

wan-animated推理训练数据长度问题

← Metadata

Owner

Metadata

DiffSynth-Studio DiffSynth-Studio copied to clipboard

Metadata

← Metadata

Owner

Metadata

DiffSynth-Studio
DiffSynth-Studio copied to clipboard