parryppp comments

Results 6 comments of


                                            parryppp

Ref img for background

@Z-YuPeng can we build some relationship between the background to the noised img, for example, during the denoising process, a model can identify the background tokens?

what's the purpose of bool_matrix1024, bool_matrix4096 in the return of cal_attn_mask_xl function

The reshaped attention mask is shown above. Do you mean that, for example, if i want to generate 4 consistent images, the yellow zone in the attention map would not...

what's the purpose of bool_matrix1024, bool_matrix4096 in the return of cal_attn_mask_xl function

Thank you for your explanation, I now understand much more clearly. But I still have a question about the shape of attention mask. why does the attention mask ensure that...

Temporal MultiDiffusion doesn't work

> We haven't tried the method used in AVID but we found slidding windows works well. would you like to share the details of slidding windows? for example, given a...

除了tenporal latents, env latents似乎也都没有集成？ > [@Artiprocher](https://github.com/Artiprocher) 能否简单说明集成temporal latents后的主要问题，我按照wan2.2官方的推理代码中的实现(https://github.com/Wan-Video/Wan2.2/blob/990af50de458c19590c245151197326e208d7191/wan/animate.py#L522) 将y变量变为参考+时序+生成片段的拼接，在推理结果上出现明显噪声。感谢

why drop last 4 frames in wan-animate training?

@zwplus thank you for your reply. I'm a bit confused — after this change, does it still end up with a latent of 81-frame input video matching against a latent...

parryppp

Ref img for background

what's the purpose of bool_matrix1024, bool_matrix4096 in the return of cal_attn_mask_xl function

what's the purpose of bool_matrix1024, bool_matrix4096 in the return of cal_attn_mask_xl function

Temporal MultiDiffusion doesn't work

wan-animated推理训练数据长度问题

why drop last 4 frames in wan-animate training?