animate-anything icon indicating copy to clipboard operation
animate-anything copied to clipboard

how to control motion magnitude

Open hw-liang opened this issue 10 months ago • 1 comments

In the train.py file, I noticed for motion control, the motion magnitudes are computed in both RGB(batch["motion"]) and latent space (latent_motion = calculate_latent_motion_score(latents)), but only the latter is used in unet prediction. Could you explain why not use the former one but only use the latter one?

hw-liang avatar Apr 21 '24 16:04 hw-liang

In consideration of efficiency, if we opt to employ motion representation in the RGB format, it will be required to transform predict_x0 back into the RGB space in the line of code motion_loss = F.mse_loss(latent_motion, calculate_latent_motion_score(predict_x0)).

sculmh avatar Apr 28 '24 07:04 sculmh