EchoMimic
EchoMimic copied to clipboard
About Conditions for Inference and Training
hello, I have a question, were the audio-driven and audio + pose-driven models trained separately? Is the weak condition for audio training audio and face mask?