AniPortrait icon indicating copy to clipboard operation
AniPortrait copied to clipboard

How long before we can get to this level

Open gibsonhu123 opened this issue 10 months ago • 7 comments

https://www.microsoft.com/en-us/research/project/vasa-1/

gibsonhu123 avatar Apr 19 '24 00:04 gibsonhu123

Just wait until they release new weights and then compare again.

MisterT96 avatar Apr 20 '24 15:04 MisterT96

https://www.microsoft.com/en-us/research/project/vasa-1/

I've made some videos that look better than those VASA-1 examples, but I'm also looking forward to the release of the pre-trained audio model, especially since M$ won't release vasa-1 for some time, if ever.

gessyoo avatar Apr 20 '24 18:04 gessyoo

https://www.microsoft.com/en-us/research/project/vasa-1/

I've made some videos that look better than those VASA-1 examples, but I'm also looking forward to the release of the pre-trained audio model, especially since M$ won't release vasa-1 for some time, if ever.

How and proof video would be nice. We are really interested in your progress.

MisterT96 avatar Apr 21 '24 10:04 MisterT96

https://www.microsoft.com/en-us/research/project/vasa-1/

I've made some videos that look better than those VASA-1 examples, but I'm also looking forward to the release of the pre-trained audio model, especially since M$ won't release vasa-1 for some time, if ever.

How and proof video would be nice. We are really interested in your progress.

I will post a few examples on social media, (https://www.instagram.com/p/C6Krt38ydtG/), and you are welcome to include them here as examples, as long as you attribute the source. The "how," at least until the audio model is released, is a matter of choosing an appropriate video and photo. The illusion breaks down if the head movement is too rapid or the angle is too extreme.

gessyoo avatar Apr 21 '24 17:04 gessyoo

https://www.microsoft.com/en-us/research/project/vasa-1/

I've made some videos that look better than those VASA-1 examples, but I'm also looking forward to the release of the pre-trained audio model, especially since M$ won't release vasa-1 for some time, if ever.

How and proof video would be nice. We are really interested in your progress.

I will post a few examples on social media, (https://www.instagram.com/reel/C6CdHDTOtFW/), and you are welcome to include them here as examples, as long as you attribute the source. The "how," at least until the audio model is released, is a matter of choosing an appropriate video and photo. The illusion breaks down if the head movement is too rapid or the angle is too extreme.

Could you provide a better video the instagram reel is partially cutoff

gibsonhu123 avatar Apr 23 '24 16:04 gibsonhu123

https://www.microsoft.com/en-us/research/project/vasa-1/

I've made some videos that look better than those VASA-1 examples, but I'm also looking forward to the release of the pre-trained audio model, especially since M$ won't release vasa-1 for some time, if ever.

How and proof video would be nice. We are really interested in your progress.

I will post a few examples on social media, (https://www.instagram.com/reel/C6CdHDTOtFW/), and you are welcome to include them here as examples, as long as you attribute the source. The "how," at least until the audio model is released, is a matter of choosing an appropriate video and photo. The illusion breaks down if the head movement is too rapid or the angle is too extreme.

This is quite impressive, thank you very much!

MisterT96 avatar Apr 23 '24 20:04 MisterT96

https://www.microsoft.com/en-us/research/project/vasa-1/

I've made some videos that look better than those VASA-1 examples, but I'm also looking forward to the release of the pre-trained audio model, especially since M$ won't release vasa-1 for some time, if ever.

How and proof video would be nice. We are really interested in your progress.

I will post a few examples on social media, (https://www.instagram.com/p/C5UbGIuPh7G), and you are welcome to include them here as examples, as long as you attribute the source. The "how," at least until the audio model is released, is a matter of choosing an appropriate video and photo. The illusion breaks down if the head movement is too rapid or the angle is too extreme.

This is quite impressive, thank you very much!

Here's another, feel free to use it as an example if you want: https://www.youtube.com/shorts/lgnfAuh5wBY

gessyoo avatar May 04 '24 18:05 gessyoo