ruanjiyang
ruanjiyang
> Hi, the [1, 512]-shaped latent corresponds to the original W space where [1, 14, 512]-shaped latent corresponds to the expanded W+ space. W+ space uses different w vectors for...
got it. thanks. can I only pass the latent codes to generator? I mean don't pass the learnt noises.
> Hello. There is a mobile app called Prisma. They do this, and allow people to add style to their photos. I was wondering if you are associated with the...
you should use Linux shell command like: python xxxxxxx ; python xxxxxxx; python xxxxxxx
you can see, the eyes looks very strange, and the speaking lips also looks very strange.
> @ruanjiyang Hi, > > 1. It seems the eyes are not well learned. In this case, you could try to fix the eye movement using `--fix_eye 0.25`. > 2....
I have tried to use Chinese version wav2vec2, see the following line: parser.add_argument('--model', type=str, default='ydshieh/wav2vec2-large-xlsr-53-chinese-zh-cn-gpt') and I found the audio_dim for this model is 21128, which is much large than...