TriAAN-VC icon indicating copy to clipboard operation
TriAAN-VC copied to clipboard

A question about speaker encoder

Open bigdan12 opened this issue 1 year ago • 1 comments

Hi, why not add speaker classification in speaker encoder, or use Speaker Verification feature. If I only use a speaker encoder, will there be any problems with timbral coupling?

bigdan12 avatar Feb 06 '24 09:02 bigdan12

Hi, I think it's ok since the speaker encoder indirectly learns to extract speaker identity. I tried other features such as wav2vec2.0, but it was less effective than CPC features. I think using SV features for the speaker encoder can be effective, but the auxiliary task (classification) was not meaningful in my case.

winddori2002 avatar Feb 07 '24 05:02 winddori2002