
perceptual similarity implementation

Open · Raha304 opened this issue 2 years ago · 1 comment

Hi, dear Li-Wei

I am working on dysarthric voice conversion and I want to use the idea from your paper "Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech". I looked at your repo for this paper, but I don't understand which part of the code implements the perceptual similarity. I would be very thankful if you could help me with this. Please!

Raha304, Oct 27 '22 12:10

Hi,

Sorry for the late reply. It's been years since I last visited this codebase. If you are referring to Eq. 4 in the paper, I think it's at https://github.com/b04901014/ISGAN/blob/master/src/model.py#L176, which simply calculates the L1 distance between the discriminator's layer activations for the speech segments.
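
To make the idea concrete, here is a minimal sketch of an L1 distance over discriminator layer activations. It is not the code from `model.py`: the function names `layer_l1` and `perceptual_l1`, the choice to average within each layer and then sum over layers, and the toy activation values are all assumptions for illustration. Plain Python lists stand in for the framework tensors the repo would use.

```python
def layer_l1(a, b):
    """Mean absolute difference between two flattened activation vectors."""
    if len(a) != len(b):
        raise ValueError("activation shapes must match")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def perceptual_l1(feats_a, feats_b):
    """Sum of per-layer L1 distances between two lists of layer activations.

    Assumption: each layer is averaged, then layers are summed. The paper's
    Eq. 4 or the repo may weight or reduce the layers differently.
    """
    if len(feats_a) != len(feats_b):
        raise ValueError("both inputs must have the same number of layers")
    return sum(layer_l1(a, b) for a, b in zip(feats_a, feats_b))


# Hypothetical activations of two discriminator layers for two speech segments.
feats_real = [[0.0, 1.0, 2.0, 3.0], [1.0, -1.0]]
feats_fake = [[0.5, 1.0, 1.0, 3.0], [1.0, 1.0]]

loss = perceptual_l1(feats_real, feats_fake)
print(loss)
```

In a real training loop the two activation lists would come from running the discriminator on the two speech segments and collecting its intermediate layer outputs, and the result would be added to the generator loss.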

b04901014, Nov 03 '22 20:11