DialogWAE

Wasserstein distance between prior and posterior

Open sadegh-aa opened this issue 6 years ago • 3 comments

Hi,

I'm trying to find the part of the code that attempts to compute the Wasserstein distance between prior and posterior (as in Eq. 5 in your ICLR paper), but couldn't find it. Would you please point to the part of the code for this distance?

Moreover, I found that the latent variables are computed directly from the model (e.g., a fully connected layer) rather than predicting \mu and \sigma and then sampling from that distribution, as stated in Eq. 3 and Eq. 4. Would you please clarify this?

Thanks

sadegh-aa avatar May 19 '19 10:05 sadegh-aa

The Wasserstein distance is minimized implicitly by training a WGAN on the prior and posterior samples, so it is not computed in closed form anywhere in the code. The latent variable is sampled implicitly with the reparameterization trick.
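As a rough illustration of what "implicitly" means here (this is not code from the repository, just a standard-library sketch): for any 1-Lipschitz critic, the gap between the mean critic score on posterior samples and on prior samples is a lower bound on the Wasserstein-1 distance, and a WGAN trains the critic to tighten that bound. The names `critic_gap` and `exact_w1_1d` are hypothetical, and the 1-D Gaussians stand in for the prior and posterior latent distributions.

```python
import random


def critic_gap(critic, posterior, prior):
    """Mean critic score on posterior samples minus mean score on prior samples."""
    post_mean = sum(critic(z) for z in posterior) / len(posterior)
    prior_mean = sum(critic(z) for z in prior) / len(prior)
    return post_mean - prior_mean


def exact_w1_1d(xs, ys):
    """Exact Wasserstein-1 distance between two equal-sized 1-D samples.

    In one dimension the optimal transport plan matches sorted samples.
    """
    assert len(xs) == len(ys)
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(a - b) for a, b in pairs) / len(xs)


rng = random.Random(0)
# Stand-ins for latent samples (assumption: 1-D Gaussians for illustration).
posterior = [rng.gauss(1.0, 1.0) for _ in range(1000)]
prior = [rng.gauss(0.0, 1.0) for _ in range(1000)]


def identity_critic(z):
    """A fixed 1-Lipschitz critic; a WGAN would learn this function instead."""
    return z


gap = critic_gap(identity_critic, posterior, prior)
w1 = exact_w1_1d(posterior, prior)
print(f"critic gap = {gap:.3f}, exact W1 = {w1:.3f}")
```

In a WGAN the critic is a neural network trained to maximize this gap under a Lipschitz constraint, while the generators are trained to shrink it, which is how the distance gets minimized without ever being evaluated exactly.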

guxd avatar May 20 '19 01:05 guxd

If the latent variable is sampled with the reparameterization trick, there should be a mu, a sigma, and an epsilon, with latent = mu + sigma \odot epsilon.

But, as far as I understood, your model directly generates a latent variable. Is that correct?
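To make the question concrete, the sampling step I would expect looks roughly like the sketch below. It uses only the standard library, and the names `reparameterize`, `mu`, and `log_var` are mine, not identifiers from the repository; I am also assuming the network predicts a log-variance rather than sigma directly.

```python
import math
import random


def reparameterize(mu, log_var, rng):
    """Sample latent = mu + sigma * epsilon, elementwise, with epsilon ~ N(0, 1).

    mu and log_var are lists of floats of equal length (assumed to be the
    outputs of the recognition or prior network).
    """
    sigma = [math.exp(0.5 * lv) for lv in log_var]
    epsilon = [rng.gauss(0.0, 1.0) for _ in mu]
    return [m + s * e for m, s, e in zip(mu, sigma, epsilon)]


rng = random.Random(0)
mu = [0.5, -1.0, 2.0]
log_var = [0.0, -2.0, 0.5]
latent = reparameterize(mu, log_var, rng)
print(latent)
```

The randomness lives entirely in epsilon, so gradients can flow through mu and sigma. A layer that maps its input straight to the latent vector, with no noise term, would be deterministic instead.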

sadegh-aa avatar May 20 '19 04:05 sadegh-aa

https://github.com/guxd/DialogWAE/blob/29f206af05bfe5fe28fec4448e208310a7c9258d/modules.py#L186

guxd avatar May 20 '19 04:05 guxd