
Implementation of Information Bottleneck with Mutual Information Neural Estimation (MINE)

Mutual Information Neural Estimation

This repository contains a PyTorch implementation of the Information Bottleneck (IB) using Mutual Information Neural Estimation (MINE) [Belghazi et al., 2018].

A standard baseline MLP (as described in the Deep Variational Information Bottleneck paper [Alemi et al., 2017]) is used for comparison.
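
For background, MINE estimates a lower bound on mutual information via the Donsker-Varadhan representation, I(X;Z) >= E_P[T(x,z)] - log E_{P(x)P(z)}[exp(T(x,z))], where T is a learned statistics network. The snippet below is a minimal illustrative sketch of that estimator; the class name `StatisticsNetwork` and the layer sizes are assumptions for illustration, not this repo's actual modules:

    import math

    import torch
    import torch.nn as nn

    class StatisticsNetwork(nn.Module):
        """Illustrative statistics network T(x, z) for the DV bound."""
        def __init__(self, x_dim, z_dim, hidden=100):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(x_dim + z_dim, hidden),
                nn.ReLU(),
                nn.Linear(hidden, 1),
            )

        def forward(self, x, z):
            return self.net(torch.cat([x, z], dim=1))

    def mine_lower_bound(T, x, z):
        """DV bound: I(X;Z) >= E_P[T] - log E_{PxP}[exp(T)]."""
        joint_term = T(x, z).mean()
        # Marginal samples: shuffle z within the batch to break the pairing.
        z_shuffled = z[torch.randperm(z.size(0))]
        marginal_scores = T(x, z_shuffled).squeeze(1)
        # log-mean-exp of the marginal scores, computed stably via logsumexp.
        marginal_term = torch.logsumexp(marginal_scores, dim=0) - math.log(z.size(0))
        return joint_term - marginal_term

Maximizing this bound over the parameters of T tightens the MI estimate; Belghazi et al. additionally correct the bias in the gradient of the log term using a moving average of its denominator.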

Setup

git clone https://github.com/mohith-sakthivel/mine-pytorch.git mine
cd mine

conda env create -f environment.yml
conda activate mine

Run

  • To run the baseline model with default parameters:

    python3 -m mine.ib --deter
    

    The baseline model is a standard MLP with 3 hidden layers and ReLU non-linearities. During training, an exponentially weighted moving average of the model parameters is maintained, and these averaged parameters are used at test time (a minimal sketch of this averaging follows the list).

  • To run the MINE+IB model:

    python3 -m mine.ib --mine
    

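The exponentially weighted parameter averaging mentioned in the baseline description can be sketched as follows; the decay of 0.999 is an illustrative assumption, not necessarily this repo's setting:

    import copy

    import torch

    @torch.no_grad()
    def update_ema(ema_model, model, decay=0.999):
        """Update the exponentially weighted average of the parameters."""
        for p_ema, p in zip(ema_model.parameters(), model.parameters()):
            p_ema.mul_(decay).add_(p, alpha=1.0 - decay)

    # Usage: keep a frozen copy of the network and update it each step.
    # ema_model = copy.deepcopy(model)
    # for batch in loader:
    #     ...                       # ordinary training step on `model`
    #     update_ema(ema_model, model)
    # # evaluate with `ema_model` at test time
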
Note

This repo contains an implementation of MINE for information minimization only. For information maximization, adaptive gradient clipping should also be incorporated, as mentioned in Belghazi et al. This is because MI is unbounded in typical high-dimensional use cases, so gradients from the MI estimate can overwhelm gradients from the primary loss (see the sketch below).
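
As a hedged sketch of how such clipping can look (the function name and the exact scaling rule here are illustrative assumptions, not code from the paper or this repo): rescale the MI-term gradient so its norm cannot exceed the norm of the primary-loss gradient before the two are combined.

    import torch

    def combine_with_clipped_mi_grad(params, primary_loss, mi_term):
        """Rescale the MI-term gradient so it cannot dominate the update.

        `mi_term` is the signed MI contribution to the total loss; this
        scaling rule is an illustrative assumption.
        """
        g_primary = torch.autograd.grad(primary_loss, params, retain_graph=True)
        g_mi = torch.autograd.grad(mi_term, params, retain_graph=True)
        norm_primary = torch.sqrt(sum((g ** 2).sum() for g in g_primary))
        norm_mi = torch.sqrt(sum((g ** 2).sum() for g in g_mi))
        # Clip: never let the MI gradient norm exceed the primary one.
        scale = torch.clamp(norm_primary / (norm_mi + 1e-8), max=1.0)
        for p, gp, gm in zip(params, g_primary, g_mi):
            p.grad = gp + scale * gm  # optimizer.step() then uses p.grad

    # Usage (hypothetical):
    # params = list(model.parameters())
    # combine_with_clipped_mi_grad(params, task_loss, mi_term)
    # optimizer.step()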

References

  1. M I Belghazi, A Baratin, S Rajeswar, S Ozair, Y Bengio, A Courville, R D Hjelm - MINE: Mutual Information Neural Estimation, ICML, 2018.

  2. A A Alemi, I Fischer, J V Dillon, K Murphy - Deep Variational Information Bottleneck, ICLR, 2017.