DFC-VAE
Variational Autoencoder trained with Feature Perceptual Loss
This is the code for the paper
Deep Feature Consistent Variational Autoencoder
The paper trains a Variational Autoencoder (VAE) model for face image generation. In addition, it provides a method to manipulate facial attributes by using attribute-specific vectors.
- Pretrained model trained on CelebA dataset
- Code for training on GPU
- Code for different analyses
Installation
Our implementation is based on Torch and requires several dependencies.
After installing Torch according to this tutorial, use the following commands to install the dependencies:
sudo apt-get install libprotobuf-dev protobuf-compiler
luarocks install loadcaffe
luarocks install https://raw.githubusercontent.com/szym/display/master/display-scm-0.rockspec
luarocks install nngraph
sudo apt-get install libmatio2
luarocks install matio
luarocks install manifold
sudo apt-get install libatlas3-base # for manifold
We use an NVIDIA GPU for training and testing, so you also need to install the GPU-related packages:
luarocks install cutorch
luarocks install cunn
luarocks install cudnn
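To verify that the GPU packages were installed correctly, you can run a quick sanity check with th. This snippet is only illustrative and is not part of the repository:

-- save as check_gpu.lua and run with: th check_gpu.lua
require 'cutorch'
require 'cunn'
print('CUDA devices found: ' .. cutorch.getDeviceCount())
-- move a small tensor and layer to the GPU as a smoke test
local x = torch.randn(4, 3):cuda()
local layer = nn.Linear(3, 2):cuda()
print(layer:forward(x):size())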
Dataset preparation
cd celebA
Download img_align_celeba.zip from http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html under the link "Align&Cropped Images".
Download list_attr_celeba.txt for annotation.
unzip img_align_celeba.zip; cd ..
DATA_ROOT=celebA th data/crop_celebA.lua
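crop_celebA.lua preprocesses the aligned CelebA faces before training. Below is a minimal sketch of this kind of preprocessing, assuming a center crop followed by resizing to 64x64; the crop size, directory names and output resolution are assumptions, and the real script reads DATA_ROOT:

require 'image'
require 'paths'

-- center-crop the aligned CelebA faces and resize them for training
local srcDir, dstDir = 'celebA/img_align_celeba', 'celebA/images'
paths.mkdir(dstDir)

for file in paths.files(srcDir) do
  if file:match('%.jpg$') then
    local img = image.load(paths.concat(srcDir, file))
    local cropped = image.crop(img, 'c', 148, 148)  -- 'c' = center crop (size assumed)
    local resized = image.scale(cropped, 64, 64)
    image.save(paths.concat(dstDir, file), resized)
  end
end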
We need a pretrained VGG-19 to compute the feature perceptual loss.
cd data/pretrained && bash download_models.sh && cd ../..
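VGG-19 is only used as a fixed feature extractor: the feature perceptual loss compares the VGG activations of the input and of the reconstruction at the relu1_1, relu2_1 and relu3_1 layers instead of raw pixels. Below is a minimal sketch of the idea; the model file names and layer indices are assumptions, and the actual loss is implemented in main_cvae_content.lua:

require 'nn'
require 'loadcaffe'

-- load the fixed, pretrained VGG-19 (file names are assumptions; see data/pretrained)
local vgg = loadcaffe.load('data/pretrained/VGG_ILSVRC_19_layers_deploy.prototxt',
                           'data/pretrained/VGG_ILSVRC_19_layers.caffemodel', 'nn')

-- keep the layers up to relu3_1; indices below are illustrative, check print(vgg)
local featureNet = nn.Sequential()
for i = 1, 12 do featureNet:add(vgg:get(i)) end
local reluIdx = {2, 7, 12}  -- assumed positions of relu1_1, relu2_1, relu3_1

local mse = nn.MSECriterion()

-- feature perceptual loss between an input batch x and its reconstruction xHat
local function featureLoss(x, xHat)
  featureNet:forward(x)
  local targets = {}
  for i, idx in ipairs(reluIdx) do targets[i] = featureNet:get(idx).output:clone() end
  featureNet:forward(xHat)
  local loss = 0
  for i, idx in ipairs(reluIdx) do
    loss = loss + mse:forward(featureNet:get(idx).output, targets[i])
  end
  return loss
end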
Training
Open a new terminal and start the server for displaying images in the browser:
th -ldisplay.start 8000 0.0.0.0
The images can be seen at http://localhost:8000 in the browser
Training with feature perceptual loss.
DATA_ROOT=celebA th main_cvae_content.lua
Training with pixel-by-pixel loss.
DATA_ROOT=celebA th main_cvae.lua
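Both scripts train the same VAE: the encoder predicts a mean and a log-variance, a latent code is drawn with the reparameterization trick, and a KL divergence term pulls the posterior towards N(0, I); only the reconstruction term differs (VGG features vs. raw pixels). Below is a minimal sketch of the shared pieces; the function names are illustrative and not taken from the repository:

require 'torch'

-- reparameterization trick: z = mu + sigma .* eps, with eps ~ N(0, I)
local function sampleLatent(mu, logVar)
  local std = torch.exp(logVar * 0.5)
  local eps = std:clone():normal()          -- same shape and type as std
  return mu + torch.cmul(std, eps)
end

-- KL divergence between q(z|x) = N(mu, sigma^2) and the prior N(0, I),
-- summed over latent dimensions and averaged over the mini-batch
local function klLoss(mu, logVar)
  local kl = -0.5 * torch.sum(logVar + 1 - torch.pow(mu, 2) - torch.exp(logVar))
  return kl / mu:size(1)
end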
Generate face images using pretrained Encoder and Decoder
Pretrained model
mkdir checkpoints; cd checkpoints
We provide both the Encoder and the Decoder:
cvae_content_123_encoder.t7 and cvae_content_123_decoder.t7, trained with the relu1_1, relu2_1 and relu3_1 layers of VGG-19.
Reconstruction with CelebA dataset:
DATA_ROOT=celebA reconstruction=1 th generate.lua
Face images randomly generated from latent variables:
DATA_ROOT=celebA reconstruction=0 th generate.lua
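Under the hood, generate.lua loads the checkpoints with torch.load and, for random generation, simply decodes latent vectors sampled from the prior. Below is a minimal sketch; the 100-dimensional latent size is an assumption, check the checkpoint:

require 'cunn'
require 'image'

local decoder = torch.load('checkpoints/cvae_content_123_decoder.t7')
decoder:evaluate()

-- sample latent variables from the prior N(0, I) and decode them into faces
local nSamples, nz = 16, 100   -- nz is an assumption
local z = torch.randn(nSamples, nz):cuda()
local faces = decoder:forward(z)

image.save('random_faces.png', image.toDisplayTensor{input = faces:float(), nrow = 4})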
The following are some examples:

Linear interpolation between two face images
th linear_walk_two_images.lua
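A minimal sketch of what the linear walk does: encode two faces, interpolate linearly between their latent codes, and decode every intermediate point. The image paths and the {mean, log-variance} encoder output format below are assumptions:

require 'cunn'
require 'image'

local encoder = torch.load('checkpoints/cvae_content_123_encoder.t7')
local decoder = torch.load('checkpoints/cvae_content_123_decoder.t7')
encoder:evaluate(); decoder:evaluate()

-- encode two preprocessed 64x64 faces; the encoder is assumed to return {mean, logVar}
local x1 = image.load('face1.png'):cuda():view(1, 3, 64, 64)
local x2 = image.load('face2.png'):cuda():view(1, 3, 64, 64)
local z1 = encoder:forward(x1)[1]:clone()
local z2 = encoder:forward(x2)[1]:clone()

-- walk linearly from z1 to z2 and decode each step
local frames = {}
for t = 0, 1, 0.1 do
  table.insert(frames, decoder:forward(z1 * (1 - t) + z2 * t):float())
end
image.save('interpolation.png', image.toDisplayTensor{input = torch.cat(frames, 1), nrow = #frames})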

Facial attribute manipulation
First, preprocess the celebA dataset annotations to split the dataset into two parts for each attribute, indicating whether the images contain the specific attribute or not.
cd celebA
python get_binary_attr.py
# should generate file: 'celebA/attr_binary_list.txt'
cd ..
th linear_walk_attribute_vector.lua
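The attribute-specific vector is the difference between the mean latent code of faces that have an attribute and the mean latent code of faces that do not; adding a scaled version of it to a face's latent code and decoding performs the manipulation. Below is a minimal sketch with illustrative function names, where zPos and zNeg are assumed to be matrices of latent codes computed with the encoder following the split in attr_binary_list.txt:

require 'nn'

-- attribute-specific vector: mean latent code with the attribute minus mean without it
local function attributeVector(zPos, zNeg)   -- zPos, zNeg: (N x nz) latent codes
  return zPos:mean(1) - zNeg:mean(1)
end

-- add the attribute vector to a single latent code (1 x nz) and decode;
-- alpha controls the strength of the manipulation (negative alpha removes the attribute)
local function manipulate(decoder, z, attrVec, alpha)
  return decoder:forward(z + attrVec * alpha)
end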
Here are some examples:

Better face attribute manipulation by incorporating GAN
This extension is described in the paper "Improving Variational Autoencoder with Deep Feature Consistent and Generative Adversarial Training" (see the citation below).
Credits
The code is based on neural-style, dcgan.torch, VAE-Torch and texture_nets.
Citation
If you find this code useful for your research, please cite:
@inproceedings{hou2017deep,
  title={Deep Feature Consistent Variational Autoencoder},
  author={Hou, Xianxu and Shen, Linlin and Sun, Ke and Qiu, Guoping},
  booktitle={Applications of Computer Vision (WACV), 2017 IEEE Winter Conference on},
  pages={1133--1141},
  year={2017},
  organization={IEEE}
}
@article{hou2019improving,
  title={Improving Variational Autoencoder with Deep Feature Consistent and Generative Adversarial Training},
  author={Hou, Xianxu and Sun, Ke and Shen, Linlin and Qiu, Guoping},
  journal={Neurocomputing},
  volume={341},
  pages={183--194},
  year={2019},
  publisher={Elsevier}
}