

Inference Suboptimality in Variational Autoencoders

This repository contains code for a project undertaken as part of the Advanced Topics in Machine Learning course (HT 2020) at Oxford. The code here was written by Mikhail Andrenkov, Maxence Draguet, Sebastian Lee, and Diane Magnin.

It is a reproduction of the code for the paper Inference Suboptimality in Variational Autoencoders by Cremer, Li & Duvenaud.

Our code contains the following features relevant to replicating results from the paper:

  • Flexible encoder/decoder architectures
  • Various approximate posteriors including:
    • Factorised Gaussian
    • R-NVP flows
    • R-NVP flows with auxiliary variables
  • Relevant binarised image datasets including:
    • MNIST
    • Fashion-MNIST
    • CIFAR-10
  • Local optimisation training loop
  • AIS and IWAE log-likelihood estimators

Additionally, we implemented a planar-flow approximate posterior.
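As a sketch of how the IWAE estimator listed above works (illustrative code, not the repository's implementation): the K-sample bound averages the importance weights w_k = p(x, z_k) / q(z_k | x) inside the log.

```python
import math

def iwae_bound(log_weights):
    """IWAE log-likelihood bound from K importance weights.

    log_weights[k] = log p(x, z_k) - log q(z_k | x) for samples z_k ~ q(z | x).
    Returns log( (1/K) * sum_k exp(log_weights[k]) ), computed stably.
    """
    k = len(log_weights)
    m = max(log_weights)  # log-sum-exp trick for numerical stability
    return m + math.log(sum(math.exp(lw - m) for lw in log_weights)) - math.log(k)
```

With K = 1 this reduces to the standard single-sample ELBO estimate; the bound tightens as K grows.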

Prerequisites

To run this code you will need the following:

  • Python 3.7+

Our code uses PyTorch. We include a requirements file (requirements.txt). We recommend creating a virtual environment (using conda or virtualenv) for this code base, e.g.

python3 -m venv aml; source aml/bin/activate

From there, all Python prerequisites should be satisfied by running

pip3 install -r requirements.txt

On Windows, running experiments with a GPU requires Python 3.7.5. Our code is compatible with CUDA 10.1.

Datasets

We do not provide the datasets directly in this repository. However, we use modified versions of standard datasets (e.g. MNIST, CIFAR-10) that can be loaded with the torchvision datasets module. To retrieve the datasets and apply the requisite modifications (the binarisation specified by Larochelle et al.), run:

python data/get_datasets.py
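For reference, the binarisation itself can be sketched as follows (an illustrative stand-in for what the script produces; the function name and exact scheme here are ours, not the repository's):

```python
import numpy as np

def binarise(images, rng=None, stochastic=True):
    """Binarise grayscale images with intensities in [0, 1].

    stochastic=True samples each pixel as Bernoulli(intensity), in the
    spirit of the binarisation of Larochelle et al.; stochastic=False
    thresholds at 0.5 instead.
    """
    images = np.asarray(images, dtype=np.float64)
    if stochastic:
        rng = rng or np.random.default_rng(0)
        return (rng.random(images.shape) < images).astype(np.float64)
    return (images > 0.5).astype(np.float64)
```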

Running Code

Standalone experiments can be run from the experiments folder using the main.py script. An experiment is configured via the base_config.yaml file for its general attributes, as well as specific config files in the additional_configs/ folder (e.g. for setting the parameters of a flow module).
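The layering of the general and specific config files can be pictured as a recursive dictionary merge (a sketch of the idea only; the repository's actual merge logic may differ):

```python
def merge_configs(base, *overrides):
    """Recursively merge override dicts into a base config dict.

    Later overrides win; nested dicts are merged key by key. This mirrors
    layering base_config.yaml with files from additional_configs/.
    """
    merged = dict(base)
    for override in overrides:
        for key, value in override.items():
            if isinstance(value, dict) and isinstance(merged.get(key), dict):
                merged[key] = merge_configs(merged[key], value)
            else:
                merged[key] = value
    return merged
```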

Running a specific experiment from the paper can be done via the relevant hard-coded configuration files in the experiment_list folder, which match the specifications of the paper. For example, to reproduce the configuration of a fully-factorised Gaussian approximate posterior with an amortised inference network (𝓛(VAE[q]) | qFFG from Table 2 in the paper), run from the experiments folder:

python main.py -config experiment_list/expA/base_config.yaml -additional_configs experiment_list/expA/additional_configs/

Alternatively, all results from a given experiment can be run at once in sequence using the bash script in the relevant experiment folder.

Accessing Experimental Results

Results of an experiment are saved by default in experiments/results/X/ where X is a timestamp for the experiment. There you will find a copy of the configuration used to run that experiment, a .csv file logging relevant metrics (e.g. train/test loss), and TensorBoard event files. To view the TensorBoard logs, navigate to this folder and run:

tensorboard --logdir .

Alternatively, run the command from elsewhere and modify the path accordingly. Plots of an experiment run can also be made by running the plot_from_df.py script from the experiments/plotting folder, passing the path of the folder containing the .csv file to the -save_path flag.
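The logged .csv can also be inspected without the plotting script, e.g. with a small parser (the column names below are illustrative, not the repository's actual headers):

```python
import csv
import io

def read_metrics(csv_text):
    """Parse a metrics CSV into a dict mapping column name -> list of floats."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return {name: [float(row[name]) for row in rows] for name in rows[0]}

# Hypothetical two-step log.
log = "step,train_loss,test_loss\n0,120.4,121.0\n1,98.2,99.5\n"
metrics = read_metrics(log)
```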

Weights of the models trained in a given experiment are also saved by default in experiments/saved_models/Y/X/ where Y is a hash of the configuration file and X is a timestamp for the experiment. Saved models can be loaded (e.g. to run local optimisation) by specifying the saved model path in the base config. Note that these are saved weights, not full checkpoints, so they cannot be used to resume training.
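Loading such weights follows the usual PyTorch state_dict pattern, sketched here with a toy module (TinyEncoder is a hypothetical stand-in; the real model classes live in models/):

```python
import torch
import torch.nn as nn

# Toy stand-in for one of the repository's models (hypothetical class).
class TinyEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(4, 2)

model = TinyEncoder()
torch.save(model.state_dict(), "weights.pt")        # weights only, as in saved_models/

restored = TinyEncoder()
restored.load_state_dict(torch.load("weights.pt"))  # no optimiser state: cannot resume training
restored.eval()                                     # ready for evaluation / local optimisation
```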

Code Structure

Below is the structure of the relevant files in our repository.

│
├── requirements.txt
├── README.md
│
├── data
│
├── experiments
│    │
│    ├── additional_configs
│    │   ├── aux_flow_config.yaml
│    │   ├── esimator_config.yaml
│    │   ├── flow_config.yaml
│    │   ├── local_optimisation_config.yaml
│    │   └── planar_config.yaml
│    │
│    ├── experiment_list (bash scripts for paper experiments)
│    │   ├── expA
│    │   ├── expB
│    │   ├── expC
│    │   ├── expD
│    │   └── expE
│    │
│    ├── plotting
│    │   ├── plot_config.json
│    │   └── plot_from_df.py
│    │
│    ├── results
│    │   └── result files (not tracked/committed)
│    │
│    ├── saved_models
│    │   └── saved_model files (not tracked/committed)
│    │
│    ├── base_config.yaml
│    ├── context.py
│    └── main.py
│
├── models
│    │
│    ├── approximate_posteriors
│    │   ├── __init__.py
│    │   ├── base_approximate_posterior.py
│    │   ├── base_norm_flow.py
│    │   ├── gaussian.py
│    │   ├── planar_flow.py
│    │   ├── rnvp_aux_flow.py
│    │   ├── rnvp_flow.py
│    │   └── sylv_flow.py
│    │
│    ├── likelihood_estimators
│    │   ├── __init__.py
│    │   ├── ais_estimator.py
│    │   ├── base_estimator.py
│    │   ├── iwae_estimator.py
│    │   └── max_estimator.py
│    │
│    ├── local_optimisation_modules
│    │   ├── __init__.py
│    │   ├── base_local_optimisation.py
│    │   ├── gaussian_local_optimisation.py
│    │   ├── rnvp_aux_flow_local_optimisation.py
│    │   └── rnvp_flow_local_optimisation.py
│    │
│    ├── loss_modules
│    │   ├── __init__.py
│    │   ├── base_loss.py
│    │   ├── gaussian_loss.py
│    │   ├── planar_loss.py
│    │   ├── rnvp_aux_loss.py
│    │   └── rnvp_loss.py
│    │
│    ├── networks
│    │   ├── __init__.py
│    │   ├── base_network.py
│    │   ├── convolutional.py
│    │   ├── deconvolutional.py
│    │   ├── fc_encoder.py
│    │   └── fc_decoder.py
│    │
│    ├── decoder.py
│    ├── encoder.py
│    ├── vae_runner.py
│    └── vae.py
│
└── utils
     │
     ├── __init__.py
     ├── custom_torch_transforms.py
     ├── dataloaders.py
     ├── math_operations.py
     ├── parameters.py
     ├── torch_operations_test.py
     └── torch_operations.py
