Git Product home page Git Product logo

tarmac's Introduction

tarmac

Tools for (quickly) visualizing markov chain samples.

Installation

Examples:

Installation

To install, use pip:

pip install tarmac

Prerequisites:

  • numpy
  • matplotlib

Examples

Simple sampling of a gaussian distribution

Let's use emcee to sample from a gaussian distribution, and then use tarmac to see what the Markov chains look like. First, some standard imports:

import numpy as np
import matplotlib.pyplot as plt
import emcee
import tarmac

Let's define the log-probability gaussian distribution, which is needed by emcee. For simplicity, we'll just use a distribution with mean μ = 1 and standard deviation σ = 1:

def ln_gaussian(x):
    μ, σ = 0, 1
    return -np.sqrt(2*np.pi*σ**2) + (-(x-μ)**2/(2*σ**2))

Sample from this distribution using emcee:

nwalkers = 20
ndim = 1
samples = 3000
init_walker_pos = np.random.rand(nwalkers, ndim)

sampler = emcee.EnsembleSampler(nwalkers=nwalkers, dim=ndim, lnpostfn=ln_gaussian)
sampler.run_mcmc(init_walker_pos, N=samples)

Now sampler.chain contains the Markov chains which sample from the distribution; this object is an array of shape (nwalkers, nsamples, ndim). Let's plot the resulting distribution using tarmac:

fig = plt.figure()
tarmac.corner_plot(fig, chain, labels=['$x_1$'])

gaussian_corner

Seems great! It looks like emcee is sampling from the distribution as intended. What about the convergence of the walkers? That's easy to see with tarmac.walker_trace:

fig = plt.figure()
tarmac.walker_trace(fig, chain, labels=['$x_1$'])

tarmac.walker_trace plots the value of each walker at each step. Here, we used 20 walkers, so tarmac.walker_trace produces a plot with 20 lines:

gaussian_trace

The walkers have all converged to the gaussian mean, and are sampling from the distribution in the region of parameter space with most probability. Excellent!

Sampling the product of two gaussian distributions

Now for something slightly more complicated. Let's sample from a bi-gaussian. Define the log-probability and sample from it using emcee, just as before. To make things a bit trickier, let's use a product of two gaussians with means (μ1, μ2) = (1, -13) and standard deviations (σ1, σ2) = (1, 0.3):

def ln_doublegaussian(X):
    x1, x2 = X
    μ1, σ1 = 1, 1
    μ2, σ2 = -1, 0.3
    return -np.sqrt(2*np.pi*σ1**2) + (-(x1-μ1)**2/(2*σ1**2)) - np.sqrt(2*np.pi*σ2**2) + (-(x2-μ2)**2/(2*σ2**2))

nwalkers = 20
ndim = 2
samples = 3000
init_walker_pos = np.random.rand(nwalkers, ndim)

sampler = emcee.EnsembleSampler(nwalkers=nwalkers, dim=ndim, lnpostfn=ln_doublegaussian)
sampler.run_mcmc(init_walker_pos, N=samples)

Looking at the tarmac.corner_plot and tarmac.walker_trace plots, it's immediately clear that something is a bit off:

double_gaussian

In particular, the tarmac.walker_trace shows that the walkers didn't converge until ~200 step into the run. This is a common issue with MCMC sampling, and typically these steps are discarded. This behavior is a direct result of the fact that the walkers were initialized above using a uniform distribution across the interval [0, 1):

init_walker_pos = np.random.rand(nwalkers, ndim)

This behavior is nothing to worry about, and if we toss out the first 250 samples we see the walkers are all well behaved:

double_gaussian_burnin

The corner plot shows the two random variables x1 and x2 are uncorrelated, as expected. The walker trace shows that all walkers have converged. Nice!

tarmac's People

Contributors

peytondmurray avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.