Latent.jl is a Julia package that implements a variety of latent variable models, estimated using either Expectation-Maximization (EM), MCMC sampling, or a combination of both.
It currently contains an implementation of a Gaussian Mixture Model (GMM) used to cluster continuous data, as well as a time-homogeneous, stationary Hidden Markov Model (HMM) for continuous univariate data.
You can install the package through Julia's package manager:
julia> using Pkg
julia> Pkg.add(PackageSpec(url="https://github.com/JasperHG90/Latent.jl"))
Gaussian Mixture Model (GMM)
The GMM clusters continuous data by modeling it as a mixture of multivariate Gaussian distributions, one per cluster.
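To make the decomposition concrete, here is a minimal sketch of a two-component mixture density using Distributions.jl. This is a standalone illustration, not part of Latent.jl's API:
using Distributions
# Hypothetical two-component mixture: p(x) = 0.4·N(x|μ₁,Σ₁) + 0.6·N(x|μ₂,Σ₂)
components = [MvNormal([0.0, 0.0], [1.0 0.0; 0.0 1.0]),
              MvNormal([3.0, 3.0], [1.0 0.5; 0.5 1.0])]
ζ = [0.4, 0.6]                    # mixing proportions; must sum to 1
mix = MixtureModel(components, ζ)
pdf(mix, [1.0, 1.0])              # mixture density at a single point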
We can simulate such data as follows:
using Latent;
#=
Create a dataset consisting of draws from three multivariate
normal distributions, to be clustered using a GMM.
=#
using Plots, Random
Random.seed!(5236);
K = 3
N = [100 90 35];
μ = [1.8 9.0 -.3; 11.0 10.2 4.0];
Σ = cat([5. .6; .6 3.2], [4.2 3; 3 3.6], [3 2.2 ; 2.2 3], dims=3);  # one covariance matrix per cluster, stacked along the third array dimension
# Simulate dataset
X, lbls = Latent.GMM.simulate(K, N, μ, Σ);
# Plot
plot(X[:,1], X[:,2], group=lbls, seriestype = :scatter, title = "GMM with 3 clusters")
To cluster the data using a GMM, we call `clust`:
# Number of clusters we think are in our dataset
K = 3;
# Retrieve labels and optim history
params, lblsp, history = Latent.GMM.clust(X, K; maxiter = 200, epochs = 150);
The `clust` function runs the EM algorithm several times (in this case, 150 epochs). We can plot the loss for each of these epochs as follows:
# Plot history
Latent.GMM.plot_history(history)
We can also plot the clusters:
# Plot clusters
plot(X[:,1], X[:,2], group=lblsp, seriestype = :scatter, title = "GMM with 3 clusters (estimated)")
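Under the hood, each epoch alternates an expectation (E) step, which computes cluster responsibilities, with a maximization (M) step, which re-estimates the parameters. Below is a hedged sketch of one generic EM iteration for a GMM; em_step, μs, and Σs are illustrative names, and this is not necessarily Latent.jl's exact implementation:
using Distributions
function em_step(X, ζ, μs, Σs)
    N, K = size(X, 1), length(ζ)
    # E-step: responsibilities r[i,k] ∝ ζ[k]·N(xᵢ | μ_k, Σ_k)
    r = hcat([ζ[k] .* pdf(MvNormal(μs[k], Σs[k]), X') for k in 1:K]...)
    r ./= sum(r, dims=2)
    # M-step: re-estimate proportions, means, and covariances
    Nk = vec(sum(r, dims=1))
    ζ  = Nk ./ N
    μs = [vec(sum(r[:, k] .* X, dims=1)) ./ Nk[k] for k in 1:K]
    Σs = [(X .- μs[k]')' * (r[:, k] .* (X .- μs[k]')) ./ Nk[k] for k in 1:K]
    ζ, μs, Σs
end
Iterating em_step until the loss stops improving yields one EM run; running many epochs from different initializations guards against poor local optima.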
We can also use MCMC sampling instead of the EM algorithm to obtain posterior distributions for the parameters.
Using the same generated data as we did in the previous example, we first need to specify the following hyperpriors:
## Hyperpriors
using LinearAlgebra
using Distributions
N, M = size(X)
# Prior means
κ0 = zeros((M, K))
# Prior covariance matrix
T0 = zeros((M, M, K))
for k ∈ 1:K
T0[:,:,k] = Matrix(I, (M,M))
end;
# Hypothesized number of subjects in each group
ν0 = ones(Int64, (K)) .+ 1
# Hypothesized prior sums-of-squares matrix
Ψ0 = T0
# Hypothesized number of subjects
α0 = [1, 1, 1]
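These hyperparameters correspond, under the conjugate setup we assume here, to a Dirichlet prior on the mixing proportions and a Normal-Inverse-Wishart prior on each component's mean and covariance. A hedged sketch of drawing from these priors with Distributions.jl (the package's exact parameterization may differ):
using Distributions
k = 1                                          # first cluster
Σk = rand(InverseWishart(ν0[k], Ψ0[:, :, k]))  # covariance draw; df = ν0[k] must exceed M - 1
μk = rand(MvNormal(κ0[:, k], Σk))              # mean draw given the covariance
ζ  = rand(Dirichlet(Float64.(α0)))             # mixing-proportion draw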
We can now call the MCMC sampler:
# Sample the posterior distributions
history = Latent.BGMM.gibbs_sampler(X, K, α0, κ0, T0, ν0, Ψ0; iterations=5000);
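Internally, a Gibbs sampler for this model alternates between sampling cluster assignments given the current parameters and sampling parameters given the assignments. A hedged sketch of the assignment step (a generic algorithm; assignment_step is an illustrative name, not Latent.jl API):
using Distributions
function assignment_step(X, ζ, μs, Σs)
    K = length(ζ)
    z = Vector{Int}(undef, size(X, 1))
    for i in 1:size(X, 1)
        # zᵢ is drawn with probability proportional to ζ[k]·N(xᵢ | μ_k, Σ_k)
        p = [ζ[k] * pdf(MvNormal(μs[k], Σs[k]), X[i, :]) for k in 1:K]
        z[i] = rand(Categorical(p ./ sum(p)))
    end
    z
end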
We can plot the trace plots as follows:
# Specify burn-in samples
burnin = 2000
# Get means
# Get MCMC history for means.
# (chains x iterations x variables x clusters)
μ_h = history[1];
μ_h1 = reshape(μ_h[1,:,:,:],(size(μ_h)[2], size(μ_h)[3] * size(μ_h)[4])) # Chain 1
μ_h2 = reshape(μ_h[2,:,:,:],(size(μ_h)[2], size(μ_h)[3] * size(μ_h)[4])) # Chain 2
# Trace plots
plot(μ_h1, alpha=0.8, title="Trace plot (means)")
plot!(μ_h2, alpha=0.5)
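Beyond eyeballing the trace plots, we can compute a convergence diagnostic. Below is a hedged sketch of the Gelman-Rubin R̂ statistic for a single parameter across the two chains, in a simplified form without split-chain corrections; rhat is an illustrative helper, not part of Latent.jl:
using Statistics
function rhat(c1::AbstractVector, c2::AbstractVector)
    n = length(c1)
    W = (var(c1) + var(c2)) / 2           # within-chain variance
    B = n * var([mean(c1), mean(c2)])     # between-chain variance
    sqrt(((n - 1) / n * W + B / n) / W)   # values close to 1 indicate convergence
end
rhat(μ_h1[burnin:end, 1], μ_h2[burnin:end, 1])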
After establishing that the chains have converged, we can inspect point estimates computed as posterior means over the post-burn-in samples (labeled MAP in the code below):
# Obtain MAP estimates
# (use only chain 1)
# Means (chains x iterations x variables x clusters)
μ_MAP = mapslices(mean, history[1][1,burnin:end,:,:], dims=[1]) |> x -> reshape(x, size(μ))
# Covariance matrix (Chains x iterations x variables x variables x clusters)
Σ_MAP = mapslices(mean, history[2][1,burnin:end,:,:,:], dims=[1]) |> x -> reshape(x, size(Σ))
# Mixing proportions (Chains x iterations x K)
ζ_MAP = mapslices(mean, history[3][1,burnin:end,:], dims=[1]) |> x -> reshape(x, size(x) |> reverse)
# Get cluster assignments
clstrs = Latent.BGMM.cluster_assignments(X, ζ_MAP, μ_MAP, Σ_MAP)
# Plot
plot(X[:,1], X[:,2], group=clstrs, seriestype = :scatter, title = "GMM with 3 clusters")
Compared to EM estimation, Bayesian inference may seem like a chore. However, it offers real benefits: the posterior can be updated sequentially as new data arrives, and it automatically yields uncertainty estimates that are easier to interpret than their frequentist counterparts.
Hidden Markov Model (HMM)
This library currently contains an implementation of an HMM for univariate Gaussian emission distributions.
We can simulate some data using the following function:
# Set seed
Random.seed!(425234);
# Number of hidden states
M = 3
# Sequence length
T = 800
# Transition probabilities
Γ = [0.7 0.12 0.18 ; 0.17 0.6 0.23 ; 0.32 0.38 0.3]
# Component distribution means
μ = [-6.0; 0.0; 6.0]
# Component distribution variances
σ = [0.1; 2.0; 1.4]
# Simulate data
X, Z = Latent.HMM.simulate(M, T, Γ, μ, σ);
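For intuition, an HMM generates data by first sampling a Markov chain of hidden states and then emitting an observation from the active state's Gaussian. A hedged sketch of this generative process (illustrative only; Latent.jl's simulate may differ, e.g. by initializing from the stationary distribution rather than uniformly):
using Distributions
function simulate_hmm(M, T, Γ, μ, σ)
    Z = Vector{Int}(undef, T)
    Z[1] = rand(1:M)                            # uniform initial state (assumption)
    for t in 2:T
        Z[t] = rand(Categorical(Γ[Z[t-1], :]))  # transition out of the previous state
    end
    X = [rand(Normal(μ[z], sqrt(σ[z]))) for z in Z]  # σ holds variances
    X, Z
end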
From the histogram below, we see that the data is multimodal.
histogram(X, bins=20)
We can fit the HMM as follows:
# Fit HMM
θ, stats, S = Latent.HMM.fit(X, M; epochs=3);
We can inspect the estimated parameters (the transition matrix, the means, and the variances) in the REPL:
julia> θ[1]
3×3 Array{Float64,2}:
 0.672852  0.120804  0.200405
 0.233165  0.535989  0.23532
 0.332107  0.38422   0.289379
julia> θ[2]
3-element Array{Float64,1}:
-6.000510239439516
0.18597721602692183
5.996999220639854
julia> θ[3]
3-element Array{Float64,1}:
0.097743419737968
2.0935618423151405
1.473874383048123
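The log-likelihood of the sequence under the fitted parameters can be evaluated with the forward algorithm. Below is a hedged sketch with per-step rescaling to avoid numerical underflow; forward_loglik and the initial distribution δ are illustrative, not Latent.jl API, and we assume θ[3] holds variances, as in the simulation:
using Distributions
function forward_loglik(x, Γ, μ, σ, δ)
    M = length(μ)
    emission(t) = [pdf(Normal(μ[m], sqrt(σ[m])), x[t]) for m in 1:M]
    α = δ .* emission(1)
    ll = log(sum(α)); α ./= sum(α)        # rescale α at every step
    for t in 2:length(x)
        α = (Γ' * α) .* emission(t)
        ll += log(sum(α)); α ./= sum(α)
    end
    ll
end
# For example, with a uniform initial distribution:
forward_loglik(X, θ[1], θ[2], θ[3], fill(1/3, 3))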
Finally, we can plot the decoded state sequence:
plot(S)