pints-team / pints Goto Github PK

View Code? Open in Web Editor NEW

214.0 12.0 32.0 182.73 MB

Probabilistic Inference on Noisy Time Series

Home Page: http://pints.readthedocs.io

License: Other

Python 99.87% TeX 0.13%

bayesian-methods inverse-problems parameter-estimation numerical-optimization

pints's Issues

adaptive covariance MCMC functionality

need a function/class that takes a model, a set of parameters, prior information and a data set and returns distributions on the parameters using adaptive covariance MCMC

Investigate Gradient Profiling

Following Gary's suggestion on slack

"Gradient Profiling (Giles Hooker) CollocInfer in R - a mixture of Nonlinear least squares and Gradient matching I think… need to read up"

Update tests for electrochemistry so they use generated data

Add an example using summary statistics (e.g. IV curve)

@jonc says: do this!

Maybe look into ABC?

Implement "stupid-mcmc" (also known as "prefetching")

See:

Brockwell (2006) Parallel Markov chain Monte Carlo simulation by pre-fetching
Strid (2009) Efficient parallelisation of Metropolis-Hastings algorithms using a pre-fetching approach

Rewrite hierarchical gibbs sampling code to use new framework

Add a method to the echem problem that generates some temporary toy data

Due to the large filesize, the data should either not be stored at all, or store it in a directory with a suitable .gitignore file.

This can then be used to test some optimisation methods on echem data

Reproduce Ross's pyhillfit results

Get distributions, mcmc, etc. from PyHillFit

https://github.com/mirams/PyHillFit/blob/master/python/PyHillFit.py

Create cardiac forward model (IKr)

Share interface with #2

Build 1 and then add the rest later

Add simple method that plots 1d log-likelihoods near points in parameter space

Non-linear optimizer functionality

Need a function/class that takes a model, some data, a set of parameters and some bounds and gives the best-fit parameters for that data

Add at least one optimiser or mcmc method that uses R

@mirams says: R has lots of libraries. We should set up a first method that uses on of those, test how well it works

Update CMAES method to new interface

Assess the impact of filtering the current data on estimates

Test different filtering methods from basic thinning to more sophisticated filters.

test script: STAN versus adaptive covariance MCMC

need a test script that compares STAN versus our own adaptive covariance MCMC, using

Electrochem FeIII dataset
Kylie's dataset (or other, as I don't have this yet...)

Add brute-force samplers (for uniform priors)

For example, explore each parameter individually (param x, score y) or plot any two parameters against each other (param 1 x, param 2 z, score y)

Use evaluator interface to parallelise

Uniform
Latin hypercube
Sobol

Add linear interpolation of input to STAN script, see if stuff should/can be optimised

Add score function including prior

Martin wrote:

"[C]an you create a score function that depends on the parameters (i.e. incorporate a prior into the score function)?"

Work out how to use Seaborn/CMA-ES without having it affect matplotlib too much

Libraries shouldn't affect other libraries, it would be pretty rubbish if people imported pints and then found matplotlib's behavior had changed because of the cma and seaborn modules.

Add methods to search in transformed parameter space

Ideally, this would automatically update priors too, somehow...?

Investigate different methods of implementing boundaries

Two options:

Implement a periodic transform on the parameter space (strategy used by CMA-ES, for example)
Implement a transform on the score function

allow forward models to output the model in STAN language

Need to implement each model in the STAN language. Perhaps have a member function for each model that returns the STAN language representation as a string?

Gradient Matching (fitting the slopes rather than absolute values)

Following Gary's suggestion on slack

Add gamma, beta, and exponential LogPrior

We need

gamma
beta
exponential
student-t

And tests for each

cvsin_type_1 reader

create a cvsin_type_1 reader for electrochemistry data

Add toy data for cardiac model

Generate data and store in CSV, add different levels of gaussian noise

create EC electrochemistry forward model

CellML Model class

It would be nice to have a class that implements a Model concept, which takes a cellml file (or string?) defining what the model is.

@MichaelClerx: you have some cellml conversion routines don't you. Would this be useful here?

Add tools for repeated optimisiations

~~The current cmaes method has some unused ipop code:~~

Once someone figures out a good way to get random samples in the parameter space we can either add an ipop setting to the CMAES class or rename the class IPOP_CMAES and create a wrapper called CMAES that disables it

Methods like IPOP_CMAES use multiple restarts from random positions in the search space to improve chances of finding optima and reducing chances of getting stuck.

We could add some code that does this automatically, maybe using the Boundaries class to generate new starting points or perhaps a Prior class.

Investigate POMP - Partially Observed Markov Processes

Following @mirams suggestion

Add hierarchical mcmc method from Ross's paper

Add new project to reproduce Ross's pyhillfit examples

Not the actual results, just the examples of MCMC

Work out interface to get 1st order sensitivities into Pints

Gary wrote: "Another 'whilst I remember' type thing! It would be good to get boring-old-Fisher Information / Hessian at the MLE and the covariance matrix that that implies, so we could compare max likelihood with Bayesian for some of these problems. Some of our peaks are so unimodal I suspect it may be an excellent approximation for a lot of our problems, and a zillion times faster."

Implement system to deal with singularities / numerical errors etc. in score functions

CMA-ES uses a NaN returned from the score function to trigger a resampling
PSO uses Inf, because it wants a ranking of scores, and inf is the worst possible value (note that x < NaN --> false, x > NaN --> false, etc)
We could use exceptions instead (but I quite like returning NaN, as it might happen undetected anyway!)

Set up travis-ci.org

@martinjrobins Jonathan Cooper suggested we set up this repo to have automated testing with Travis (travis-ci.org).
I had a look but it tells me I don't have the authority. Would you like to give this a go?

Replace priors by log-priors

Sanmitra says it's better :-)

Are all the algorithms happy with this?
Should this only happen under the hood? I imagine users would prefer to specify a prior rather than a log prior... We could even think about giving the Prior class a log() method?

Add optional R-hat convergence criterion to adaptive_mcmc code

Burn-in should always happen first
But after that, stop either when max iterations is reached or when converged (if it works well, maybe set default max-iter to something very big?)

Add gaussian noise log-likelihood with fixed sigma

Maybe update the names of the fixed/inferred ones to make the distinction clearer

Add FFT-based score function

Martin wrote:

[C]an you create an efficient score function that depends on the distance between experiment and model in the frequency domain, rather than time domain? I guess the score class can just take an FFT of the values when its created and re-use this?

[ikr] Look into global search strategies (repeated restarts, hybrids?)

One simple option is repeated restarts: #29

Another option might be to use PSO to find N good starting points (by running a search with N+M, M>=0 particles and returning the best N results) and then starting a search from each of these

Add SNES optimizer

Implement Kylie's models

Skipping Kiehn 1999 because it doesn't have equations for the rate constants but a look-up table instead

Add Model wrapper that uses Gaussian processes to evaluate

Possibly based on GPy or GPflow

STAN interface

Need a function/class that takes a model and a data set, passes this into STAN to be solved (using HMC) and returns distributions on the parameters

parallel CMA-ES

I think @MichaelClerx changes to CMA-ES to get it working in the new infrastructure removed the parallel aspects of CMA-ES? This is still in there as a comment, so should just need to integrate it with the new code

Look into model selection using 'reversible jump mcmc'

Chris Gill wrote:

I had an idea last summer about how one might go about doing model selection and parameter fitting in one go using mcmc but didn’t have time to get the details working enough to share it with anyone. It turns out someone has already developed the idea in quite a general framework, and it is useful - it’s called reversible jump mcmc. Essentially you can jump between different parameter spaces provided you have a suitable map between them. The wikipedia page has a fairly short intro to it. I wondered if that might be an interesting direction to try out with the electrochemistry stuff, e.g. determining mechanism of action and the parameters in one (admittedly computationally expensive) go? Just a thought, and I’m sure it will depend on how the different reaction models are specified, but I’ve been meaning to email you about it for some time now.

pints-team / pints Goto Github PK

pints's Issues

Recommend Projects

Recommend Topics

Recommend Org