lsstdesc / bayesian-pipelines-pixels Goto Github PK
View Code? Open in Web Editor NEWBayesian cosmological inference directly from pixels.
License: MIT License
Bayesian cosmological inference directly from pixels.
License: MIT License
Some notes:
could be useful in development and to test things quickly, but I think we will mostly use DC2 catalogs to sample
Useful links:
Goals:
Tasks:
Our Bayesian inference codes for analyzing galaxy images, such as BLISS and JIF, produce Monte Carlo samples from the approximate posterior distribution of galaxy image model parameters given the pixel data. To infer shear, we need to marginalize over the galaxy model parameters while inferring a shear model common to all the galaxies. This script implements importance sampling to perform this marginalization and shear inference. See the papers here and here.
Following our discussion last week and some reflection, I have attempted to identify the minium set of tasks to get our stage 0 shear inference pipeline up and running:
Data Format:
Train / Inference:
Validation:
No need to be complicated, just check code runs. This has the added benefit of showing how to run the code.
Once most of Stage 0 is implemented
With @aguinot we have converged on what a first version of the pipeline will look like. We have decided to split the effort in two stages. In the first stage we will use simulations from descwl-shear-sims
with constant shear per coadd and shape noise cancellation to evaluate the extent to which we incur in biases. For the second stage, the pipeline will be designed to take in simulations with a GLASS prior on clustering + shear + intrinsic ellipticities to output shear maps.
We will start by targeting the simplest possible simulations where we can measure biases. The point is to setup the pipeline (detection, split into groups, joint measurement with HMC, shear posterior with MagicBeans) and have sufficient statisticts to distinguish a shear bias (with the mininum number of coadds possible).
LensMC Euclid paper contains useful guidance.
Note: Recall no blending in stage 0. DM cutouts will always have brightest image in the center.
Use galcheat parameters?
List:
Notes:
For example, passing around images rather than galsim objects. Create a container for image information like pixel_scale
that can be passed around.
@EiffL brought this up in today's meeting. It might be good to think about this earlier rather than later so that we don't have to do a lot of work later in the inference stage to make our code compatible with tensorflow, pytorch, jax, etc.
Comparable to the Sersic profile and doesn't have the same issues with varying the sersic index continuously in Galsim
As @aguinot mentioned, starting with a Bulge+Disk model might be overkill. We can already test shear recovery in the simple Gaussian setting, and this model has a well-defined simple intrinsic ellipticity (no need for second-moment calculations, etc. ).
One note is that Bulge galaxies do not have a well defined size either (why?)
Later we can revisit the Bulge+Disk model (we need it for DC2)
can sample n
parameter randomly instead of Bulge+Disk
Add high-level functions that enable us to use descwl-shear-sims
images for evaluation.
We should decide on a set of guidelines on how to contribute in put them in a CONTRIBUTING.md
I should also mention that I made some choices on formatters, linting and added ci tests in #30. These are all up for discussion of course.
Some thoughts:
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.