Simulation Based Inference

Simulation Based Inference (SBI), or Likelihood-free inference, is a state of the art approach to Bayesian inference that leverages the power of modern numerical simulations alongside modern neural density estimation methods. For a review, see Cranmer, Brehmer & Louppe 2020.

I have recently been working on SBI approaches to a range of problems in physics. I was involved in the LtU-ILI suite (Ho et al., 2024), a framework for performing SBI in cosmology and astrophysics. I applied this framework to forward modelled photometry from the CAMELS simulations (Lovell et al., 2024), to perform parameter inference. I’ve also been involved with a number of other projects utilising SBI approaches, for parameter inference and model comparison (missing reference).

Figure showing the typical components and structure of an SBI methodology (courtesy of TransferLab)

In 2024 we organised the first dedicated meeting on Simulation Based Inference for Galaxy Evolution. The next installment is scheduled for 27th - 30th May 2025 - come and join us in Bristol!

References

2024

OJA

LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology

Matthew Ho, Deaglan J. Bartlett, Nicolas Chartier, and 12 more authors

OJA, Jul 2024

ADS Bibcode: 2024OJAp....7E..54H

Abs DOI

This paper presents the Learning the Universe Implicit Likelihood Inference (LtU-ILI) pipeline, a codebase for rapid, user-friendly, and cutting-edge machine learning (ML) inference in astrophysics and cosmology. The pipeline includes software for implementing various neural architectures, training schema, priors, and density estimators in a manner easily adaptable to any research workflow. It includes comprehensive validation metrics to assess posterior estimate coverage, enhancing the reliability of inferred results. Additionally, the pipeline is easily parallelizable, designed for efficient exploration of modeling hyperparameters. To demonstrate its capabilities, we present real applications across a range of astrophysics and cosmology problems, such as: estimating galaxy cluster masses from X-ray photometry; inferring cosmology from matter power spectra and halo point clouds; characterising progenitors in gravitational wave signals; capturing physical dust parameters from galaxy colors and luminosities; and establishing properties of semi-analytic models of galaxy formation. We also include exhaustive benchmarking and comparisons of all implemented methods as well as discussions about the challenges and pitfalls of ML inference in astronomical sciences. All code and examples are made publicly available at https://github.com/maho3/ltu-ili.
arXiv

Learning the Universe: Cosmological and Astrophysical Parameter Inference with Galaxy Luminosity Functions and Colours

Christopher C. Lovell, Tjitske Starkenburg, Matthew Ho, and 9 more authors

arXiv, Nov 2024

arXiv:2411.13960

Abs DOI

We perform the first direct cosmological and astrophysical parameter inference from the combination of galaxy luminosity functions and colours using a simulation based inference approach. Using the Synthesizer code we simulate the dust attenuated ultraviolet–near infrared stellar emission from galaxies in thousands of cosmological hydrodynamic simulations from the CAMELS suite, including the Swift-EAGLE, Illustris-TNG, Simba & Astrid galaxy formation models. For each galaxy we calculate the rest-frame luminosity in a number of photometric bands, including the SDSS {}textit{ugriz} and GALEX FUV & NUV filters; this dataset represents the largest catalogue of synthetic photometry based on hydrodynamic galaxy formation simulations produced to date, totalling \textgreater200 million sources. From these we compile luminosity functions and colour distributions, and find clear dependencies on both cosmology and feedback. We then perform simulation based (likelihood-free) inference using these distributions, and obtain constraints on both cosmological and astrophysical parameters. Both colour distributions and luminosity functions provide complementary information on certain parameters when performing inference. Most interestingly we achieve constraints on {}sigma_8 describing the clustering of matter. This is attributable to the fact that the photometry encodes the star formation–metal enrichment history of each galaxy; galaxies in a universe with a higher {}sigma_8 tend to form earlier and have higher metallicities, which leads to redder colours. We find that a model trained on one galaxy formation simulation generalises poorly when applied to another, and attribute this to differences in the subgrid prescriptions, and lack of flexibility in our emission modelling. The photometric catalogues are publicly available at: https://camels.readthedocs.io/ .