Simulation-based inference

Introduction

Simulators are the modern manifestation of scientific theories. They implement mechanistic models of the underlying natural phenomena of interest as well as models for the instruments used to observe those phenomena. The expressiveness of programming languages facilitates the development of complex, high-fidelity simulations and the power of modern computing provides the ability to generate synthetic data from them. The flexibility of simulators has made them critical research tools (and major cyberinfrastructure investments) for predicting how systems will behave across many areas of science and engineering. Unfortunately, despite their predictive power, these simulators are poorly suited for statistical inference, which is a core aspect of data-intensive science. To meet this challenge, there are an emerging set of techniques for simulation-based inference (SBI).

Simulation-based inference is the next step in the methodological evolution of statistical practice in the sciences. SBI provides qualitatively new capabilities that can transform scientific practice in fields as diverse as evolutionary biology, systems biology, neuroscience, gravitational wave astronomy, dark matter astrophysics, cosmology, and particle physics. Inference problems in these areas are challenging because they involve high-dimensional, richly-structured spaces. Empowering domain scientists with the ability to directly infer from data the properties of the underlying mechanistic models that they are developing would be transformative.

SBI has also proven to be an effective lingua franca that facilitates communication between domain scientists and methodological experts, supports convergence research, and accelerates cross-pollination of ideas between fields.

Selected Resources

Papers

The plan is to turn this page into a crowd-sourced community resource that can collect recent papers including methodological developments and applications. Here are some links to get started:

Reviews

Curated Awesome List

  • Awesome Neural SBI - A similar effort with a less automated, more human-curated list of SBI papers initiated by Siddharth Mishra-Sharma.

Applications

Furthermore, genome-scale metabolic models (GEMs) can be used to simulate disparate types of ‘omics data, even though the explicit calculation of likelihoods may be intractable. Thus, the use of “simulation-based” Bayesian inference approaches is a promising route for informing GEM structure and parameters from data [198]. However, scaling Bayesian approaches up to deal with the large space of possible GEM reconstructions is an open, exciting and challenging research direction.

Other curated resources list

Latest Posts

C Pacilio, S Bhagwat, R Cotesta - arXiv preprint arXiv:2404.11373, 2024 - arxiv.org

… In this work, we present a proof-of-concept strategy to perform likelihood-free Bayesian inference on ringdown gravitational waves using simulation based inference. …

E Maceda, EC Hector, A Lenzi, BJ Reich - arXiv preprint arXiv:2404.10899, 2024 - arxiv.org

Classic Bayesian methods with complex models are frequently infeasible due to an intractable likelihood. Simulation-based inference methods, such as Approximate …

M Gloeckler, M Deistler, C Weilbach, F Wood… - arXiv preprint arXiv …, 2024 - arxiv.org

… Conclusion We developed the Simformer, a new method for amortized simulation-based inference. On benchmark tasks, it performs at least as well as existing methods …

K Karchev, M Grayling, BM Boyd… - Monthly Notices of …, 2024 - academic.oup.com

… By utilising truncated marginal neural ratio estimation (TMNRE), a neural network-enabled simulation-based inference technique, we implicitly marginalise over 4000 …