Simulation-based inference

Introduction

Simulators are the modern manifestation of scientific theories. They implement mechanistic models of the underlying natural phenomena of interest as well as models for the instruments used to observe those phenomena. The expressiveness of programming languages facilitates the development of complex, high-fidelity simulations and the power of modern computing provides the ability to generate synthetic data from them. The flexibility of simulators has made them critical research tools (and major cyberinfrastructure investments) for predicting how systems will behave across many areas of science and engineering. Unfortunately, despite their predictive power, these simulators are poorly suited for statistical inference, which is a core aspect of data-intensive science. To meet this challenge, there are an emerging set of techniques for simulation-based inference (SBI).

Simulation-based inference is the next step in the methodological evolution of statistical practice in the sciences. SBI provides qualitatively new capabilities that can transform scientific practice in fields as diverse as evolutionary biology, systems biology, neuroscience, gravitational wave astronomy, dark matter astrophysics, cosmology, and particle physics. Inference problems in these areas are challenging because they involve high-dimensional, richly-structured spaces. Empowering domain scientists with the ability to directly infer from data the properties of the underlying mechanistic models that they are developing would be transformative.

SBI has also proven to be an effective lingua franca that facilitates communication between domain scientists and methodological experts, supports convergence research, and accelerates cross-pollination of ideas between fields.

Selected Resources

Papers

The plan is to turn this page into a crowd-sourced community resource that can collect recent papers including methodological developments and applications. Here are some links to get started:

Reviews

Curated Awesome List

  • Awesome Neural SBI - A similar effort with a less automated, more human-curated list of SBI papers initiated by Siddharth Mishra-Sharma.

Applications

Furthermore, genome-scale metabolic models (GEMs) can be used to simulate disparate types of ‘omics data, even though the explicit calculation of likelihoods may be intractable. Thus, the use of “simulation-based” Bayesian inference approaches is a promising route for informing GEM structure and parameters from data [198]. However, scaling Bayesian approaches up to deal with the large space of possible GEM reconstructions is an open, exciting and challenging research direction.

Other curated resources list

Latest Posts

W Teng, LJ Durlofsky - Available at SSRN 4990538 - papers.ssrn.com

Data assimilation will be essential for the management and expansion of geological carbon storage operations. In traditional data assimilation approaches a fixed set of …

P Cordero-Encinar, T Schröder, P Yatsyshin… - arXiv preprint arXiv …, 2024 - arxiv.org

… Unlike existing simulation-based inference approaches, which must be tied to a specific set of point evaluations, we learn a functional representation of parameters and …

AS Mancini, K Lin, JD McEwen - arXiv preprint arXiv:2410.10616, 2024 - arxiv.org

… We achieve this by taking a simulation-based inference (SBI) approach using neural likelihood estimation, which we couple with the learned harmonic mean estimator in …

K Lin, B Joachimi, JD McEwen - arXiv preprint arXiv:2410.11883, 2024 - arxiv.org

… We demonstrate the first successful use of scattering representations without further compression for simulation-based inference (SBI) with images (ie field-level), …