Papers

Data-Driven High-Dimensional Statistical Inference with Generative Models

O Amram, M Szewc - arXiv preprint arXiv:2506.06438, 2025 - arxiv.org
Physics paper hep-ph Suggest

… A class of analysis techniques, called simulation based inference (SBI) has been proposed to optimally perform inference in this case [4–7]. Nearly all SBI proposals in …

Link to paper

BibTeX

@article{2506.06438v2,
Author = {Oz Amram and Manuel Szewc},
Title = {Data-Driven High-Dimensional Statistical Inference with Generative Models},
Eprint = {2506.06438v2},
DOI = {10.1007/JHEP11(2025)129},
ArchivePrefix = {arXiv},
PrimaryClass = {hep-ph},
Abstract = {Crucial to many measurements at the LHC is the use of correlated multi-dimensional information to distinguish rare processes from large backgrounds, which is complicated by the poor modeling of many of the crucial backgrounds in Monte Carlo simulations. In this work, we introduce HI-SIGMA, a method to perform unbinned high-dimensional statistical inference with data-driven background distributions. In contradistinction to many applications of Simulation Based Inference in High Energy Physics, HI-SIGMA relies on generative ML models, rather than classifiers, to learn the signal and background distributions in the high-dimensional space. These ML models allow for interpretable inference while also incorporating model errors and other sources of systematic uncertainties. We showcase this methodology on a simplified version of a di-Higgs measurement in the $bbγγ$ final state, where the di-photon resonance allows for background interpolation from sidebands into the signal region. We demonstrate that HI-SIGMA provides improved sensitivity as compared to standard classifier-based methods, and that systematic uncertainties can be straightforwardly incorporated by extending methods which have been used for histogram based analyses.},
Year = {2025},
Month = {Jun},
Note = {J. High Energ. Phys. Nov. 2025, 129},
Url = {http://arxiv.org/abs/2506.06438v2},
File = {2506.06438v2.pdf}
}

Share