Field-Level Comparison and Robustness Analysis of Cosmological N-body Simulations

BibTeX
@article{2505.13620v2,
Author        = {Adrian E. Bayer and Francisco Villaescusa-Navarro and Sammy Sharief and Romain Teyssier and Lehman H. Garrison and Laurence Perreault-Levasseur and Greg L. Bryan and Marco Gatti and Eli Visbal},
Title         = {Field-Level Comparison and Robustness Analysis of Cosmological N-body Simulations},
Eprint        = {2505.13620v2},
DOI           = {10.3847/1538-4357/adef4e},
ArchivePrefix = {arXiv},
PrimaryClass  = {astro-ph.CO},
Abstract      = {We present the first field-level comparison of cosmological N-body simulations, considering various widely used codes: Abacus, CUBEP$^3$M, Enzo, Gadget, Gizmo, PKDGrav, and Ramses. Unlike previous comparisons focused on summary statistics, we conduct a comprehensive field-level analysis: evaluating statistical similarity, quantifying implications for cosmological parameter inference, and identifying the regimes in which simulations are consistent. We begin with a traditional comparison using the power spectrum, cross-correlation coefficient, and visual inspection of the matter field. We follow this with a statistical out-of-distribution (OOD) analysis to quantify distributional differences between simulations, revealing insights not captured by the traditional metrics. We then perform field-level simulation-based inference (SBI) using convolutional neural networks (CNNs), training on one simulation and testing on others, including a full hydrodynamic simulation for comparison. We identify several causes of OOD behavior and biased inference, finding that resolution effects, such as those arising from adaptive mesh refinement (AMR), have a significant impact. Models trained on non-AMR simulations fail catastrophically when evaluated on AMR simulations, introducing larger biases than those from hydrodynamic effects. Differences in resolution, even when using the same N-body code, likewise lead to biased inference. We attribute these failures to a CNN's sensitivity to small-scale fluctuations, particularly in voids and filaments, and demonstrate that appropriate smoothing brings the simulations into statistical agreement. Our findings motivate the need for careful data filtering and the use of field-level OOD metrics, such as PQMass, to ensure robust inference.},
Year          = {2025},
Month         = {May},
Note          = {ApJ 989 207 (2025)},
Url           = {http://arxiv.org/abs/2505.13620v2},
File          = {2505.13620v2.pdf}
}
Papers

Field-Level Comparison and Robustness Analysis of Cosmological N-body Simulations

AE Bayer, F Villaescusa-Navarro, S Sharief… - arXiv preprint arXiv …, 2025 - arxiv.org