Data Analysis, Statistics and Probability

2025-02-14 | | Total: 3

#1 Communicating Likelihoods with Normalising Flows [PDF] [Copy] [Kimi] [REL]

Authors: Jack Y. Araz, Anja Beck, Méril Reboud, Michael Spannowsky, Danny van Dyk

We present a machine-learning-based workflow to model an unbinned likelihood from its samples. A key advancement over existing approaches is the validation of the learned likelihood using rigorous statistical tests of the joint distribution, such as the Kolmogorov-Smirnov test of the joint distribution. Our method enables the reliable communication of experimental and phenomenological likelihoods for subsequent analyses. We demonstrate its effectiveness through three case studies in high-energy physics. To support broader adoption, we provide an open-source reference implementation, nabu.

Subjects: High Energy Physics - Phenomenology , Machine Learning , High Energy Physics - Experiment , Data Analysis, Statistics and Probability

Publish: 2025-02-13 17:00:11 UTC


#2 Machine learning for modelling unstructured grid data in computational physics: a review [PDF] [Copy] [Kimi] [REL]

Authors: Sibo Cheng, Marc Bocquet, Weiping Ding, Tobias Sebastian Finn, Rui Fu, Jinlong Fu, Yike Guo, Eleda Johnson, Siyi Li, Che Liu, Eric Newton Moro, Jie Pan, Matthew Piggott, Cesar Quilodran, Prakhar Sharma, Kun Wang, Dunhui Xiao, Xiao Xue, Yong Zeng, Mingrui Zhang, Hao Zhou, Kewei Zhu, Rossella Arcucci

Unstructured grid data are essential for modelling complex geometries and dynamics in computational physics. Yet, their inherent irregularity presents significant challenges for conventional machine learning (ML) techniques. This paper provides a comprehensive review of advanced ML methodologies designed to handle unstructured grid data in high-dimensional dynamical systems. Key approaches discussed include graph neural networks, transformer models with spatial attention mechanisms, interpolation-integrated ML methods, and meshless techniques such as physics-informed neural networks. These methodologies have proven effective across diverse fields, including fluid dynamics and environmental simulations. This review is intended as a guidebook for computational scientists seeking to apply ML approaches to unstructured grid data in their domains, as well as for ML researchers looking to address challenges in computational physics. It places special focus on how ML methods can overcome the inherent limitations of traditional numerical techniques and, conversely, how insights from computational physics can inform ML development. To support benchmarking, this review also provides a summary of open-access datasets of unstructured grid data in computational physics. Finally, emerging directions such as generative models with unstructured data, reinforcement learning for mesh generation, and hybrid physics-data-driven paradigms are discussed to inspire future advancements in this evolving field.

Subjects: Machine Learning , Computational Engineering, Finance, and Science , Data Analysis, Statistics and Probability , Fluid Dynamics

Publish: 2025-02-13 14:11:33 UTC


#3 Glacier data assimilation on an Arctic glacier: Learning from large ensemble twin experiments [PDF] [Copy] [Kimi] [REL]

Authors: Wenxue Cao, Kristoffer Aalstad, Louise S. Schmidt, Sebastian Westermann, Thomas V. Schuler

Glacier modeling is crucial for quantifying the evolution of cryospheric processes. At the same time, uncertainties hamper process understanding and predictive accuracy. Here, we suggest improving glacier mass balance simulations for the Kongsvegen glacier in Svalbard through the application of Bayesian data assimilation techniques in a set of large ensemble twin experiments. Noisy synthetic observations of albedo and snow depth, generated using the multilayer CryoGrid community model with a full energy balance, are assimilated using two ensemble-based data assimilation schemes: the particle batch smoother and the ensemble smoother. A comprehensive evaluation exercise demonstrates that the joint assimilation of albedo and snow depth improves the simulation skill by up to 86% relative to the prior in specific glacier regions. The particle batch smoother excels in representing albedo dynamics, while the ensemble smoother is particularly effective for snow depth under low snowfall conditions. By combining the strengths of both observations, the joint assimilation achieves improved mass balance simulations across different glacier zones using either assimilation scheme. This work underscores the potential of ensemble-based data assimilation methods for refining glacier models by offering a robust framework to enhance predictive accuracy and reduce uncertainties in cryospheric simulations. Further advances in glacier data assimilation will be critical to better understanding the fate and role of Arctic glaciers in a changing climate.

Subjects: Geophysics , Data Analysis, Statistics and Probability

Publish: 2025-02-13 13:28:03 UTC