2026-05-29 | | Total: 9
We present a comprehensive benchmarking dataset and empirical scaling law analysis for neural network wavefunctions by matching them to a wide spectrum of famous many body target wavefunctions. The dataset, WF-Bench, spans multiple distinct regimes of strongly correlated quantum matter, including topological states, Wigner crystals, and superconducting wavefunctions, providing a diverse and challenging test bed for neural network wavefunction expressivity. We introduce a systematic and reproducible benchmarking protocol for target wavefunction matching, enabling consistent performance evaluation across different neural network wavefunction architectures. By using wavefunction fidelity as the uniform metric, we discover empirical scaling laws that characterize how representability depends on system size and key model parameters, including number of determinant and model depth. By applying our benchmark protocol on Psiformer and Ferminet, we show that WF-Bench establishes a unified dataset driven framework for evaluating and comparing neural network wavefunctions and for guiding the design of future architectures.
The kinetic isotope effect (KIE) is the conventional probe for quantum tunneling, yet its composite nature conflates tunneling with zero-point energy and classical kinetics. Here, we introduce the tunneling phase diagram, a machine-learning framework that decouples true tunneling strength by decoding the nonlinear relationship between KIE and the tunneling factor (\k{appa}). With exceptional fidelity (R^2 > 0.98, RMSE = 0.21), this framework reveals an anomalous high KIE-low \k{appa} spanning 300-600 K, thereby defining a paradigm for the quantitative assessment of quantum tunneling.
Tailoring microscopic details to tune bulk rheology is a key paradigm in soft matter physics, yet the vast parameter space associated with constituent interactions precludes a fully systematic approach. To address this, we have designed a synergistic strategy to explore the parameter space that comprises simulations, experimental rheology, and machine learning. As a case study, we choose DNA-based self-assembled fluids whose viscoelastic response can be fine-tuned by manipulating the base sequencing of the constituent nucleic acid nanostars. We use coarse-grained simulations, benchmarked against experimental data, to obtain the rheology of the DNA fluids, which feeds forward to a framework of Gaussian Process Regression and active learning. The latter is then used to explore the rheological design space with high predictive precision. The pipeline is designed to be deployed iteratively for the rational design and accelerated discovery of generic soft matter suspensions.
We study the impact of the coupling topology on the ability of various networked dynamical systems to generate extreme events. By determining the coupling strength that is necessary to generate an extreme event in the collective dynamics of a given system, we observe a power-law-like relationship between this coupling threshold and both topological (edge density) and spectral (algebraic connectivity) properties of various coupling topologies. Interestingly, this relationship appears to be largely independent of both the investigated system and the underlying mechanism to generate extreme events. This may indicate that the observed relationship is primarily mediated by aspects of the coupling topology.
Multigraphs are graphs in which multiple links between pairs of nodes are allowed, whereas they are forbidden in simple graphs, the latter being widely used in network science. Simple graphs generated by the configuration model have served as a benchmark for validating theoretical approaches to dynamical processes on networks. However, generating large scale-free networks with degree exponent $γ<3$ introduces uncontrolled disassortative correlations and severe computational limitations due to the prohibition of reconnecting hubs. These constraints do not exist in multigraphs. We investigate how multiple connections affect epidemic spreading by comparing several epidemic models exhibiting an active steady state on simple graphs and multigraphs sharing the same degree sequence and natural upper cutoff. By analyzing epidemic thresholds, finite-size scaling, and localization, we show that differences between simple graphs and multigraphs emerge only when epidemic activity can persist on isolated hubs (star subgraphs) for times exponentially long in the hub degree. Our results remove a methodological barrier to the study of dynamical processes on large scale-free networks.
This manuscript explores a variational quantum formulation for nonlinear elasticity problems arising from hyperelastic material models, targeting near term noisy intermediate scale quantum (NISQ) devices. The approach leverages the potential energy structure of hyperelasticity and employs a hybrid quantum classical framework in which the energy functional is evaluated using parameterized quantum circuits and optimized through classical routines. To enable implementation on current quantum hardware, polynomial approximations of the nonlinear strain energy density are introduced, yielding a representation compatible with variational quantum algorithms. The methodology is demonstrated on a one dimensional NeoHookean material model using finite element discretizations with first and second order shape functions and nonhomogeneous boundary conditions. Numerical experiments investigate the influence of the polynomial approximation order on the accuracy and efficiency of the proposed approach, illustrating its feasibility for near term quantum devices.
Neural networks trained under different hyperparameter settings can fall into distinct training "regimes," with consistent behavior within regimes and qualitative differences across regimes. In this paper, we study such multi-regime behavior in scientific machine learning (SciML) models through a regime-aware diagnostic framework that jointly analyzes performance, training dynamics, and loss-landscape geometry. We identify three key findings: (i) a consistent three-regime structure emerges across many standard SciML models, different constraint enforcements, and various optimizer designs; (ii) optimization effectiveness is regime-specific, with no single method performing well across all regimes; and (iii) SciML models can exhibit fine-grained failure modes that can challenge conventional interpretations of standard loss-landscape metrics. Our results provide an approach to establish a unified, task-oblivious perspective on failure modes in SciML and to inform regime-aware guidance for improving robustness. We validate these findings across widely-used SciML models, including physics-informed neural networks, neural operators, and neural ordinary differential equations, on benchmarks spanning representative ordinary and partial differential equations.
In this paper, training a neural network is identified, exactly, as a search through Hamilton--Jacobi initial-value problems: each gradient step selects the initial data of a viscous Hamilton--Jacobi equation whose Hopf--Cole propagator best fits the observations; at inference, the input is the spatial point at which that solution is evaluated and the initial condition is already encoded in the weights. The correspondence is exact for log-sum-exp layers and structural for broader architectures: residual networks, transformers, and recurrent architectures (RNNs, LSTMs, SSMs) each discretize the same class of Hamilton--Jacobi equations, with architecture-dependent Hamiltonian and viscosity. A single deformation parameter $\varepsilon$ unifies all four perspectives (network, tropical algebra, viscous PDE, convex optimization) in a commutative diagram closed under Lipschitz conditions. Quantitative consequences include: the minimax optimal generalization rate $O(n^{-1/(d+2)})$ for fixed $t$; adversarial robustness controlled by $\varepsilon$; backpropagation as the co-state equation of the Hamiltonian system for residual networks (Pontryagin Maximum Principle); scaling exponents consistent with data intrinsic dimension via PDE quadrature; and a closed-form $O(N)$ influence function (softmax attribution weights $π_j$) whose entropy landscape undergoes fold bifurcations as $\varepsilon$ increases, each merging attribution basins.
Deploying Scientific Machine Learning surrogates in industrial CFD workflows requires adapting pretrained models to new vehicle families without large datasets; yet whether geometric representations learned by a geometry encoder transfer to topologically distinct shapes remains unvalidated. We address this through leave-one-family-out experiments on a 61.47M-parameter Transformer surrogate (AB-UPT) pretrained on four vehicle families (411 external aerodynamics cases) and adapted to the held-out fifth with only 20 samples. Three strategies are compared: Full Fine-Tuning (FFT), Lightweight Fine-Tuning (LFT), and Low-Rank Adaptation (LoRA). The central finding is that pretrained geometry encoders learn transferable representations, but the adaptation mechanism determines whether they can be exploited. FFT destabilizes as 61.47M unconstrained parameters overfit to 20 samples (R^2=0.40); LFT fails because the frozen encoder cannot represent unseen shapes (R^2<0). LoRA resolves both: rank-constrained adapters injected into all layers regularize the loss landscape while preserving pretrained features, achieving R^2=0.85+/-0.02 across all five families with 50% lower force RMSE than FFT and 28% lower pointwise field errors. LoRA also outperforms from-scratch training using 3x more target-family data, eliminating the need for large per-family datasets. These results recast LoRA from a memory-saving convenience into a convergence enabler for geometry transfer: a shared backbone paired with lightweight per-family adapters trainable in hours from minimal data.