2026-05-29 | | Total: 273
We establish two structural majorization relations, which we call precursors, underlying the properties of supermodularity and subadditivity on the lattice induced by majorization. These are precursors in that they immediately imply that all sums of concave functions, which we dub sum-concave functions, are supermodular and subadditive on the majorization lattice. Using these majorization relations, we then show the supermodularity and subadditivity (in the lattice-theoretic sense) of Tsallis entropies (for all $α$) and Rényi entropies (for all $α> 1$), also recovering these properties for the Shannon entropy in the process. We further strengthen these inequalities, showing that: (i) all these entropic functionals are strictly subadditive on the majorization lattice; (ii) Tsallis entropies (and therefore the Shannon entropy as well) are strictly supermodular on the majorization lattice.
In this paper, we give a short Bayesian proof of Talagrand's celebrated majorizing-measure theorem (MMT). While the upper-bound direction of MMT follows relatively directly from standard arguments, the lower-bound direction is widely regarded as the more difficult part and has received several distinct proofs. Unlike previous approaches, our proof does not rely on existing Gaussian processes lower bounds techniques, nor on combinatorial, geometric, or coding-theoretic constructions. Instead, we derive the lower bound from two area identities for Gaussian additive models. We show that the Gaussian width of a finite set is the integrated mean-squared error of the maximum-likelihood estimator (MLE), while the integrated minimum mean-squared error (MMSE) is larger than the Fernique-Talagrand functional, up to a universal constant. Simply then comparing the MLE with Bayes-optimal estimation gives a direct proof of the hard direction of MMT.
This article shortly provides related proofs of the ergodic theorems of von Neumann, Birkhoff, Wiener, and Rokhlin's lemma for $Z^d$-actions with an invariant measure. It is shown how some deviations of ergodic averages can be structured. The deviations tend to zero almost everywhere. They are asymptotically almost invariant with respect to the action due to averaging. In this situation, the question of the distribution of the values of such deviations is meaningful. It turns out that for any free ergodic $Z^d$-action these distributions can be weakly close to any given distribution if we change the scale on the value line.
We study invariant statistical connections on the space $\mathcal{N}_0^n$ of zero-mean multivariate normal distributions (the multivariate centered Gaussian model) equipped with the Fisher metric $g^F$. We introduce moduli spaces of invariant statistical connections on homogeneous Riemannian manifolds via two natural equivalence relations arising from a categorical viewpoint, and apply this framework to $(\mathcal{N}_0^n, g^F)$. We explicitly determine the $GL(n,\mathbb{R})$-invariant and $\mathrm{Isom}(\mathcal{N}_0^n, g^F)$-invariant statistical connections, with particular emphasis on the dually flat case, and describe the corresponding moduli spaces.
We study Bernoulli percolation on $\mathbb Z^d$ in dimensions ${d>6}$. We prove that a classical consequence of the van den Berg-Kesten inequality, often referred to as the Simon-Lieb inequality in the context of the Ising model, admits a partial reversal. As a main application, we show that the quantity $\varphi_{p_c}(S)$, introduced by Duminil-Copin and Tassion (Comm.\ Math.\ Phys., 2016), is uniformly bounded over all $S\subset \mathbb Z^d$. This partial reversal further yields a short and self-contained route to several key results, including near-critical estimates on the two-point function and sharp bounds on the critical one-arm probability.
We study the moduli stacks of real vector bundles of fixed rank and degree on a type I real algebraic curve and determine its mod 2 cohomology algebra in terms of characteristic classes.
In this paper, we are dealing with constrained vector optimisation problems where the objective function acts between real linear-topological spaces. Our aim is to study the relationships between the sets of properly efficient solutions to constrained and unconstrained vector optimisation problems under certain cone convexity assumptions on the objective function using a vectorial penalisation approach.
Given a finite abelian group $G$ and a Sylow $p$-subgroup $N_p$, we prove that the $KU_G/p$-local sphere spectrum is equivalent to the homotopy fixed points of a $p$-complete $KO_{N_p}$-module spectrum. Then we compute the $\mathbb{Z}$-graded homotopy Mackey functors of the $KU_G$-local sphere spectrum. This result generalizes the computation of arXiv:2303.12271 for finite $p$-groups, where $p$ is an odd prime. Finally, by comparing the Bousfield classes of $KU_G/p$ and $G$-equivariant Morava $K$-theory, we prove that the $KU_G/p$-local sphere spectrum is equivalent to a wedge sum of equivariant Morava $K$-theory localized sphere spectra, and describe the $RO(G)$-graded homotopy Mackey functors of the $KU_G/p$-local sphere spectrum.
The crystalline differential operators on a smooth variety X give rise to a non-split Azumaya algebra over the cotangent bundle of the Frobenius twist X'. In some cases, this Azumaya algebra splits when restricted to finite covers of X'. In this short note, we show that, whenever X has a non-closed global one-form, there is a degree one cover of X' on which the Azumaya algebra does not split, answering a question of Sasha Petrov.
Physics-informed neural networks (PINNs) formulate the solution of partial differential equations as residual minimization problems over neural network parameterizations. Although highly flexible, optimization of PINNs using modern variants of Stochastic Gradient Descent algorithms is expensive. On the other hand, iterative computation of PINN parameterization using the Gauss-Newton method suffers from convergence difficulties, dense Jacobian structures, and poor conditioning that limit the effectiveness of second-order optimization methods. In this work, we introduce IGA-ODIL, a spline-based residual minimization framework combining ideas from Optimizing DIscrete Loss (ODIL), robust variational residual minimization, and Isogeometric Analysis (IGA). Instead of neural-network parameterizations of PINNs, the unknown solution is represented by smooth B-spline basis functions, leading to sparse structured Jacobians and efficient Gauss--Newton optimization. We also derive robust residual formulations based on weighted Gram operators, making the loss function related with the true error. The resulting systems inherit locality, sparsity, and approximation-theoretic properties of classical finite element and isogeometric methods while preserving the residual-learning philosophy of scientific machine learning. The proposed methodology is evaluated on several benchmark problems, including Poisson equations, convection-dominated advection--diffusion equations, Helmholtz problems with highly oscillatory solutions, nonlinear Allen--Cahn equations, and inverse Helmholtz parameter identification. Numerical experiments demonstrate orders-of-magnitude speedups compared with PINNs and CRVPINNs while maintaining high accuracy and robustness.
We propose Acc-Sinkhorn, a simple accelerated variant of Sinkhorn for entropy-regularized optimal transport (EOT). The method is derived from a bilevel optimization view: Sinkhorn row scaling solves the inner variable $u$ exactly and defines the reduced dual objective $f(v)=\min_u F(u,v)$, while the remaining column scaling is a unit-step dual mirror descent step in $v$. This structure yields a Hessian-driven Nesterov acceleration that keeps Sinkhorn's scaling form and per-iteration cost, using only extrapolated combinations of Sinkhorn iterates. We prove an $\mathcal{O}(1/k^2)$ rate under a verifiable stability condition. For an $\varepsilon$-approximation of unregularized OT, the resulting complexity is $\widetilde{\mathcal{O}}(n^2/\varepsilon)$, improved from $\widetilde{\mathcal{O}}(n^2/\varepsilon^2)$ for Sinkhorn. On synthetic problems, color transfer, and word alignment, Acc-Sinkhorn gives a $10\times$--$30\times$ speedup over Sinkhorn at small regularization.
We perform a mathematical and statistical analysis of the Wasserstein least squares problem, a regression method for vector-valued covariates and distribution-valued responses. Our proposal contrasts with other distributional regression methods by having a direct interpretation in terms of random variables, as a nonparametric analogue of the classic random-effects model. On the mathematical side, we use a strategy of Lavenant (2024) to show that Wasserstein least squares is the canonical extension of Euclidean least squares to the space of probability distributions from the perspective of convex analysis; this viewpoint gives rise to multimarginal and dual formulations of the Wasserstein least squares problem, extending a similar theory for Wasserstein barycenters. We perform a statistical analysis of the Wasserstein least squares problem under the template deformation model, showing, surprisingly, that estimation is possible at the n^{-1/2} rate. As a special case, we obtain improved rates of estimation for Wasserstein barycenters, which are an exponential improvement over those established by Ahidar-Coutrix, Le Gouic and Paris (2020). Finally, we propose a heuristic particle method for Wasserstein least squares and use it to conduct a novel analysis of large-scale demographic data from the RAND Health and Retirement Study.
In the generality of a rigidly-compactly generated tensor triangulated category, we introduce semi-Bousfield classes in terms of the vanishing of the tensor product in positive degrees with respect to a fixed reasonable $t$-structure. We show that semi-Bousfield classes provide a common generalisation of Bousfield classes and compactly generated tensor-compatible $t$-structures. Then we specialise to the setting of the unbounded derived category $\mathcal{D}_{\mathrm{qc}}(X)$ of a Noetherian scheme $X$ and show that the stratification bijection naturally extends to an assignment which takes a (not necessarily monotone) perversity on $X$ to a semi-Bousfield class in $\mathcal{D}_{\mathrm{qc}}(X)$. If $X$ is regular, this assignment constitutes a stratification of the whole semi-Bousfield lattice, while in the singular case, its image consists precisely of those semi-Bousfield classes arising from objects of finite Tor-dimension. Restricting this bijection to monotone perversities recovers the recent classification of compactly generated tensor-compatible $t$-structures of Dubey and Sahoo, (arXiv:2204.05015).
For a compact, connected, orientable Riemannian manifold with $b$ boundary components, we obtain geometric lower bounds for the low Steklov eigenvalues, namely $σ_k$, $1\le k\le b-1$. Our results complement earlier results, which apply only to $σ_k$ with $k\ge b$ and depend on the geometry near the boundary, by showing how the interior geometry influences the low eigenvalues. Our result also yields lower bounds for the low Steklov eigenvalues in the setting of pinched negatively curved manifolds, thus recovering similar results in that context through an alternative proof. The proof of the main result is based on the trace inequality relating the Steklov eigenvalue to the Neumann eigenvalues of the connected subdomains of the manifold containing a boundary collar. The geometric coefficient appearing in this inequality is given by an explicit formula in terms of a quantity that can be interpreted as the electrical resistance of the boundary collar.
We investigate transient clustering dynamics in nonlocal aggregation-diffusion systems from an energetic perspective. Starting from a stochastic interacting particle system, we study the associated macroscopic McKean-Vlasov equation on the torus and exploit its Wasserstein gradient-flow structure to analyse the thermodynamic competition between interaction-driven aggregation and entropy-driven diffusion. Through numerical experiments for locally attractive interaction kernels, we identify alternating aggregation- and diffusion-dominated transient regimes along trajectories converging to fixed equilibria. These dynamics can be interpreted as a form of non-monotone clustering behaviour. Moreover, we demonstrate that clustering observables, such as the density peak height, are only partially coupled to the underlying energetic mechanisms and therefore do not uniquely characterise the relevant macroscopic transport dynamics. Our results highlight the role of the variational structure not only for equilibrium analysis, but also as a framework for understanding transient clustering phenomena in interacting particle systems.
We exploit the connection between quantum dot Dirac operators and $\overline\partial$-Robin Laplacians. First, we find a graphical relation between their smallest positive eigenvalues, which allows us to deduce a recipe for translating bounds (from above and below) from one to the other. As an application, we provide new upper and lower bounds for the eigenvalues of the quantum dot Dirac operators, which depend only on geometric quantities of the underlying domain. In particular, we obtain some Faber-Krahn type inequalities for convex thin domains.
We prove an inversion theorem for recursive formulas satisfied by certain families of converging power series in two variables. These power series are indexed by the Harder-Narasimhan types of principal $G$-bundles of degree $d \in π_1 G$ on a smooth projective curve $X$, where $G$ is a connected complex reductive group. As an application, we obtain a closed formula for the Hodge-Poincaré series of moduli stacks of semistable principal $G$-bundles of degree $d$. We also compute the variation of Hodge structure of the moduli stack of all principal $G$-bundles over $X$, as a function of the period matrix of that curve.
We construct the causal fermion system for globally hyperbolic spacetimes starting in the framework of algebraic quantum field theory. The fermionic projector is identified with the one-particle density operator of a quasi-free Hadamard state. The ultraviolet regularization is built into the fermionic projector via a chart-independent $i\varepsilon$-regularization scheme. The continuum limit analysis is developed in globally hyperbolic spacetimes. It is shown that the Euler-Lagrange equations of the causal action principle are satisfied in this setup if and only if the coupled Einstein-Dirac equations hold.
Recent literature shows that hypocoercivity properties of linear evolution equations (in particular their exponential decay and the sharp short time decay of their propagator norm) carry over to their discretization via the midpoint rule. This note discusses this connection for the (other) $θ$-methods, i.e.\ for $θ\ne\frac12$. It is shown that any implicit discretization with $θ\in (\frac12,1]$ (pertaining to a hypocoercive continuous-time evolution equation) is contractive, and not only hypocontractive -- in contrast to the midpoint rule. For a coercive continuous-time evolution equation, a discretization with $θ\in [0,\frac12)$ is contractive for time steps small enough.
We study Lusin-measurable functions with values in locally convex spaces. In particular, the behavior of pointwise limits of sequences of Lusin-measurable functions and exhibit pathological phenomena arising in the nonmetrizable setting. Moreover, we establish approximation and density results for $L^p$-spaces constructed with this notion of measurability, including the density of simple functions in Hausdorff locally convex spaces and convergence results obtained through dyadic approximations.
The scope of this text is to study a process that induces another proof of the Spectral Embedding Theorem: that any densely defined symmetric operator can be extended by a multiplication operator through an embedding of the Hilbert space into an $L_2$ space. Furthermore, that process is meant to be used for specific operators, where natural spectral embeddings or equivalences may be found. That process has previously been considered in arXiv:2411.06281 and in arXiv:2511.18189, where it has been introduced through nonstandard techniques. Our contribution aims to be the reformulation of the theory through classical analysis arguments, without the use of nonstandard techniques nor ultraproducts.
Given a matrix $A$, a matrix nearness problem seeks an $X$ that most closely approximates $A$ in the sense of minimizing $\lVert A - X\rVert$ under a variety of constraints on $X$. A generalized matrix nearness problem seeks the same but with three given matrices $A,B,C$ and $\lVert A - BXC\rVert$ in place of $\lVert A - X\rVert$. We extend previous studies of the latter problem in three directions: incorporating an affine term, replacing matrix product by Kronecker product in various manners, and generalizing Frobenius norm to any orthogonally invariant norm. We will solve several of these in closed form. For the rest, we develop an iterative algorithm that works for any Schatten norm, proving that it converges to a global minimizer regardless of the initial point. In addition, the algorithm relies purely on numerical linear algebra, and notably does not compute any explicit gradients or subgradients. Along the way, we will also show that there is no Mirsky-type theorem for rank constrained generalized matrix nearness problems.
We study the Dirac equation in the Reissner-Nordström geometry in horizon-penetrating coordinates up to the Cauchy horizon. A mass decomposition theorem is proved, which gives a covariant representation of the spacetime inner product that naturally involves the fermionic signature operator and the fermionic flux operator. We compute their spectra and show that both are bounded symmetric operators on the solution space $\mathcal{H}_m$ of the massive Dirac equation. The corresponding fermionic projector state is constructed and shown to satisfy the Hadamard condition. Lastly, we give some physical interpretations of the fermionic flux operator.
Given a non-zero polynomial $P(x)$, we study Fuchsian differential operators of the form $L=\partial_x^2-u(x)$ such that for all $λ\in\mathbb{C}$ the operator $L+λP(x)$ is monodromy free. We prove that all such operators are obtained from populations of critical points of ${\widehat{\mathfrak{sl}}_2}$ master functions. Moreover, we show that the reproduction procedure of critical points corresponds to a Darboux transformation of operator $P^{-1}(x)L$. As a result, we obtain a classification of all operators $L$ with such properties in the case of $P(x)=x^k$.