2025-03-28 | | Total: 26
Germ order is a non-standard stochastic order defined through the comparison of the generating functions of the processes. This order was first introduced for branching random walks with a constant breeding law and independent dispersal of offspring, which are characterized by a one-dimensional generating function. In this work, we investigate the properties of the extension of this concept to processes characterized by a multidimensional generating function, such as general branching random walks and rumor processes. In particular, we use germ ordering to characterize the behavior of certain branching random walks and rumor processes with inhomogeneous breeding/transmitting laws.
We study Voronoi percolation on a large class of $d$-dimensional Riemannian manifolds, which includes hyperbolic space $\mathbb{H}^d$ for $d\geq 2$. We prove that as the intensity $\lambda$ of the underlying Poisson point process tends to infinity, both critical parameters $p_c(M,\lambda)$ and $p_u(M,\lambda)$ converge to the Euclidean critical parameter $p_c(\mathbb{R}^d)$. This extends a recent result of Hansen & Müller in the special case $M=\mathbb{H}^2$ to a general class of manifolds of arbitrary dimension. A crucial step in our proof, which may be of independent interest, is to show that if $M$ is simply connected and one-ended, then embedded graphs induced by a general class of tessellations on $M$ have connected minimal cutsets. In particular, this result applies to $\varepsilon$-nets, allowing us to implement a "fine-graining" argument. We also develop an annealed way of exploring the Voronoi cells that we use to characterize the uniqueness phase.
In this paper, the stochastic theta (ST) method is investigated for a class of stochastic differential equations driven by a time-changed Brownian motion, whose coefficients are time-space-dependent and satisfy the local Lipschitz condition. It is proved that under the local Lipschitz and some additional assumptions, the ST method with $\theta\in[1/2,1]$ is strongly convergent. It is also obtained that, for all positive stepsizes, the ST method with $\theta\in[1/2,1]$ is asymptotically mean square stable under a coercivity condition. With some restrictions on the stepsize, the ST method with $\theta\in[0,1/2)$ is asymptotically mean square stable under a stronger assumption. Some numerical simulations are presented to illustrate the theoretical results.
The Glivenko-Cantelli theorem is a uniform version of the strong law of large numbers. It states that for every IID sequence of random variables, the empirical measure converges to the underlying distribution (in the sense of uniform convergence of the CDF). In this work, we provide tools to study such limits of empirical measures in categorical probability. We propose two axioms, permutation invariance and empirical adequacy, that a morphism of type $X^\mathbb{N} \to X$ should satisfy to be interpretable as taking an infinite sequence as input and producing a sample from its empirical measure as output. Since not all sequences have a well-defined empirical measure, ``such empirical sampling morphisms'' live in quasi-Markov categories, which, unlike Markov categories, allow partial morphisms. Given an empirical sampling morphism and a few other properties, we prove representability as well as abstract versions of the de Finetti theorem, the Glivenko-Cantelli theorem and the strong law of large numbers. We provide several concrete constructions of empirical sampling morphisms as partially defined Markov kernels on standard Borel spaces. Instantiating our abstract results then recovers the standard Glivenko-Cantelli theorem and the strong law of large numbers for random variables with finite first moment. Our work thus provides a joint proof of these two theorems in conjunction with the de Finetti theorem from first principles.
The continuous generalized exchange-driven growth model (CGEDG) is a system of integro-differential equations describing the evolution of cluster mass under mass exchange. The rate of exchange depends on the masses of the clusters involved and the mass being exchanged. This can be viewed as both a continuous generalization of the exchange-driven growth model and a coagulation-fragmentation equation that generalizes the continuous Smoluchowski equation. Starting from a Markov jump process that describes a finite stochastic interacting particle system with exchange dynamics, we prove the weak law of large numbers for this process for sublinearly growing kernels in the mean-field limit. We establish the tightness of the stochastic process on a measure-valued Skorokhod space induced by the $1$-Wasserstein metric, from which we deduce the existence of solutions to the (CGEDG) system. The solution is shown to have a Lebesgue density under suitable assumptions on the initial data. Moreover, within the class of solutions with density, we establish the uniqueness under slightly more restrictive conditions on the kernel.
We study mean field stochastic differential equations with a diffusion coefficient that depends on the distribution function of the unknown process in a discontinuous manner, which is a type of distribution dependent regime switching. To determine the distribution function we show that under certain conditions these equations can be transformed into SDEs with deterministic coefficients using a Lamperti-type transformation. We prove an existence and uniqueness result and consider cases when the uniqueness may fail or a solution exists only for a finite time.
We study the joint spectral properties of two coupled random matrices $H^{(1)}$ and $H^{(2)}$, which are either real symmetric or complex Hermitian. The entries of these matrices exhibit polynomially decaying correlations, both within each matrix and between them. Surprisingly, we find that under extremely weak decorrelation condition, permitting $H^{(1)}$ and $H^{(2)}$ to be almost fully correlated, the fluctuations of their individual eigenvalues in the bulk of the spectrum are still asymptotically independent. Furthermore, we demonstrate that this decorrelation condition is optimal.
Population genetic processes, such as the adaptation of a quantitative trait to directional selection, may occur on longer time scales than the sweep of a single advantageous mutation. To study such processes in finite populations, approximations for the time course of the distribution of a beneficial mutation were derived previously by branching process methods. The application to the evolution of a quantitative trait requires bounds for the probability of survival $\Sn$ up to generation $n$ of a single beneficial mutation. Here, we present a method to obtain a simple, analytically explicit, either upper or lower, bound for $\Sn$ in a supercritical Galton-Watson process. We prove the existence of an upper bound for offspring distributions including Poisson and binomial. They are constructed by bounding the given generating function, $\varphi$, by a fractional linear one that has the same survival probability $\Sinf$ and yields the same rate of convergence of $\Sn$ to $\Sinf$ as $\varphi$. For distributions with at most three offspring, we characterize when this method yields an upper bound, a lower bound, or only an approximation. Because for many distributions it is difficult to get a handle on $\Sinf$, we derive an approximation by series expansion in $s$, where $s$ is the selective advantage of the mutant. We briefly review well-known asymptotic results that generalize Haldane's approximation $2s$ for $\Sinf$, as well as less well-known results on sharp bounds for $\Sinf$. We apply them to explore when bounds for $\Sn$ exist for a family of generalized Poisson distributions. Numerical results demonstrate the accuracy of our and of previously derived bounds for $\Sinf$ and $\Sn$. Finally, we treat an application of these results to determine the response of a quantitative trait to prolonged directional selection.
We recently proposed a method for estimation of states and parameters in stochastic differential equations, which included intermediate time points between observations and used the Laplace approximation to integrate out these intermediate states. In this paper, we establish a Laplace approximation for the transition probabilities in the continuous-time limit where the computational time step between intermediate states vanishes. Our technique views the driving Brownian motion as a control, casts the problem as one of minimum effort control between two states, and employs a Girsanov shift of probability measure as well as a weak noise approximation to obtain the Laplace approximation. We demonstrate the technique with examples; one where the approximation is exact due to a property of coordinate transforms, and one where contributions from non-near paths impair the approximation. We assess the order of discrete-time scheme, and demonstrate the Strang splitting leads to higher order and accuracy than Euler-type discretization. Finally, we investigate numerically how the accuracy of the approximation depends on the noise intensity and the length of the time interval.
Suppose $n$ independent random variables $X_1, X_2, \dots, X_n$ have zero mean and equal variance. We prove that if the average of $\chi^2$ distances between these variables and the normal distribution is bounded by a sufficiently small constant, then the $\chi^2$ distance between their normalized sum and the normal distribution is $O(1/n)$.
In this article, we fill a gap in the literature regarding quantitative functional central limit theorems (qfCLT) for Hawkes processes by providing an upper bound for the convergence of a nearly unstable Hawkes process toward a Cox-Ingersoll-Ross (CIR) process. Note that in this case no speed of convergence has been established even for one-dimensional marginals; we provide in this paper a control in terms of a supremum norm in $2$-Wasserstein distance. To do so, we make use of the so-called Poisson imbedding representation and provide a qfCLT formulation in terms of a Brownian sheet. Incidentally, we construct an optimal coupling between a rescaled bi-dimensional Poisson random measure and a Brownian sheet with respect to the $2$-Wasserstein distance and analyze the asymptotic quality of this coupling in detail.
This paper establishes a CLT for linear statistics of the form $\langle \mathbf{q},\boldsymbol{\sigma} \rangle$ with quantitative Berry-Esseen bounds, where $\boldsymbol{\sigma}$ is an observation from an exponential family with a quadratic form as its sufficient statistic, in the \enquote{high-temperature} regime. We apply our general result to random field Ising models with both discrete and continuous spins. To demonstrate the generality of our techniques, we apply our results to derive both quenched and annealed CLTs in various examples, which include Ising models on some graph ensembles of common interest (Erdős-Rényi, regular, dense bipartite), and the Hopfield spin glass model. Our proofs rely on a combination of Stein's method of exchangeable pairs and Chevet type concentration inequalities.
Let $\mathbb{T}$ be the two-dimensional triangular lattice, and $\mathbb{Z}$ the one-dimensional integer lattice. Let $\mathbb{T}\times \mathbb{Z}$ denote the Cartesian product graph. Consider the Ising model defined on this graph with inverse temperature $\beta$ and external field $h$, and let $\beta_c$ be the critical inverse temperature when $h=0$. We prove that for each $\beta\in[0,\beta_c)$, there exists $h_c(\beta)>0$ such that both a unique infinite $+$cluster and a unique infinite $-$cluster coexist whenever $|h|<h_c(\beta)$. The same coexistence result also holds for the three-dimensional triangular lattice.
We establish the joint scaling limit of a critical Bienaymé-Galton-Watson process with immigration (BGWI) and its (counting) local time at zero to the corresponding self-similar continuous-state branching process with immigration (CBI) and its (Markovian) local time at zero for balanced offspring and immigration laws in stable domains of attraction. Using a general framework for invariance principles of local times~\cite{MR4463082}, the problem reduces to the analysis of the structure of excursions from zero and positive levels, together with the weak convergence of the hitting times of points of the BGWI to those of the CBI. A key step in the proof of our main limit theorem is a novel Yaglom limit for the law at time $t$ of an excursion with lifetime exceeding $t$ of a scaled infinite-variance critical BGWI. Our main result implies a joint septuple scaling limit of BGWI $Z_1$, its local time at $0$, the random walks $X_1$ and $Y_1$ associated to the reproduction and immigration mechanisms, respectively, the counting local time at $0$ of $X_1$, an additive functional of $Z_1$ and $X_1$ evaluated at this functional. In the septuple limit, four different scaling sequences are identified and given explicitly in terms of the offspring generating function (modulo asymptotic inversion), the local extinction probabilities of the BGWI and the tails of return times to zero of $X_1$.
Most Kalman filters for non-linear systems, such as the unscented Kalman filter, are based on Gaussian approximations. We use Poincaré inequalities to bound the Wasserstein distance between the true joint distribution of the prediction and measurement and its Gaussian approximation. The bounds can be used to assess the performance of non-linear Gaussian filters and determine those filtering approximations that are most likely to induce error.
Cardiac real-time magnetic resonance imaging (MRI) is an emerging technology that images the heart at up to 50 frames per second, offering insight into the respiratory effects on the heartbeat. However, this method significantly increases the number of images that must be segmented to derive critical health indicators. Although neural networks perform well on inner slices, predictions on outer slices are often unreliable. This work proposes sparse Bayesian learning (SBL) to predict the ventricular volume on outer slices with minimal manual labeling to address this challenge. The ventricular volume over time is assumed to be dominated by sparse frequencies corresponding to the heart and respiratory rates. Moreover, SBL identifies these sparse frequencies on well-segmented inner slices by optimizing hyperparameters via type -II likelihood, automatically pruning irrelevant components. The identified sparse frequencies guide the selection of outer slice images for labeling, minimizing posterior variance. This work provides performance guarantees for the greedy algorithm. Testing on patient data demonstrates that only a few labeled images are necessary for accurate volume prediction. The labeling procedure effectively avoids selecting inefficient images. Furthermore, the Bayesian approach provides uncertainty estimates, highlighting unreliable predictions (e.g., when choosing suboptimal labels).
We consider the problem of constructing a least conservative estimator of the expected value $\mu$ of a non-negative heavy-tailed random variable. We require that the probability of overestimating the expected value $\mu$ is kept appropriately small; a natural requirement if its subsequent use in a decision process is anticipated. In this setting, we show it is optimal to estimate $\mu$ by solving a distributionally robust optimization (DRO) problem using the Kullback-Leibler (KL) divergence. We further show that the statistical properties of KL-DRO compare favorably with other estimators based on truncation, variance regularization, or Wasserstein DRO.
We consider the problem of estimating states and parameters in a model based on a system of coupled stochastic differential equations, based on noisy discrete-time data. Special attention is given to nonlinear dynamics and state-dependent diffusivity, where transition densities are not available in closed form. Our technique adds states between times of observations, approximates transition densities using, e.g., the Euler-Maruyama method and eliminates unobserved states using the Laplace approximation. Using case studies, we demonstrate that transition probabilities are well approximated, and that inference is computationally feasible. We discuss limitations and potential extensions of the method.
In this paper, we analyze the relative errors in various reliability measures due to the tacit assumption that the components associated with a $n$-component series system or a parallel system are independently working where the components are dependent. We use Copula functions in said error analysis. This technique generalizes the existing work on error assessment for many wide class of distributions.
In this paper, we analyze the relative errors that crop up in the various reliability measures due to the tacit assumption that the components are independently working associated with a $n$-component series system or a parallel system where the components are dependent and follow a well-defined multivariate Weibull or exponential distribution. We also list some important observations which the previous authors have not noted in their earlier works. In this paper, we focus on the incurred error in multi-component series and parallel systems having multivariate Weibull distributions. In the upcoming sections, we establish that the present study has relevance with stochastic orders and statistical dependence which were not previously pointed out by previous authors.
Although the valuation of life contingent assets has been thoroughly investigated under the framework of mathematical statistics, little financial economics research pays attention to the pricing of these assets in a non-arbitrage, complete market. In this paper, we first revisit the Fundamental Theorem of Asset Pricing (FTAP) and the short proof of it. Then we point out that discounted asset price is a martingale only when dividends are zero under all random states of the world, using a simple proof based on pricing kernel. Next, we apply Fundamental Theorem of Asset Pricing (FTAP) to find valuation formula for life contingent assets including life insurance policies and life contingent annuities. Last but not least, we state the assumption of static portfolio in a dynamic economy, and clarify the FTAP that accommodates the valuation of a portfolio of life contingent policies.
Designing efficient learning algorithms with complexity guarantees for Markov decision processes (MDPs) with large or continuous state and action spaces remains a fundamental challenge. We address this challenge for entropy-regularized MDPs with Polish state and action spaces, assuming access to a generative model of the environment. We propose a novel family of multilevel Monte Carlo (MLMC) algorithms that integrate fixed-point iteration with MLMC techniques and a generic stochastic approximation of the Bellman operator. We quantify the precise impact of the chosen approximate Bellman operator on the accuracy of the resulting MLMC estimator. Leveraging this error analysis, we show that using a biased plain MC estimate for the Bellman operator results in quasi-polynomial sample complexity, whereas an unbiased randomized multilevel approximation of the Bellman operator achieves polynomial sample complexity in expectation. Notably, these complexity bounds are independent of the dimensions or cardinalities of the state and action spaces, distinguishing our approach from existing algorithms whose complexities scale with the sizes of these spaces. We validate these theoretical performance guarantees through numerical experiments.
Linear least squares (LLS) is perhaps the most common method of data analysis, dating back to Legendre, Gauss and Laplace. Framed as linear regression, LLS is also a backbone of mathematical statistics. Here we report on an unexpected new connection between LLS and random walks. To that end, we introduce the notion of a random walk based on a discrete sequence of data samples (data walk). We show that the slope of a straight line which annuls the net area under a residual data walk equals the one found by LLS. For equidistant data samples this result is exact and holds for an arbitrary distribution of steps.
This document is an extended version of an abstract for a talk, with approximately the same title, to be held at the 7th Joint Statistical Meeting of the Deutsche Arbeitsgemeinschaft Statistik, from 24 to 28 March 2025 in Berlin. Here ``teachable'' is meant to apply to people ranging from sufficiently advanced high school pupils to university students in mathematics or statistics: For understanding most of the proposed approximation results, it should suffice to know binomial laws, their means and variances, and the standard normal distribution function (but not necessarily the concept of a corresponding normal random variable). Of the proposed approximations, some are well-known (at least to experts), and some are based on teaching experience and research at Trier University.
As $s\rightarrow0^+$, we establish limiting formulas of Besov seminorms and nonlocal perimeters associated with the Dunkl operator, a (nonlocal) differential-difference operator parameterized by multiplicity functions and finite reflection groups. Our results are further developments of both the Maz'ya--Shaposhnikova limiting formula for the Gagliardo seminorm and the asymptotic behavior of the (relative) fractional $s$-perimeter. The main contribution is twofold. On the one hand, to establish our dimension-free Maz'ya--Shaposhnikova limiting formula, we develop a simplified approach which do not depend on the density property of the corresponding Besov space and turns out to be quite robust. On the other hand, to derive the limiting formula of our nonlocal perimeter, we do not demand additional regularity on the (topological) boundary of the domain, and to obtain the converse assertion, our assumption on the boundary regularity of the domain, which allows for fractals, is much weaker than those in existing literatures.