Statistics

2026-03-03 | Total: 118

#1 Comparative Analysis of Spatiotemporal Volatility Models: An Empirical Study on Financial Network Series

Authors: Ariane N. Meli Chrisko, Jessie Li, Philipp Otto, Wolfgang Schmid

Various spatiotemporal and network GARCH models have recently been proposed to capture volatility interactions, such as the transmission of market risk across financial networks. These approaches rely heavily on the specification of the adjacency or spatiotemporal weight matrix, for which several alternatives exist in the literature. This paper evaluates the out-of-sample forecasting performance of a range of spatiotemporal volatility models and multivariate GARCH benchmarks under nine alternative network specifications. The empirical analysis uses daily data for 16 sectorally diversified S&P 500 stocks from 22 December 1998 to 20 October 2024. A one-step-ahead forecasting framework is implemented, and models are assessed using BIC, RMSFE, and MAFE, with forecasts evaluated against a single realised volatility proxy based on squared log-returns. The nine spatial weight matrices reflect diverse economic and statistical relationships, including Granger-filtered and EGARCH-based spillovers. Results show that some spatiotemporal models outperform standard GARCH benchmarks in out-of-sample forecasting accuracy. Notably, the Dynamic Spatiotemporal ARCH model achieves the lowest RMSFE and MAFE across all network specifications at minimal computational cost. Pairwise Diebold-Mariano tests confirm significant differences in predictive accuracy. These findings underscore the value of incorporating spatial structure into volatility modelling as a parsimonious and interpretable alternative for financial network analysis.

Subject: Applications

Publish: 2026-03-02 18:54:35 UTC
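
The abstract above scores one-step-ahead volatility forecasts with RMSFE and MAFE against a squared log-return proxy. A minimal sketch of those two criteria follows, assuming the usual definitions (root mean squared and mean absolute forecast error); the function names are illustrative and not taken from the paper.

```python
import math


def squared_log_returns(prices):
    """Realised-volatility proxy: squared log-return between consecutive prices."""
    return [math.log(b / a) ** 2 for a, b in zip(prices, prices[1:])]


def rmsfe(forecasts, proxies):
    """Root mean squared forecast error between forecasts and the proxy."""
    n = len(forecasts)
    return math.sqrt(sum((f - p) ** 2 for f, p in zip(forecasts, proxies)) / n)


def mafe(forecasts, proxies):
    """Mean absolute forecast error between forecasts and the proxy."""
    n = len(forecasts)
    return sum(abs(f - p) for f, p in zip(forecasts, proxies)) / n
```

Lower values of either criterion indicate forecasts closer to the proxy; the paper's model comparison additionally relies on BIC and Diebold-Mariano tests, which this sketch does not cover.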


#2 Algebraic statistics of Hüsler-Reiss graphical models in multivariate extremes

Authors: Carlos Améndola, Jane Ivy Coons, Alexandros Grosdos, Frank Röttger

The field of extreme value statistics is concerned with modeling and predicting rare events. In a Hüsler-Reiss graphical model, a graph represents extremal conditional independence (CI) relations between random variables. These models are exponential families parameterized by a graph Laplacian and are considered the analogue of multivariate Gaussian models in the extremal setting. We study these models from the perspective of algebraic geometry. Translating the CI relations into polynomial constraints in the parameters, we define extremal CI ideals and find a determinantal representation of their generators. In terms of parametric inference, we study the extremal maximum likelihood degree as the number of solutions to a conditionally negative definite matrix completion problem. We also define and analyze the extremal maximum likelihood threshold for Hüsler-Reiss graphical models, which provides a certificate for the existence of a surrogate MLE in terms of the dimensionality of the point configuration that realizes the underlying summary statistic as a Euclidean distance matrix. We highlight throughout many interesting similarities but also differences with respect to Gaussian graphical models.

Subjects: Statistics Theory, Algebraic Geometry

Publish: 2026-03-02 18:53:28 UTC


#3 Setwise Hierarchical Variable Selection and the Generalized Linear Step-Up Procedure for False Discovery Rate Control

Authors: Sarah Organ, Toby Kenney, Hong Gu

Controlling the false discovery rate (FDR) in variable selection becomes challenging when predictors are correlated, as existing methods often exclude all members of correlated groups and consequently perform poorly for prediction. We introduce a new setwise variable-selection framework that identifies clusters of potential predictors rather than forcing selection of a single variable. By allowing any member of a selected set to serve as a surrogate predictor, our approach supports strong predictive performance while maintaining rigorous FDR control. We construct sets via hierarchical clustering of predictors based on correlation, then test whether each set contains any non-null effects. Similar clustering and setwise selection have been applied in the familywise error rate (FWER) control regime, but previous research has been unable to overcome the inherent challenges of extending this to the FDR control framework. To control the FDR, we develop substantial generalizations of linear step-up procedures, extending the Benjamini-Hochberg and Benjamini-Yekutieli methods to accommodate the logical dependencies among these composite hypotheses. We prove that these procedures control the FDR at the nominal level and highlight their broader applicability. Simulation studies and real-data analyses show that our methods achieve higher power than existing approaches while preserving FDR control, yielding more informative variable selections and improved predictive models.

Subject: Methodology

Publish: 2026-03-02 18:24:16 UTC
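
For orientation, the classical linear step-up procedures that the abstract above generalizes can be sketched as follows. This is the textbook Benjamini-Hochberg rule with an optional Benjamini-Yekutieli correction, not the authors' setwise extension; the function and parameter names are hypothetical.

```python
def linear_step_up(p_values, q=0.05, dependence_correction=False):
    """Return the sorted indices of hypotheses rejected at FDR level q.

    dependence_correction=False gives Benjamini-Hochberg;
    dependence_correction=True divides q by the harmonic sum 1 + 1/2 + ... + 1/m
    (Benjamini-Yekutieli).
    """
    m = len(p_values)
    if m == 0:
        return []
    c_m = sum(1.0 / j for j in range(1, m + 1)) if dependence_correction else 1.0
    # Indices of the p-values in increasing order.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Step-up: find the largest rank k whose p-value is below its threshold.
    k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * q / (m * c_m):
            k = rank
    # Reject the k smallest p-values, even if some earlier ones missed their own threshold.
    return sorted(order[:k])
```

The step-up character shows in the last line: a late p-value that clears its threshold pulls in every smaller p-value with it.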


#4 Instrumental and Proximal Causal Inference with Gaussian Processes

Authors: Yuqi Zhang, Krikamol Muandet, Dino Sejdinovic, Edwin Fong, Siu Lun Chau

Instrumental variable (IV) and proximal causal learning (Proxy) methods are central frameworks for causal inference in the presence of unobserved confounding. Despite substantial methodological advances, existing approaches rarely provide reliable epistemic uncertainty (EU) quantification. We address this gap through a Deconditional Gaussian Process (DGP) framework for uncertainty-aware causal learning. Our formulation recovers popular kernel estimators as the posterior mean, ensuring predictive precision, while the posterior variance yields principled and well-calibrated EU. Moreover, the probabilistic structure enables systematic model selection via marginal log-likelihood optimization. Empirical results demonstrate strong predictive performance alongside informative EU quantification, evaluated via empirical coverage frequencies and decision-aware accuracy rejection curves. Together, our approach provides a unified, practical solution for causal inference under unobserved confounding with reliable uncertainty.

Subjects: Machine Learning , Machine Learning

Publish: 2026-03-02 18:23:26 UTC


#5 Socio-Spatial Patterns of Suicide Mortality in the United States

Authors: Kushagra Tiwari, M. Amin Rahimian, Marie-Laure Charpignon, Philippe J. Giabbanelli, Praveen Kumar

Suicides cause over 49,000 deaths yearly in the United States, 55% involving firearms. Suicide mortality exhibits substantial geographical and sociodemographic heterogeneity; yet the role of social networks remains underexplored. To assess how suicide risk and firearm restriction policies propagate through social ties, we integrate county-level suicide mortality data (2010-2022) with the Facebook Social Connectedness Index (SCI). We also examine Extreme Risk Protection Orders (ERPO), state-level policies restricting firearm access for individuals at risk of self-harm. In two-way fixed effects regressions, a one-standard-deviation increase in the SCI-weighted average suicide mortality rate of connected counties was associated with +2.78 deaths per 100,000 in a focal county, while a one-standard-deviation increase in ERPO social exposure was associated with -0.214 deaths per 100,000. These associations persisted when adjusting for geographic proximity and including state-by-year fixed effects, and confirm the effect of social networks on diffusion of both harmful exposures and protective interventions.

Subjects: Applications, Social and Information Networks, Physics and Society, Other Statistics

Publish: 2026-03-02 17:47:27 UTC


#6 TRAKNN: Efficient Trajectory Aware Spatiotemporal kNN for Rare Meteorological Trajectory Detection

Authors: Guillaume Coulaud, Davide Faranda

Extreme weather events, such as windstorms and heatwaves, are driven by persistent atmospheric circulation patterns that evolve over several consecutive days. While traditional circulation-based studies often focus on instantaneous atmospheric states, capturing the temporal evolution, or trajectory, of these spatial fields is essential for characterizing rare and potentially impactful atmospheric behavior. However, performing an exhaustive similarity search on multi-decadal, continental-scale gridded datasets presents significant computational and memory challenges. In this paper, we propose TRAKNN (TRajectory Aware KNN), a fully unsupervised and data-agnostic framework for detecting geometrically rare short trajectories in spatio-temporal data with an exact kNN approach. TRAKNN leverages a recurrence-based algorithm that decouples computational complexity from trajectory length and efficient batch operations, maximizing computational intensity. These optimizations enable exhaustive analysis on standard workstations, either on CPU or on GPU. We evaluate our approach on 75 years of daily European sea-level pressure data. Our results illustrate that rare trajectories identified by TRAKNN correspond to physically coherent atmospheric anomalies and align with independent extreme-event databases.

Subjects: Machine Learning , Machine Learning

Publish: 2026-03-02 16:49:02 UTC


#7 Analysis of Stepped-Wedge Randomised Cluster Trial using a generalized pairwise comparison approach: a simulation study

Authors: Yohan Bard, Emilie Presles, Marc Buyse, Silvy Laporte, Paul Zufferey, Frederikus A. Klok, Olivier Sanchez, Francis Couturaud, Edouard Ollier

Stepped-wedge cluster randomised trials (SW-CRTs) increasingly evaluate complex interventions, yet methodological guidance for analysing composite endpoints using generalized pairwise comparisons (GPC) remains limited. This work investigates the performance of several GPC-based estimators in the presence of clustering, temporal trends, and varying correlation structures typical of SW-CRTs. We conducted an extensive simulation study covering a range of intraclass correlations (ICC), cluster autocorrelation coefficients (CAC), time effects, and treatment effect sizes. Eight analytical approaches were compared, including unadjusted estimators, cluster-stratified win odds, mixed-effects models applied to cluster-period win odds, and probabilistic index models (PIMs). Type I error control was strongly compromised for methods ignoring time or clustering, whereas only two approaches consistently maintained nominal error rates: a hierarchical mixed-effects model with sequence and cluster-level random slopes (b4) and a cluster-restricted PIM (c2). These two methods were further evaluated in terms of statistical power, where c2 generally showed higher efficiency, particularly under strong clustering, low CAC, or the presence of temporal trends, while both converged to similar performance for large treatment effects. Overall, our findings identify b4 and c2 as the most reliable GPC-based strategies for SW-CRT analysis and provide practical guidance for their application, including for ongoing trials such as ETHER.

Subject: Methodology

Publish: 2026-03-02 15:54:01 UTC
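
To make the "win odds" quantity in the abstract above concrete, here is a minimal sketch of an unadjusted generalized pairwise comparison. It assumes each participant is a tuple of outcomes listed in priority order with larger values being better, and it deliberately ignores clustering and time, which is exactly the kind of estimator the abstract reports as having poor type I error control in SW-CRTs. All names are hypothetical.

```python
def compare_prioritized(a, b):
    """Compare two outcome tuples in priority order: 1 if a wins, -1 if b wins, 0 if tied."""
    for x, y in zip(a, b):
        if x > y:
            return 1
        if x < y:
            return -1
    return 0


def win_odds(treated, control):
    """Unadjusted win odds over all treated-control pairs, splitting ties evenly."""
    wins = losses = ties = 0
    for t in treated:
        for c in control:
            result = compare_prioritized(t, c)
            if result > 0:
                wins += 1
            elif result < 0:
                losses += 1
            else:
                ties += 1
    denominator = losses + 0.5 * ties
    if denominator == 0:
        raise ValueError("win odds undefined: no losses and no ties")
    return (wins + 0.5 * ties) / denominator
```

A value above 1 favours the treated group. The cluster-stratified, mixed-effects, and PIM approaches compared in the paper build on this pairwise scoring but are not reproduced here.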


#8 Wasserstein-based identification of metastable states in time series data via change point detection and segment clustering

Authors: David Gentile, Joshua Huang, James M. Murphy

Change point detection for time series analysis is a difficult and important problem in applied statistics, for which a variety of approaches have been developed in the past several decades. Here, the Wasserstein metric is employed as a tool for change-point identification in multi-dimensional time series data in order to identify clusters in time series in an unsupervised way. We leverage the simplicity of the optimal transport cost in the 1-dimensional setting to quickly identify both a segmentation (family of change points for a trajectory) and a clustering for the data when the number of segments is much smaller than the number of data points, making no parametric assumptions about the particular distributions involved. Our change point detection method scales linearly in the size of the data and in the dimension of the samples. We test our approach on idealized synthetic data trajectories, as well as real world trajectories coming from the domain of molecular dynamics simulations and underwater acoustics. We find that segmenting these time series via change points obtained by estimating the Wasserstein metric derivative and then clustering the identified segments as measures with similarity measured by the Wasserstein metric, successfully identifies metastable states in the law of the processes.

Subject: Statistics Theory

Publish: 2026-03-02 15:42:04 UTC
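
The abstract above leans on the simplicity of optimal transport in one dimension. A minimal sketch of that fact: for two equal-size samples, the 1-Wasserstein distance reduces to the mean absolute difference of the sorted values. The sliding-window score below is only a simple illustration of using this distance to flag a change point; it is not the authors' Wasserstein metric derivative estimator, and the names and the equal-size assumption are mine.

```python
def wasserstein_1d(xs, ys):
    """1-Wasserstein distance between two equal-size 1-D samples via sorting."""
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes samples of equal size")
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


def change_scores(series, window):
    """Map each time t to the distance between the windows just before and just after t."""
    scores = {}
    for t in range(window, len(series) - window + 1):
        scores[t] = wasserstein_1d(series[t - window:t], series[t:t + window])
    return scores
```

For a series that jumps from one level to another, the score peaks at the jump, which is the intuition behind segmenting at large changes in distribution.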


#9 Quantifying Uncertainty in Void Swelling Prediction: A Conformal Prediction Framework for Reactor Safety Margins

Authors: Minhee Kim, Yong Yang

Irradiation-induced void swelling is a critical degradation mechanism for structural materials in nuclear reactors, dictating component operational lifespan and safety. While recent machine learning (ML) approaches have improved the accuracy of swelling rate predictions, they often fail to account for the inherent stochasticity of radiation damage, providing point estimates without rigorous uncertainty quantification. This lack of probabilistic context limits their applications in materials qualification, reactor licensing and risk assessment. In this work, we develop a framework that integrates ensemble ML models with Conformal Prediction (CP) to generate statistically calibrated prediction intervals. Unlike standard error estimation or Bayesian methods that often rely on rigid distributional assumptions, this approach specifically addresses the physical heteroscedasticity of swelling data, where variance transitions from the nucleation-dominated incubation regime to the growth-dominated steady-state regime. We demonstrate that log-transformed conformal prediction inference provides valid empirical coverage consistent with target confidence levels even in sparse data regimes. This framework offers a pathway to replace overly conservative upper-bound curves with Probabilistic Risk Assessment (PRA) tools for high-dose reactor core internals.

Subject: Applications

Publish: 2026-03-02 15:37:34 UTC


#10 Density-Matrix Spectral Embeddings for Categorical Data: Operator Structure and Stability

Authors: Raquel Bosch-Romeu, Antonio Falcó, José-Antonio Rodríguez-Gallego

We introduce a supervised dimensionality reduction methodology for categorical (and discretized mixed-type) data based on a density-matrix construction induced by class-conditional frequencies. Given a labeled dataset encoded in a one-hot survey space, we assemble a frequency matrix whose columns aggregate feature occurrences within each class, and define a normalized Gram-type operator that satisfies the axioms of a density matrix. The resulting representation admits an intrinsic rank bound controlled by the number of classes, enabling low-dimensional spectral embeddings via dominant eigenmodes. Classification is performed in the reduced space through class-conditional kernel density estimation and a maximum-likelihood decision rule. We establish structural invariances, provide complexity estimates, and validate the approach on synthetic benchmarks probing high cardinality, sparsity, noise, and class imbalance.

Subjects: Machine Learning, Numerical Analysis

Publish: 2026-03-02 15:29:54 UTC
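
One way to read the construction in the abstract above: stack class-conditional feature counts into a matrix F (features by classes), form the Gram-type matrix F Fᵀ, and divide by its trace. The result is symmetric, positive semidefinite, and has unit trace, and its rank cannot exceed the number of classes. The sketch below assumes this trace normalization; the paper's exact normalization is not stated in the abstract, and the function name is hypothetical.

```python
def density_matrix(freq):
    """Trace-normalized Gram matrix of a features-by-classes frequency matrix.

    freq[i][c] is the count of one-hot feature i within class c.
    """
    d = len(freq)
    # Gram matrix G = F F^T: inner products between feature rows across classes.
    gram = [
        [sum(a * b for a, b in zip(freq[i], freq[j])) for j in range(d)]
        for i in range(d)
    ]
    trace = sum(gram[i][i] for i in range(d))
    if trace == 0:
        raise ValueError("frequency matrix is all zeros")
    return [[g / trace for g in row] for row in gram]
```

Because F has only as many columns as there are classes, at most that many eigenvalues of the result are nonzero, which is the rank bound the abstract uses to justify a low-dimensional spectral embedding.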


#11 LOCUS: A Distribution-Free Loss-Quantile Score for Risk-Aware Predictions

Authors: Matheus Barreto, Mário de Castro, Thiago R. Ramos, Denis Valle, Rafael Izbicki

Modern machine learning models can be accurate on average yet still make mistakes that dominate deployment cost. We introduce Locus, a distribution-free wrapper that produces a per-input loss-scale reliability score for a fixed prediction function. Rather than quantifying uncertainty about the label, Locus models the realized loss of the prediction function using any engine that outputs a predictive distribution for the loss given an input. A simple split-calibration step turns this function into a distribution-free interpretable score that is comparable across inputs and can be read as an upper loss level. The score is useful on its own for ranking, and it can optionally be thresholded to obtain a transparent flagging rule with distribution-free control of large-loss events. Experiments across 13 regression benchmarks show that Locus yields effective risk ranking and reduces large-loss frequency compared to standard heuristics.

Subjects: Machine Learning , Machine Learning

Publish: 2026-03-02 15:25:50 UTC


#12 A Simulation Study to Compare Inferential Properties when Modelling Ordinal Outcomes: The Case for the (Plain but Robust) Proportional Odds Model

Authors: Stefan Inerle, Markus Pauly, Moritz Berger

Ordinal measurements are common outcomes in studies within psychology, as well as in the social and behavioral sciences. Choosing an appropriate regression model for analysing such data poses a difficult task. This paper aims to facilitate modeling decisions for quantitative researchers by presenting the results of an extensive simulation study on the inferential properties of common ordinal regression models: the proportional odds model, the category-specific odds model, the location-shift model, the location-scale model, and the linear model, which incorrectly treats ordinal outcomes as metric. The simulations were conducted under different data generating processes based on each of the ordinal models and varying parameter configurations within each model class. We examined the bias of parameter estimates as well as type I error rates ($α$-errors) and the power of statistical parameter testing procedures corresponding to the respective models. Our findings reveal several highlights. For parameter estimates, we observed that cumulative ordinal regression models exhibited large biases in cases of large parameter values and high skewness of the outcome distribution in the true data generation process. Regarding statistical hypothesis testing, the proportional odds model and the linear model showed the most reliable results. Due to its better fit and interpretability for ordinal outcomes, we recommend the use of the proportional odds model unless there are relevant contraindications.

Subject: Methodology

Publish: 2026-03-02 14:57:24 UTC


#13 Co-optimization for Adaptive Conformal Prediction

Authors: Xiaoyi Su, Zhixin Zhou, Rui Luo

Conformal prediction (CP) provides finite-sample, distribution-free marginal coverage, but standard conformal regression intervals can be inefficient under heteroscedasticity and skewness. In particular, popular constructions such as conformalized quantile regression (CQR) often inherit a fixed notion of center and enforce equal-tailed errors, which can displace the interval away from high-density regions and produce unnecessarily wide sets. We propose Co-optimization for Adaptive Conformal Prediction (CoCP), a framework that learns prediction intervals by jointly optimizing a center $m(x)$ and a radius $h(x)$. CoCP alternates between (i) learning $h(x)$ via quantile regression on the folded absolute residual around the current center, and (ii) refining $m(x)$ with a differentiable soft-coverage objective whose gradients concentrate near the current boundaries, effectively correcting mis-centering without estimating the full conditional density. Finite-sample marginal validity is guaranteed by split-conformal calibration with a normalized nonconformity score. Theory characterizes the population fixed point of the soft objective and shows that, under standard regularity conditions, CoCP asymptotically approaches the length-minimizing conditional interval at the target coverage level as the estimation error and smoothing vanish. Experiments on synthetic and real benchmarks demonstrate that CoCP yields consistently shorter intervals and achieves state-of-the-art conditional-coverage diagnostics.

Subjects: Machine Learning , Machine Learning

Publish: 2026-03-02 10:43:19 UTC
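
The final calibration step named in the abstract above, split-conformal calibration with a normalized nonconformity score, can be sketched as follows. The sketch assumes the score |y - m(x)| / h(x) and takes the center and radius predictions as given lists; how CoCP learns m and h is not reproduced, and all function names are hypothetical.

```python
import math


def conformal_quantile(scores, alpha):
    """Finite-sample quantile: the ceil((n + 1)(1 - alpha))-th smallest calibration score."""
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this level: only the trivial interval is valid.
        return math.inf
    return sorted(scores)[k - 1]


def calibrate(y_cal, m_cal, h_cal, alpha):
    """Compute the scaling factor q_hat from held-out calibration data."""
    scores = [abs(y - m) / h for y, m, h in zip(y_cal, m_cal, h_cal)]
    return conformal_quantile(scores, alpha)


def interval(m, h, q_hat):
    """Prediction interval: center m(x) plus or minus q_hat times radius h(x)."""
    return (m - q_hat * h, m + q_hat * h)
```

Under exchangeability of calibration and test points, intervals built this way cover the response with probability at least 1 - alpha marginally, whatever the quality of m and h; better m and h only make the intervals shorter.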


#14 A spatial scan statistic for categorical, functional data

Authors: Camille Frévent, Moustapha Sarr, Sophie Dabo-Niang

We have developed and tested a spatial scan statistic for categorical, functional data (CFSS) - a data structure within which current approaches cannot identify spatial clusters. Our methodology combines an encoding scheme for categorical, functional observations with a nonparametric scan statistic. In a simulation study with three distinct scenarios, the CFSS accurately recovered the simulated spatial clusters and gave very low false positive rates, high true positive rates, and high positive predictive values. We have also used the CFSS to identify and characterize spatial clusters in French air pollution data from the winter of 2024.

Subject: Methodology

Publish: 2026-03-02 10:41:54 UTC


#15 Power and Sample Size Calculations for Bayes Factors in two-arm clinical Phase II Trials with binary Endpoints

Author: Riko Kelter

Bayesian sample size calculations in clinical trials usually rely on complex Monte Carlo simulations in practice. Obtaining bounds on Bayesian notions of the false-positive rate and power often lacks closed-form or approximate numerical solutions. In this paper, we focus on power and sample size calculations for Bayes factors in the two-arm binomial setting of phase II trials. We cover point-null versus composite and directional hypothesis tests, derive the corresponding Bayes factors, and discuss relevant aspects to consider when pursuing Bayesian design of experiments with the introduced approach. Based on these Bayes factors, we propose a numerical approach that makes it possible to determine the sample size needed to obtain prespecified bounds on Bayesian power and type-I-error rate in a computationally efficient way. Our method does not rely on Monte Carlo simulations and instead solely relies on standard numerical methods. Real-world examples of phase II trials from oncology and autoimmune diseases illustrate the advantage of the proposed calibration method. In summary, our approach allows for a Bayes-frequentist compromise by providing a Bayesian analogue to a frequentist power analysis for various Bayes factors in the two-arm binomial setting of a phase II clinical trial. The methods are implemented in our R package bfbin2arm.

Subjects: Methodology, Applications

Publish: 2026-03-02 10:41:34 UTC


#16 Probabilistic forecasting of weather-driven faults in electricity networks: a flexible approach for extreme and non-extreme events

Authors: Mateus Maia, Daniela Castro-Camilo, Jethro Browell

Electricity networks are vulnerable to weather damage, with severe events often leading to faults and power outages. Timely forecasts of fault occurrences, ranging from nowcasts to several days ahead, can enhance preparedness, support faster response, and reduce outage durations. To be operationally useful, such forecasts must quantify uncertainty, enabling risk-informed resource allocation. We present a novel probabilistic framework for forecasting fault counts that captures typical and extreme events. Non-extreme faults are modeled by linearly interpolating estimates from multiple additive quantile regressions, while extreme events are described through a discrete generalized Pareto distribution. To incorporate the impact of weather fluctuations, we use ensemble numerical weather predictions, which help to quantify uncertainty in the forecasts. This approach is designed to provide reliable fault predictions up to four days ahead. We evaluate the model through numerical experiments and apply it to historical fault data from two electricity distribution networks in Great Britain. The resulting forecasts demonstrate substantial improvements over business-as-usual and alternative modeling approaches. A practitioner trial conducted with Scottish Power Energy Networks from October 2024 to March 2025 further demonstrates the operational value of the forecasts. Engineers found them sufficiently reliable to inform decision-making, offering benefits to both network operators and electricity consumers.

Subject: Applications

Publish: 2026-03-02 09:34:43 UTC


#17 Wild Bootstrap Inference for Non-Negative Matrix Factorization with Random Effects

Author: Kenichi Satoh

Non-negative matrix factorization (NMF) is widely used for parts-based representations, yet formal inference for covariate effects is rarely available when the basis is learned under non-negativity. We introduce non-negative matrix factorization with random effects (NMF-RE), a mean-structure latent-variable model $Y=X(ΘA+U)+\mathcal{E}$ that combines covariate-driven scores with unit-specific deviations. Random effects act as a working device for modeling heterogeneity and controlling complexity; we monitor their effective degrees of freedom and enforce a df-based cap to prevent near-saturated fits. Estimation alternates closed-form ridge (BLUP-like) updates for $U$ with multiplicative non-negative updates for $X$ and $Θ$. For inference on $Θ$, we condition on $(\widehat X,\widehat U)$ and obtain fast uncertainty quantification via asymptotic linearization, a one-step Newton update, and a multiplier (wild) bootstrap; this avoids repeated constrained re-optimization. Simulations include a targeted stress test showing that, without df control, the random-effects penalty can collapse and inference for $Θ$ becomes degenerate, whereas the df-cap prevents this failure mode. The non-negativity constraint induces sparse, parts-based loadings -- a measurement-side variable selection -- while inference on $Θ$ identifies which covariates affect which components, providing covariate-side selection. Longitudinal, psychometric, spatial-flow, and text examples further illustrate stable, interpretable covariate-effect inference.

Subjects: Methodology, Machine Learning

Publish: 2026-03-02 05:29:46 UTC


#18 A Laplace-based perspective on conditional mean risk sharing

Author: Christopher Blier-Wong

The conditional mean risk-sharing (CMRS) rule is an important tool for distributing aggregate losses across individual risks, but its implementation in continuous multivariate models typically requires complicated multidimensional integrals. We develop a framework to compute CMRS allocations from the joint Laplace--Stieltjes transform of the risk vector. The LSTs of the allocation measures $ν_i(B)=\mathbb{E}[X_i\boldsymbol{1}_{\{S\in B\}}]$ are expressed as partial derivatives of the joint LST evaluated on the diagonal $t_1=\cdots=t_n$. When densities exist, this yields one-dimensional Laplace inversions for $f_S$ and $ξ_i$, and hence $h_i(s)=ξ_i(s)/f_S(s)$ on the absolutely continuous part, providing closed-form or semi-analytic solutions for a broad class of distributions. We also develop numerical inversion methods for cases where analytic inversion is unavailable. We introduce an exponential tilting procedure to stabilize numerical inversion in low-probability aggregate events. We provide several examples to illustrate the approach, including in some high-dimensional settings where existing approaches are infeasible.

Subjects: Statistics Theory, Risk Management

Publish: 2026-03-02 04:29:21 UTC
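
The link between the allocation measures and the joint transform stated in the abstract above can be made explicit with a short derivation. It is a sketch assuming non-negative risks, $S = X_1 + \cdots + X_n$, and enough integrability to differentiate under the expectation; the notation $\mathcal{L}_{\boldsymbol{X}}$ for the joint Laplace-Stieltjes transform is introduced here for illustration and may differ from the paper's.

```latex
\begin{align*}
\mathcal{L}_{\boldsymbol{X}}(t_1,\dots,t_n)
  &= \mathbb{E}\!\left[\exp\!\Big(-\sum_{j=1}^{n} t_j X_j\Big)\right], \\
\frac{\partial}{\partial t_i}\,\mathcal{L}_{\boldsymbol{X}}(t_1,\dots,t_n)
  &= -\,\mathbb{E}\!\left[X_i \exp\!\Big(-\sum_{j=1}^{n} t_j X_j\Big)\right], \\
\left.\frac{\partial}{\partial t_i}\,\mathcal{L}_{\boldsymbol{X}}(t_1,\dots,t_n)\right|_{t_1=\cdots=t_n=t}
  &= -\,\mathbb{E}\!\left[X_i\, e^{-tS}\right]
   = -\int_{[0,\infty)} e^{-ts}\,\nu_i(\mathrm{d}s).
\end{align*}
```

So the transform of $ν_i$ is minus the $i$-th partial derivative of the joint transform on the diagonal. When densities exist, inverting this one-dimensional transform gives $ξ_i$, inverting $\mathcal{L}_{\boldsymbol{X}}(t,\dots,t)$ gives $f_S$, and the allocation follows as $h_i(s)=ξ_i(s)/f_S(s)$.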


#19 A Hybrid Particle Gaussian Mixture Filtering Method for Cislunar Orbit Determination Under Extreme Uncertainty

Authors: Ishan Paranjape, Tarun Hejmadi, Utkarsh Ranjan Mishra, Suman Chakravorty

Gauss's method of orbit determination (OD) and its variants are among the most popular initial state estimation techniques for astronomers and engineers alike. However, owing to its assumptions regarding the two-body problem, Gauss's method is inapplicable in the cislunar domain, where three body effects dominate. We introduce a hybrid Particle Gaussian Mixture filtering method, a purely recursive probabilistic orbit determination framework based on a combination of the Markov Chain Monte Carlo based Particle Gaussian Mixture-II (PGM-II) and Particle Gaussian Mixture-I (PGM-I) filters. This method enables us to fuse probabilistic information with angles-only observations from terrestrial telescopes for short and long-term cislunar target tracking. We demonstrate this technique on an important cislunar orbit regime.

Subjects: Applications, Instrumentation and Methods for Astrophysics

Publish: 2026-03-02 04:20:48 UTC


#20 Wrapped flat-top kernel density estimation with circular data

Author: Yasuhito Tsuruta

Kernel density estimators with circular data have been studied extensively for decades, as they allow flexible estimations even when the shape of the underlying density is complex. Many recent studies have examined bias correction methods; however, these methods are limited by the order when trying to improve the convergence rate of the bias, even if the true density is sufficiently smooth. To overcome this limitation, the present study considers a new bias correction approach based on the characteristic functions of the underlying circular density. We introduce wrapped flat-top kernels, which are generated by wrapping the standard flat-top kernels defined on the real line onto the circumference of a unit circle. The asymptotic mean squared errors of the wrapped flat-top kernel density estimators are then derived. The results show that the convergence rate of these estimators is faster than that of previously introduced estimators. Furthermore, wrapped flat-top kernel density estimators achieve $\sqrt{n}$-consistency under the characteristic function of finite support, such as the circular uniform and cardioid distributions. We confirm these theoretical results in the numerical experiments. In empirical analyses, we also show that wrapped flat-top kernel density estimators effectively capture the shape of data. Therefore, such estimators are expected to allow flexible and accurate estimation in circular data analysis.

Subject: Methodology

Publish: 2026-03-02 03:09:19 UTC


#21 Differential gene expression analysis via two-component mixture models with a semiparametric skew-normal scale mixture alternative

Authors: Sangkon Oh, Geoffrey J. McLachlan

Two-component mixture models are particularly useful for identifying differentially expressed genes, but their performance can deteriorate markedly when the alternative distribution departs from parametric assumptions or symmetry. We propose a semiparametric mixture model in which the null component is standard normal and the alternative follows a skew-normal scale mixture with an unspecified scale mixing distribution. This formulation accommodates skewness and heavy tails, providing a flexible and computationally tractable tool for differential gene-expression analysis without restrictive distributional assumptions. We establish identifiability and consistency of the model and develop an efficient estimation algorithm that incorporates nonparametric maximum likelihood estimation of the scale distribution. Numerical studies show notable improvements over existing parametric and nonparametric approaches for modeling the alternative distribution, and applications to colon cancer and leukemia datasets demonstrate reduced false discovery and false negative rates.

Subject: Methodology

Publish: 2026-03-02 02:22:09 UTC


#22 Integration of Individual Participant and Aggregate Data Under Dataset Shift: Summary Statistic Comparison and Scalable Computation

Authors: Ming-Yueh Huang, Jing Qin, Chiung-Yu Huang

Integrated IPD-AD analysis, which combines individual participant data (IPD) with aggregate data (AD), is increasingly recognized as an effective strategy for generating more reliable and generalizable inferences from heterogeneous studies. While most existing work has focused on algorithmic approaches, this paper investigates a complementary yet underexplored question: how different forms of AD influence the efficiency of data integration. Working within a constrained maximum likelihood estimation framework, we compare commonly reported summary statistics and show that subgroup-specific summaries can substantially improve estimation efficiency. In particular, we find that AD derived from outcome-stratified subgroups (e.g., cases and controls) consistently yield greater efficiency gains than those based on covariate-stratified subgroups (e.g., age or exposure categories), especially when the outcome is continuous. Although outcome-stratified summaries are commonly reported for discrete outcomes, they are rarely provided when the outcome is continuous. Our findings therefore support the routine inclusion of outcome-stratified summaries for continuous endpoints in trial reports and public data repositories to facilitate more efficient evidence synthesis. We further extend the constrained maximum likelihood framework to accommodate dataset shift and develop a fast, non-iterative estimation procedure to improve numerical stability and scalability. We illustrate the proposed methodology with two applications: an analysis of income data under covariate shift and an analysis of housing data under prior probability shift.

Subject: Methodology

Publish: 2026-03-02 02:19:26 UTC


#23 Multi-pathogen situational assessment and forecasting of respiratory disease in Aotearoa New Zealand

Authors: M. J. Plank, A. R. Young, K. L. Senior, R. J. Tobin, M. O'Hara-Wild, F. Callaghan, F. Shearer, O. Eales

Real-time analysis of epidemic trends and forecasts can help support public health planning and the response to seasonal respiratory disease. Here, we present two models that were used in a 2025 New Zealand winter situational assessment programme for three respiratory pathogens: SARS-CoV-2, influenza and respiratory syncytial virus (RSV). These models were run weekly from May to October 2025 on real-time disease surveillance data and provided a quantitative representation of the current epidemic trend, along with estimates of the epidemic growth rate and 28-day-ahead forecasts of case incidence. Model results and interpretation were provided in weekly reports to public health partners as part of a trans-Tasman winter programme run by the Australia–Aotearoa Consortium for Epidemic Forecasting and Analytics (ACEFA). We compare in-season results that were included in these reports to a retrospective analysis of the complete data for the season. We conclude that real-time analyses performed reasonably well, and identify some areas for improvement in future winter situational assessment programmes.

Subject: Applications

Publish: 2026-03-02 02:15:51 UTC


#24 Causal Effects with Unobserved Unit Types in Interacting Human-AI Systems

Authors: William Overman, Sadegh Shirani, Mohsen Bayati

We study experiments on interacting populations of humans and AI agents, where both unit types and the interaction network remain unobserved. Although causal effects propagate throughout the system, the goal is to estimate effects on humans. Examples include online platforms where human users interact alongside AI-driven accounts. We assume a human-AI prior that gives each unit a probability of being human. While humans cannot be distinguished at the unit level, the prior allows us to compute the average human composition within large subpopulations. We then model outcome dynamics through a causal message passing (CMP) framework and analyze sample-mean outcomes across subpopulations. We show that by constructing subpopulations that vary in expected human composition and treatment exposure, one can consistently recover human-specific causal effects. Our results characterize when distributional knowledge of population composition (without observing unit types or the interaction network) is sufficient for identification. We validate the approach on a simulated human-AI platform driven by behaviorally differentiated LLM agents. Together, these results provide a theoretical and practical framework for experimentation in emerging human-AI systems.
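
The identification idea in this abstract, recovering human-specific effects from subpopulations that differ in expected human composition and treatment exposure, can be illustrated with a deliberately simplified static model. The sketch below assumes each subpopulation's mean outcome is a composition-weighted mix of a human response and an AI response, both linear in exposure. It omits the interaction dynamics of the causal message passing framework entirely, and every name and number in it is hypothetical.

```python
import numpy as np

def effects_from_subpop_means(p_human, exposure, mean_outcome):
    """Least-squares fit of the assumed static mixing model

        mean_s = p_s * (a_H + tau_H * e_s) + (1 - p_s) * (a_A + tau_A * e_s),

    where p_s is the expected human share and e_s the treatment exposure
    of subpopulation s. Returns the four fitted coefficients.
    """
    p = np.asarray(p_human, dtype=float)
    e = np.asarray(exposure, dtype=float)
    y = np.asarray(mean_outcome, dtype=float)
    design = np.column_stack([p, p * e, 1.0 - p, (1.0 - p) * e])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return {"human_baseline": coef[0], "human_effect": coef[1],
            "ai_baseline": coef[2], "ai_effect": coef[3]}

# Noise-free toy data: nine subpopulations on a 3 x 3 grid of (p, e) values.
p_grid, e_grid = np.meshgrid([0.2, 0.5, 0.8], [0.0, 0.5, 1.0])
p, e = p_grid.ravel(), e_grid.ravel()
y = p * (1.0 + 2.0 * e) + (1.0 - p) * (0.5 - 1.0 * e)
est = effects_from_subpop_means(p, e, y)
```

The fit is only well determined when the subpopulations vary in both composition and exposure; if all of them share the same human share, the human and AI coefficients cannot be separated.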

Subject: Machine Learning

Publish: 2026-03-02 00:31:48 UTC


#25 Adaptive Estimation and Inference in Conditional Moment Models via the Discrepancy Principle

Authors: Jiyuan Tan, Vasilis Syrgkanis

We study adaptive estimation and inference in ill-posed linear inverse problems defined by conditional moment restrictions. Existing regularized estimators such as Regularized DeepIV (RDIV) require prior knowledge of the smoothness of the nuisance function, typically encoded by a $\beta$-source condition, to tune their regularization parameters. In practice, this smoothness is unknown, and misspecified hyperparameters can lead to suboptimal convergence or instability. We introduce a discrepancy-principle-based framework for adaptive hyperparameter selection that automatically balances bias and variance without relying on the unknown smoothness parameter. Our framework applies to both RDIV (Li et al. [2024]) and the Tikhonov Regularized Adversarial Estimator (TRAE) (Bennett et al. [2023a]) and achieves the same rates in both weak and strong metrics. Building on this, we construct a fully adaptive doubly robust estimator for linear functionals that attains the optimal rate of the better-conditioned primal or dual problem, providing a practical, theoretically grounded approach for adaptive inference in ill-posed econometric models.
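
For readers unfamiliar with the discrepancy principle, the sketch below shows its classical finite-dimensional form for Tikhonov regularization: shrink the regularization parameter along a geometric grid until the residual falls to the assumed noise level. This is a textbook analogue under the assumption of a known noise norm `delta`, not the paper's procedure for conditional moment models; the function names, the constant `tau`, and the test problem are all illustrative choices.

```python
import numpy as np

def tikhonov(A, y, alpha):
    """Tikhonov-regularized solution of A x = y: (A^T A + alpha I)^{-1} A^T y."""
    n = A.shape[1]
    return np.linalg.solve(A.T @ A + alpha * np.eye(n), A.T @ y)

def discrepancy_principle(A, y, delta, tau=1.5, alpha0=1.0, q=0.5, max_iter=60):
    """Return the first alpha on the grid alpha0 * q**j whose residual
    satisfies ||A x_alpha - y|| <= tau * delta, together with x_alpha.

    The residual grows with alpha, so scanning downward from alpha0 picks
    the largest grid value that meets the criterion. If none does within
    max_iter steps, the last alpha tried is returned.
    """
    alpha = alpha0
    for _ in range(max_iter):
        x = tikhonov(A, y, alpha)
        if np.linalg.norm(A @ x - y) <= tau * delta:
            return alpha, x
        alpha *= q
    return alpha, x

# Ill-conditioned toy problem: an 8 x 8 Hilbert matrix with noisy data.
n = 8
idx = np.arange(n)
A = 1.0 / (idx[:, None] + idx[None, :] + 1.0)
x_true = np.ones(n)
rng = np.random.default_rng(0)
noise = rng.normal(size=n)
delta = 1e-3
noise *= delta / np.linalg.norm(noise)   # noise with norm exactly delta
y = A @ x_true + noise
alpha, x_hat = discrepancy_principle(A, y, delta)
```

The appeal of the rule is that it needs only the noise level, not the smoothness of the unknown solution, which is the property the abstract emphasizes.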

Subject: Machine Learning

Publish: 2026-03-02 00:23:20 UTC