Optimal Mean Estimation without a Variance | Cool Papers

#1 Optimal Mean Estimation without a Variance [PDF] [Copy] [Kimi¹] [REL]

Authors: Yeshwanth Cherapanamjeri, Nilesh Tripuraneni, Peter Bartlett, Michael Jordan

We study the problem of heavy-tailed mean estimation in settings where the variance of the data-generating distribution does not exist. Concretely, given a sample $\bm{X} = \{X_i\}_{i = 1}^n$ from a distribution $\mc{D}$ over $\mb{R}^d$ with mean $\mu$ which satisfies the following \emph{weak-moment} assumption for some ${\alpha \in [0, 1]}$ :

$\begin{equation*} \forall \norm{v} = 1: \mb{E}_{X \ts \mc{D}}[\abs{\inp{X - \mu}{v}}^{1 + \alpha}] \leq 1, \end{equation*}$ and given a target failure probability,

$\delta$ , our goal is to design an estimator which attains the smallest possible confidence interval as a function of

$n,d,\delta$ . For the specific case of

$\alpha = 1$ , foundational work of Lugosi and Mendelson exhibits an estimator achieving \emph{optimal} subgaussian confidence intervals, and subsequent work has led to computationally efficient versions of this estimator. Here, we study the case of general

$\alpha$ , and provide a precise characterization of the optimal achievable confidence interval by establishing the following information-theoretic lower bound:

$\begin{equation*} \Omega \lprp{\sqrt{\frac{d}{n}} + \lprp{\frac{d}{n}}^{\frac{\alpha}{(1 + \alpha)}} + \lprp{\frac{\log 1 / \delta}{n}}^{\frac{\alpha}{(1 + \alpha)}}}. \end{equation*}$ and devising an estimator matching the aforementioned lower bound up to constants. Moreover, our estimator is computationally efficient.

Subject: COLT.2022 - Accept

cherapanamjeri22a@v178@PMLR

#1 Optimal Mean Estimation without a Variance [PDF] [Copy] [Kimi1] [REL]

#1 Optimal Mean Estimation without a Variance [PDF] [Copy] [Kimi¹] [REL]