A Systematic Evaluation of Black-Box Uncertainty Estimation Methods for Large Language Models

#1 A Systematic Evaluation of Black-Box Uncertainty Estimation Methods for Large Language Models [PDF] [Copy] [Kimi] [REL]

Although large language models (LLMs) have shown strong capabilities across a wide range of tasks, their outputs often remain unreliable and may contain hallucinations, making uncertainty estimation (UE) essential for building trustworthy LLMs. In practice, many mainstream LLMs are only accessible through restricted APIs, where internal signals such as logits and hidden states are unavailable, making black-box UE especially important. However, existing work on black-box UE for LLMs remains fragmented in methodology and lacks a unified empirical comparison. To address this gap, we present a systematic review of black-box UE methods and organize them into five categories: verbalization-based, sampling-based, explanation-based, multi-agent, and hybrid methods. We further build a unified evaluation framework and benchmark 24 representative methods across 4 models and 4 dataset settings. Our results show that no single method consistently dominates across all settings. Nevertheless, methods that reason over and compare candidates in the answer space are generally effective, and hybrid methods that combine multiple uncertainty signals perform well under most conditions. By releasing the benchmark data and a unified evaluation framework, we aim to facilitate reproducible comparisons and support future research, while our empirical findings provide practical guidance for developing future black-box UE methods for LLMs.

Subject: Artificial Intelligence

Publish: 2026-06-18 07:27:34 UTC

2606.19868

#1 A Systematic Evaluation of Black-Box Uncertainty Estimation Methods for Large Language Models [PDF] [Copy] [Kimi] [REL]