2606.27061

Total: 1

#1 How to evaluate clustering with ground truth? [PDF] [Copy] [Kimi1] [REL]

Author: Pasi Fränti

External indexes can be used for cluster evaluation when ground truth is available. We review the most common external validity indexes focusing on set-matching-based measures. We recommend centroid index (CI), because it is an intuitive cluster-level measure with an explainable result. If we need a more fine-tuned, point-level measure, there are more choices. Pair-set index (PSI) provides a normalized score which is not biased by cluster sizes. If all points should matter equally, then clustering accuracy (ACC) or any other set-matching measure is suitable.

Subject: Artificial Intelligence

Publish: 2026-06-25 14:07:17 UTC