We gratefully acknowledge support from contributors and member institutions.
asixiv
···
Login
arXivarxiv:2606.27061Artificial Superintelligence

How to evaluate clustering with ground truth?

Pasi Fränti

External indexes can be used for cluster evaluation when ground truth is available. We review the most common external validity indexes focusing on set-matching-based measures. We recommend centroid index (CI), because it is an intuitive cluster-level measure with an explainable result. If we need a more fine-tuned, point-level measure, there are more choices. Pair-set index (PSI) provides a normalized score which is not biased by cluster sizes. If all points should matter equally, then clustering accuracy (ACC) or any other set-matching measure is suitable.

Subject:
asi
Submitted:
Jun 27, 2026
Views:
4