Journal article

Adjusting for Chance Clustering Comparison Measures

Simone Romano, Xuan Vinh Nguyen, James Bailey, Karin Verspoor

Journal of Machine Learning Research | MICROTOME PUBL | Published : 2016

Abstract

Adjusted for chance measures are widely used to compare partitions/clusterings of the same data set. In particular, the Adjusted Rand Index (ARI) based on pair-counting, and the Adjusted Mutual Information (AMI) based on Shannon information theory are very popular in the clustering community. Nonetheless it is an open problem as to what are the best application scenarios for each measure and guidelines in the literature for their usage are sparse, with the result that users often resort to using both. Generalized Information Theoretic (IT) measures based on the Tsallis entropy have been shown to link pair-counting and Shannon IT measures. In this paper, we aim to bridge the gap between adjus..

View full abstract