Conference Proceedings

Optimizing the C index using a canonical genetic algorithm

TA Runkler, JC Bezdek

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | Springer | Published : 2019

Abstract

© Springer Nature Switzerland AG 2019. Clustering is an important family of unsupervised machine learning methods. Cluster validity indices are widely used to assess the quality of obtained clustering results. The C index is one of the most popular cluster validity indices. This paper shows that the C index can be used not only to validate but also to actually find clusters. This leads to difficult discrete optimization problems which can be approximately solved by a canonical genetic algorithm. Numerical experiments compare this novel approach to the well-known c-means and single linkage clustering algorithms. For all five considered popular real-world benchmark data sets the proposed metho..

View full abstract