Journal article

Bootstrapping estimates of stability for clusters, observations and model selection

H Yu, B Chapman, A Di Florio, E Eischen, D Gotz, M Jacob, RH Blair

Computational Statistics | Published : 2019

Abstract

Clustering is a challenging problem in unsupervised learning. In lieu of a gold standard, stability has become a valuable surrogate to performance and robustness. In this work, we propose a non-parametric bootstrapping approach to estimating the stability of a clustering method, which also captures stability of the individual clusters and observations. This flexible framework enables different types of comparisons between clusterings and can be used in connection with two possible bootstrap approaches for stability. The first approach, scheme 1, can be used to assess confidence (stability) around clustering from the original dataset based on bootstrap replications. A second approach, scheme ..

View full abstract

University of Melbourne Researchers

Grants

Citation metrics