Dealing with Inliers in Feature Vector Data
Dheeraj Kumar, Zahra Ghafoori, James C Bezdek, Christopher Leckie, Kotagiri Ramamohanarao, Marimuthu Palaniswami
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS | WORLD SCIENTIFIC PUBL CO PTE LTD | Published : 2018
Inliers (bridge points) between clusters degrade the ability of many algorithms to find clusters in numerical data. We present three new approaches to the detection and removal of inliers. Two approaches are based on Local Outlier Factor (LOF) scores. We also discuss using LOF scores for an isolation Nearest Neighbour Ensemble (iNNE) approach to inlier detection. The third approach uses MaxiMin (MM) sampling to remove both inliers and outliers. We compare the three approaches on a synthetic and two real-life datasets. The failure of single linkage clustering due to the existence of bridging points is used as a means for evaluating the relative effectiveness of the three methods. We also show..View full abstract
We thank the support from EU FP7 SocIoTal and H2020-ICT-2014-1 OrganiCity.