Journal article

Missing data imputation using decision trees and fuzzy clustering with iterative learning

S Nikfalazar, CH Yeh, S Bedingfield, HA Khorshidi

Knowledge and Information Systems | Springer | Published: 2020


Various imputation approaches have been proposed to address the issue of missing values in data mining and machine learning applications. To improve the accuracy of missing data imputation, this paper proposes a new method, DIFC, that integrates the merits of decision trees and fuzzy clustering into an iterative learning approach. To compare the performance of the DIFC method against five effective imputation methods, extensive experiments are conducted on six widely used datasets with numerical and categorical missing data, and with various amounts and types of missing values. The experimental results show that the DIFC method outperforms the other methods in terms of imputation accuracy. F..
