Conference Proceedings
Comparison of Distance metrics for hierarchical data in medical databases
D Hassan, U Aickelin, C Wagner
Proceedings of the International Joint Conference on Neural Networks | IEEE | Published : 2014
DOI: 10.2139/ssrn.2828084
Abstract
Distance metrics are broadly used in different research areas and applications, such as bio-informatics, data mining and many other fields. However, there are some metrics, like pg-gram and Edit Distance used specifically for data with a hierarchical structure. Other metrics used for non-hierarchical data are the geometric and Hamming metrics. We have applied these metrics to The Health Improvement Network (THIN) database which has some hierarchical data. The THIN data has to be converted into a tree-like structure for the first group of metrics. For the second group of metrics, the data are converted into a frequency table or matrix, then for all metrics, all distances are found and normali..
View full abstract