Conference Proceedings
Mining minimal distinguishing subsequence patterns with gap constraints
XN Ji, J Bailey, GZ Dong, J Han (ed.), BW Wah (ed.), V Raghavan (ed.), X Wu (ed.), R Rastogi (ed.)
FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | IEEE COMPUTER SOC | Published : 2005
DOI: 10.1109/ICDM.2005.96
Abstract
Discovering contrasts between collections of data is an important task in data mining. In this paper, we introduce a new type of contrast pattern, called a Minimal Distinguishing Subsequence (MDS). An MDS is a minimal subsequence, that occurs frequently in one class of sequences and infrequently in sequences of another class. It is a natural way of representing strong and succinct contrast information between two sequential datasets and can be useful in applications such as protein comparison, document comparison and building sequential classification models. Mining MDS patterns is a challenging task and is significantly different from mining contrasts between relational/ transactional data...
View full abstract