Finding short DNA motifs using permuted Markov models
X Zhao, H Huang, TP Speed
Proceedings of the Annual International Conference on Computational Molecular Biology, RECOMB | Published : 2004
Many short DNA motifs such as transcription factor binding sites (TFBS) and splice sites exhibit strong local as well as non-local dependence. We introduce permuted variable length Markov models (PVLMM) which could capture the potentially important dependencies among positions, and apply them to the problem of detecting splice and TFB sites. They have been satisfactory from the viewpoint of prediction performance, and also give ready biological interpretations of the sequence dependence observed. The issue of model selection is also studied.