Journal article

Over- and underrepresentation of short DNA words in herpesvirus genomes

MY Leung, GM Marsh, TP Speed

JOURNAL OF COMPUTATIONAL BIOLOGY | MARY ANN LIEBERT INC PUBL | Published : 1996

Abstract

The relative abundance and rarity of DNA words have been recognized in previous biological studies to have implications for the regulation, repair, and evolutionary mechanisms of a genome. In this paper, we review several different measures of abundance and rarity of DNA words, including z-scores, representation ratios, and cross-ratios, that have appeared in the recent literature, and examine the concordance among them using the human cytomegalovirus genome sequence. We then rank all words of length k = 2, ..., 5 of seven herpesvirus genomes according to their abundance, as measured by one of the z-scores based upon a stationary Markov model of order k-2. Using a simple metric on the ranks ..

View full abstract

University of Melbourne Researchers