Journal article

Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling

Anil Raj, Sidney H Wang, Heejung Shim, Arbel Harpak, Yang I Li, Brett Engelmann, Matthew Stephens, Yoav Gilad, Jonathan K Ptitchard



Accurate annotation of protein coding regions is essential for understanding how genetic information is translated into function. We describe riboHMM, a new method that uses ribosome footprint data to accurately infer translated sequences. Applying riboHMM to human lymphoblastoid cell lines, we identified 7273 novel coding sequences, including 2442 translated upstream open reading frames. We observed an enrichment of footprints at inferred initiation sites after drug-induced arrest of translation initiation, validating many of the novel coding sequences. The novel proteins exhibit significant selective constraint in the inferred reading frames, suggesting that many are functional. Moreover, ..

View full abstract

University of Melbourne Researchers


Awarded by National Institutes of Health

Funding Acknowledgements

National Institutes of Health HG007036 Jonathan K PritchardNational Institutes of Health MH084703 Yoav Gilad Jonathan K PritchardNational Institutes of Health HG02585 Matthew StephensHoward Hughes Medical Institute Jonathan K PritchardThe funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication