Conference Proceedings

Statistical inference of protein "LEGO bricks"

AS Konagurthu, L Allison, D Abramson, PJ Stuckey, AM Lesk

Proceedings IEEE International Conference on Data Mining Icdm | IEEE | Published : 2013

Abstract

Proteins are biomolecules of life. They fold into a great variety of three-dimensional (3D) shapes. Underlying these folding patterns are many recurrent structural fragments or building blocks (analogous to 'LEGO® bricks'). This paper reports an innovative statistical inference approach to discover a comprehensive dictionary of protein structural building blocks from a large corpus of experimentally determined protein structures. Our approach is built on the Bayesian and information-theoretic criterion of minimum message length. To the best of our knowledge, this work is the first systematic and rigorous treatment of a very important data mining problem that arises in the cross-disciplinary ..

View full abstract

University of Melbourne Researchers