Journal article

Large-scale protein-protein post-translational modification extraction with distant supervision and confidence calibrated BioBERT

A Elangovan, Y Li, DEV Pires, MJ Davis, K Verspoor

BMC Bioinformatics | Published : 2022

Abstract

Motivation: Protein-protein interactions (PPIs) are critical to normal cellular function and are related to many disease pathways. A range of protein functions are mediated and regulated by protein interactions through post-translational modifications (PTM). However, only 4% of PPIs are annotated with PTMs in biological knowledge databases such as IntAct, mainly performed through manual curation, which is neither time- nor cost-effective. Here we aim to facilitate annotation by extracting PPIs along with their pairwise PTM from the literature by using distantly supervised training data using deep learning to aid human curation. Method: We use the IntAct PPI database to create a distant super..

View full abstract