Coupling Dynamics and Evolutionary Information with Structure to Identify Protein Regulatory and Functional Binding Sites

Thumbnail Image
Date
2019-05-29
Authors
Mishra, Sambit Kumar
Kandoi, Gaurav
Jernigan, Robert
Major Professor
Advisor
Committee Member
Journal Title
Journal ISSN
Volume Title
Publisher
Authors
Person
Jernigan, Robert
Distinguished Professor
Research Projects
Organizational Units
Organizational Unit
Organizational Unit
Organizational Unit
Bioinformatics and Computational Biology
The Bioinformatics and Computational Biology (BCB) Program at Iowa State University is an interdepartmental graduate major offering outstanding opportunities for graduate study toward the Ph.D. degree in Bioinformatics and Computational Biology. The BCB program involves more than 80 nationally and internationally known faculty—biologists, computer scientists, mathematicians, statisticians, and physicists—who participate in a wide range of collaborative projects.
Journal Issue
Is Version Of
Versions
Series
Department
Biochemistry, Biophysics and Molecular BiologyElectrical and Computer EngineeringBioinformatics and Computational Biology
Abstract

Binding sites in proteins can be either specifically functional binding sites (active sites) that bind specific substrates with high affinity or regulatory binding sites (allosteric sites), that modulate the activity of functional binding sites through effector molecules. Owing to their significance in determining protein function, the identification of protein functional and regulatory binding sites is widely acknowledged as an important biological problem. In this work, we present a novel binding site prediction method, AR-Pred (Active and Regulatory site Prediction), which supplements protein geometry, evolutionary and physicochemical features with information about protein dynamics to predict putative active and allosteric site residues. Since the intrinsic dynamics of globular proteins plays an essential role in controlling binding events, we find it to be an important feature for the identification of protein binding sites. We train and validate our predictive models on multiple balanced training and validation sets with random forest machine learning and obtain an ensemble of discrete models for each prediction type. Our models for active site prediction yield a median AUC of 91% and MCC of 0.68, whereas the less welldefined allosteric sites are predicted at a lower level with a median AUC of 80% and MCC of 0.48. When tested on an independent set of proteins, our models for active site prediction show comparable performance to two existing methods and gains compared to two others, while the allosteric site models show gains when tested against three existing prediction methods. AR-Pred is available as a free downloadable package at https://github.com/sambitmishra0628/ARPRED_ source.

Comments

This is the peer reviewed version of the following article: Mishra, Sambit Kumar, Gaurav Kandoi, and Robert L. Jernigan. "Coupling Dynamics and Evolutionary Information with Structure to Identify Protein Regulatory and Functional Binding Sites." Proteins: Structure, Function, and Bioinformatics (2019), which has been published in final form at doi: 10.1002/prot.25749. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Use of Self-Archived Versions.

Description
Keywords
Citation
DOI
Copyright
Tue Jan 01 00:00:00 UTC 2019
Collections