Needham, C.J., Bradford, J.R. orcid.org/0000-0002-7771-914X, Bulpitt, A.J. et al. (2 more authors) (2006) Predicting the effect of missense mutations on protein function: analysis with Bayesian networks. BMC Bioinformatics, 7. 405. ISSN 1471-2105
Abstract
Background: A number of methods that use both protein structural and evolutionary information are available to predict the functional consequences of missense mutations. However, many of these methods break down if either one of the two types of data are missing. Furthermore, there is a lack of rigorous assessment of how important the different factors are to prediction.
Results: Here we use Bayesian networks to predict whether or not a missense mutation will affect the function of the protein. Bayesian networks provide a concise representation for inferring models from data, and are known to generalise well to new data. More importantly, they can handle the noisy, incomplete and uncertain nature of biological data. Our Bayesian network achieved comparable performance with previous machine learning methods. The predictive performance of learned model structures was no better than a naïve Bayes classifier. However, analysis of the posterior distribution of model structures allows biologically meaningful interpretation of relationships between the input variables.
Conclusion: The ability of the Bayesian network to make predictions when only structural or evolutionary data was observed allowed us to conclude that structural information is a significantly better predictor of the functional consequences of a missense mutation than evolutionary information, for the dataset used. Analysis of the posterior distribution of model structures revealed that the top three strongest connections with the class node all involved structural nodes. With this in mind, we derived a simplified Bayesian network that used just these three structural descriptors, with comparable performance to that of an all node network.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © Needham et al; licensee BioMed Central Ltd. 2006 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Sheffield Teaching Hospitals |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 28 Nov 2016 11:55 |
Last Modified: | 28 Nov 2016 12:02 |
Published Version: | http://dx.doi.org/10.1186/1471-2105-7-405 |
Status: | Published |
Publisher: | BioMed Central |
Refereed: | Yes |
Identification Number: | 10.1186/1471-2105-7-405 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:107769 |