Elhaik, E. (orcid.org/0000-0003-4795-1084), Pellegrini, M. and Tatarinova, T. V. (2014) Gene expression and nucleotide composition are associated with genic methylation level in Oryza sativa. BMC Bioinformatics, 15. 23. ISSN 1471-2105
Abstract
Background: The methylation of cytosines at CpG dinucleotides, which plays an important role in gene expression regulation, is one of the most studied epigenetic modifications. Thus far, DNA methylation has been detected mostly by experimental methods, which are not only prone to bench effects and artifacts but are also time-consuming, expensive, and difficult to scale up to many samples. It is therefore useful to develop computational prediction methods for DNA methylation. Our previous studies highlighted the existence of correlations between the GC content of the third codon position (GC3), methylation, and gene expression. We thus designed a model to predict methylation in Oryza sativa based on genomic sequence features and gene expression data.
Results: We first derive equations to describe the relationship between gene methylation levels, GC3, expression, length, and other gene compositional features. We next assess gene compositional features involving sixmers and their association with methylation levels and other gene-level properties. By applying our sixmer-based approach to rice gene expression data, we show that it can accurately predict methylation (Pearson’s correlation coefficient r = 0.79) for the majority (79%) of the genes. Matlab code with our model is included.
Conclusions: Gene expression variation can be used as a predictor of gene methylation levels.
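The quantities named in the abstract (GC3, sixmer composition, and Pearson's correlation coefficient) can be illustrated with a minimal Python sketch. This is not the authors' Matlab model, which is not reproduced in this record; the function names `gc3`, `sixmer_counts`, and `pearson_r` are hypothetical, and the sketch assumes an in-frame coding sequence and overlapping sixmer windows, neither of which is specified here.

```python
from collections import Counter
from math import sqrt


def gc3(cds: str) -> float:
    """Fraction of G or C bases at third codon positions.

    Assumes `cds` is an in-frame coding sequence; trailing bases that do
    not complete a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    thirds = cds[2:usable:3]  # every third base, starting at codon position 3
    if not thirds:
        raise ValueError("sequence is shorter than one codon")
    return sum(base in "GC" for base in thirds) / len(thirds)


def sixmer_counts(seq: str) -> Counter:
    """Count overlapping 6-nucleotide words (an assumed windowing scheme)."""
    seq = seq.upper()
    return Counter(seq[i:i + 6] for i in range(len(seq) - 5))


def pearson_r(xs, ys) -> float:
    """Pearson's correlation coefficient between two equal-length samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return sxy / sqrt(sxx * syy)
```

In this sketch, a predicted and a measured methylation level per gene would be passed to `pearson_r` as two lists of equal length; the value r = 0.79 reported in the abstract is a statistic of this kind, computed by the authors on their own data.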
Metadata
Item Type: | Article |
---|---|
Copyright, Publisher and Additional Information: | © Elhaik et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
Keywords: | DNA methylation; Gene expression; GC3; Prediction; Oryza sativa |
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Science (Sheffield) > School of Biosciences (Sheffield) > Department of Animal and Plant Sciences (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 07 Dec 2016 12:11 |
Last Modified: | 07 Dec 2016 12:17 |
Published Version: | http://dx.doi.org/10.1186/1471-2105-15-23 |
Status: | Published |
Publisher: | BioMed Central |
Refereed: | Yes |
Identification Number: | 10.1186/1471-2105-15-23 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:108697 |