Structural fingerprints of transcription factor binding site regions

Gardiner, E.J., Hunter, C.A. and Willett, P. (2009) Structural fingerprints of transcription factor binding site regions. Algorithms, 2 (1). pp. 448-469. ISSN 1999-4893


Fourier transforms are a powerful tool in the prediction of DNA sequence properties, such as the presence/absence of codons. We have previously compiled a database of the structural properties of all 32,896 unique DNA octamers. In this work we apply Fourier techniques to the analysis of the structural properties of human chromosomes 21 and 22 and also to three sets of transcription factor binding sites within these chromosomes. We find that, for a given structural property, the structural property power spectra of chromosomes 21 and 22 are strikingly similar. We find common peaks in their power spectra for both Sp1 and p53 transcription factor binding sites. We use the power spectra as a structural fingerprint and perform similarity searching in order to find transcription factor binding site regions. This approach provides a new strategy for searching the genome data for information. Although it is difficult to understand the relationship between specific functional properties and the set of structural parameters in our database, our structural fingerprints nevertheless provide a useful tool for searching for function information in sequence data. The power spectrum fingerprints provide a simple, fast method for comparing a set of functional sequences, in this case transcription factor binding site regions, with the sequences of whole chromosomes. On its own, the power spectrum fingerprint does not find all transcription factor binding sites in a chromosome, but the results presented here show that in combination with other approaches, this technique will improve the chances of identifying functional sequences hidden in genomic data.
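
The fingerprint idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a toy lookup table of made-up dinucleotide values (TOY_TABLE) in place of the octamer structural database, a normalised power spectrum as the fingerprint, and cosine similarity as the comparison measure; the function names and the window size are likewise hypothetical.

```python
import numpy as np

# Placeholder values for illustration only; NOT real structural parameters.
# The paper uses a database of octamer properties; dinucleotides keep this short.
TOY_TABLE = {a + b: float(4 * i + j)
             for i, a in enumerate("ACGT")
             for j, b in enumerate("ACGT")}


def property_profile(seq, table, k=2):
    """Map each overlapping k-mer of seq to its property value."""
    return np.array([table[seq[i:i + k]] for i in range(len(seq) - k + 1)])


def power_spectrum(profile):
    """Normalised power spectrum of a mean-centred property profile."""
    centred = profile - profile.mean()
    spectrum = np.abs(np.fft.rfft(centred)) ** 2
    total = spectrum.sum()
    return spectrum / total if total > 0 else spectrum


def similarity(fp_a, fp_b):
    """Cosine similarity between two fingerprints (an assumed measure)."""
    denom = np.linalg.norm(fp_a) * np.linalg.norm(fp_b)
    return float(fp_a @ fp_b / denom) if denom > 0 else 0.0


def scan(chromosome, query_fp, window, table, step=1):
    """Score every window of the chromosome against a query fingerprint."""
    scores = []
    for start in range(0, len(chromosome) - window + 1, step):
        segment = chromosome[start:start + window]
        fp = power_spectrum(property_profile(segment, table))
        scores.append((start, similarity(query_fp, fp)))
    return scores


# Usage: fingerprint a known region, then rank windows of a longer sequence.
chromosome = "ACGTTGCAAGGCTTACGATCGGATCCTAGGCATGCAATTCGGCTAAGCTTGACCGTAGGCTAACG"
query_fp = power_spectrum(property_profile(chromosome[10:42], TOY_TABLE))
ranked = sorted(scan(chromosome, query_fp, 32, TOY_TABLE),
                key=lambda item: item[1], reverse=True)
```

Mean-centring removes the zero-frequency term so that the fingerprint reflects periodic variation rather than the average property value. As the abstract notes, a search of this kind does not find every binding site on its own and is intended to be combined with other approaches.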

Item Type: Article
Copyright, Publisher and Additional Information: All articles published by MDPI are made available under an Open Access license worldwide immediately.
Keywords: DNA structure; sequence-dependent structure; transcription factor binding site; Fourier transform; structural fingerprint
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)
The University of Sheffield > Faculty of Science (Sheffield) > Department of Chemistry (Sheffield)
Depositing User: Miss Anthea Tucker
Date Deposited: 28 Sep 2009 15:53
Last Modified: 15 Jun 2014 19:34
Published Version: http://dx.doi.org/10.3390/a2010448
Status: Published
Refereed: Yes
Identification Number: 10.3390/a2010448
URI: http://eprints.whiterose.ac.uk/id/eprint/9789
