White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Turbo similarity searching: effect of fingerprint and dataset on virtual-screening performance

Gardiner, E.J., Gillet, V.J., Haranczyk, M., Hert, J., Holliday, J.D., Malim, N., Patel, Y. and Willett, P. (2009) Turbo similarity searching: effect of fingerprint and dataset on virtual-screening performance. Statistical Analysis and Data Mining, 2 (2). pp. 103-114. ISSN 1932-1864

Warning

There is a more recent version of this eprint available. Click here to view it.

Full text not available from this repository.

Abstract

Turbo similarity searching uses information about the nearest neighbors in a conventional chemical similarity search to increase the effectiveness of virtual screening with a data fusion approach being used to combine the nearest-neighbor information. A previous paper suggested that the approach was highly effective in operation; this paper further tests the approach using a range of different databases and of structural representations. Searches were carried out on three different databases of chemical structures, using seven different types of fingerprints, as well as molecular holograms, physicochemical properties, topological indices and reduced graphs. The results show that turbo similarity searching can indeed enhance retrieval but that this is normally achieved only if the similarity search that acts as its starting point has already achieved at least some reasonable level of search effectiveness. In other cases, a modified version of TSS that uses the nearest-neighbor information for approximate machine learning can be used effectively. Though useful for qualitative (active/inactive) predictions of biological activity, turbo similarity searching does not appear to exhibit any predictive power when quantitative property data is available. Copyright © 2009 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 2: 103-114, 2009

Item Type: Article
Keywords: chemical database; chemoinformatics; similar property principle; similarity searching; turbo similarity searching
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)
Depositing User: Information Studies
Date Deposited: 25 Aug 2009 14:45
Last Modified: 25 Aug 2009 14:45
Published Version: http://dx.doi.org/10.1002/sam.10037
Status: Published
Publisher: John Wiley & Sons
Identification Number: 10.1002/sam.10037
URI: http://eprints.whiterose.ac.uk/id/eprint/9223

Available Versions of this Item

Actions (login required)

View Item View Item