Willett, P. (2015) Maximum Common Substructure-based Data Fusion in Similarity Searching. Journal of Chemical Information and Modeling. Published Online 20 January 2015. ISSN 1549-960X
Abstract
Data fusion has been shown to work very well when applied to fingerprint-based similarity searching, yet little is known of its application to Maximum Common Substructure (MCS)-based similarity searching. Two similarity search applications of the MCS will be focussed on here. Typically, the number of bonds in the MCS, as well as the bonds in the two molecules being compared, are used in a similarity coefficient. The power of this technique can be extended using data fusion, where the MCS similarities of a set of reference molecules against one database molecule are fused. This “group fusion” technique forms the first application of the MCS in this work. The other application is that of the chemical hyperstructure. The hyperstructure concept is an alternative form of data fusion, being a hypothetical molecule that is constructed from the overlap of a set of existing molecules. This paper compares fingerprint group fusion (extended-connectivity fingerprints), MCS similarity group fusion, and hyperstructure similarity searching, and describes their relative merits and complementarity in virtual screening. It is concluded that the hyperstructure approach as implemented here is less generally effective than conventional fingerprint approaches.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2015 ACS. This is an author produced version of a paper subsequently published in Journal of Chemical Information and Modelling. Uploaded in accordance with the publisher's self-archiving policy. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 28 Jan 2015 10:08 |
Last Modified: | 09 Mar 2016 01:43 |
Published Version: | http://dx.doi.org/10.1021/ci5005702 |
Status: | Published |
Publisher: | American Chemical Society |
Refereed: | Yes |
Identification Number: | 10.1021/ci5005702 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:83029 |