White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

On the significance of topological-indices based non-binary molecular similarity measures

Salim, N., Holliday, J.D. and Willett, P. (2004) On the significance of topological-indices based non-binary molecular similarity measures. Sains Malaysiana, 33 (2). pp. 157-172. ISSN 0126-6039

Full text not available from this repository.

Abstract

This paper describes experiments to study on how well the whole range of topological indices-based non-binary similarity values represents the physicochemical similarities between compounds. Measured log P values have been compared with the log P values predicted from compounds at different range of similarities calculated based on various topological indices of the compounds. Analysis shows that the non-binary Cosine, Simpson and Pearson coefficients might give misleading results when certain compounds are compared. Similarity values involving 1% most similar compounds based on the non-binary Tanimoto or Euclidean coefficients has been found to be able to represent physicochemical similarities between the molecules compared. Therefore, for searches requiring around 1% most similar compounds, rational selection methods based on the non-binary Tanimoto or Euclidean coefficients are likely to produce better results than random selection. Similarity values involving 5% most dissimilar compounds based on the non-binary Tanimoto coefficients has also been found to be able to represent physicochemical dissimilarities between the molecules compared. Therefore, for diverse selection requiring less than 5% most dissimilar compounds, rational selection methods based on the non-binary Tanimoto coefficient is likely to produce better results than random selection. However, in both focused and diverse selection using the coefficients mentioned, as more and more compounds are selected, the selection becomes more and more like random selection in terms of physicochemical properties similarity and dissimilarity.

Item Type: Article
Academic Units: The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)
Depositing User: Information Studies
Date Deposited: 26 Aug 2009 09:42
Last Modified: 26 Aug 2009 09:43
Published Version: http://pkukmweb.ukm.my/~jsm/index.html
Status: Published
URI: http://eprints.whiterose.ac.uk/id/eprint/9228

Actions (login required)

View Item View Item