Bath, P.A., Craigs, C., Maheswaran, R., Raymond, J. and Willett, P. (2002) Validation of graph-theoretical methods for pattern identification in public health datasets. Health Informatics Journal, 8 (4). pp. 167-173. ISSN 1741-2811
Pattern identification issues are commonly used in public health practice to identify disease clusters and tendencies towards clustering. The basic building blocks or units for such patterns may be individuals or geographical units, but the key factor is the association between units in terms of time, space or other complex links. A range of methods has been developed for cluster detection but these methods are not designed to handle complex pattern searching. This paper describes early work in developing a novel method of tackling this problem, using graph theoretical techniques developed for computational chemistry. A modified version of the maximum common subgraph isomorphism method was used to search and retrieve enumeration districts (EDs) using 27 user-defined patterns from a set of 106 EDs. The results were then checked manually to ensure that all the appropriate and no additional patterns and EDs were retrieved. The program successfully retrieved all the relevant patterns and EDs and did not retrieve any patterns not specified by the query patterns. This study demonstrates the applicability of using graph theory for identifying and retrieving patterns in public health datasets.
|Institution:||The University of Sheffield|
|Academic Units:||The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)|
|Depositing User:||Information Studies|
|Date Deposited:||23 Mar 2009 14:57|
|Last Modified:||25 Mar 2009 14:40|