Danso, S, Atwell, ES, Johnson, O et al. (11 more authors) (2013) A semantically annotated verbal autopsy corpus for automatic analysis of cause of death. ICAME Journal of the International Computer Archive of Modern and Medieval English, 37. 37-69.
Abstract
An annotated corpus is essential to the development and evaluation of automatic approaches in corpus linguistics research. The biomedical domain is one area witnessing rapid growth in corpus-based approaches to the development of automatic systems. This paper presents the method employed in building a semantically annotated corpus of 11,741 Verbal Autopsy documents based on verbal records of deaths of mothers, stillbirths, and infants up to 1 year of age, captured for analysis in Ghana between December 2000 and July 2010. An evaluation is carried out based on established criteria to demonstrate that the Verbal Autopsy corpus possesses the qualities of many referenced corpora. The experience gained from the methods employed, together with alternative approaches, may lead to a more efficient and cost-effective corpus development framework.
Metadata
Item Type: | Article |
---|---|
Copyright, Publisher and Additional Information: | © 2013 De Gruyter. This is an open access article under the terms of the Creative Commons Attribution Non-Commercial No Derivatives License. http://clu.uni.no/icame/journal.html |
Keywords: | Verbal autopsy; natural language processing; medical text; machine learning; cause of death |
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 16 Jun 2014 15:16 |
Last Modified: | 16 Jan 2018 06:05 |
Status: | Published |
Publisher: | De Gruyter Open |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:79252 |