Brierley, C and Atwell, ES (2009) Exploring imagery in literary corpora with the Natural Language ToolKit. In: Mahlberg, M, González-Díaz, V and Smith, C, (eds.) Proceedings of CL2009 International Conference on Corpus Linguistics. CL2009, 20-23 Jul 2009, University of Liverpool, UK.
Abstract
This paper presents a middle way for corpus linguists between use of “off-the-shelf” corpus analysis software and building tools from scratch, which presupposes competence in a general-purpose programming language. The Python Natural Language ToolKit (NLTK) offers a range of sophisticated natural language processing tools which we have applied to literary analysis, through case studies in Macbeth and Hamlet, with code snippets and experiments that can be replicated for research and research-led teaching with other literary texts.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 12 Dec 2014 16:45 |
Last Modified: | 19 Dec 2022 13:29 |
Published Version: | http://ucrel.lancs.ac.uk/publications/cl2009/135_F... |
Status: | Published |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:81707 |