Atwell, ES (1986) How to detect grammatical errors in a text without parsing it. In: Computer Studies Research Report 216. The University of Leeds
Abstract
The Constituent Likelihood Automatic Word-tagging System (CLAWS) was originally designed for the low-level grammatical analysis of the million-word LOB Corpus of English text samples. CLAWS does not attempt a full parse, but uses a firat-order Markov model of language to assign word-class labels to words. CLAWS can be modified to detect grammatical errors, essentially by flagging unlikely word-class transitions in the input text. This may seem to be an intuitively implausible and theoretically inadequate model of natural language syntax, but nevertheless it can successfully pinpoint most grammatical errors in a text. Several modifications to CLAWS have been explored. The resulting system cannot detect all errors in typed documents; but then neither do far more complex systems, which attempt a full parse, requiting much greater computation.
Metadata
Item Type: | Book Section |
---|---|
Authors/Creators: |
|
Keywords: | Computer software |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 06 Jan 2015 12:21 |
Last Modified: | 28 Jul 2015 18:33 |
Published Version: | http://dl.acm.org/citation.cfm?id=976865 |
Status: | Published |
Publisher: | The University of Leeds |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:81938 |