How to detect grammatical errors in a text without parsing it

Abstract

The Constituent Likelihood Automatic Word-tagging System (CLAWS) was originally designed for the low-level grammatical analysis of the million-word LOB Corpus of English text samples. CLAWS does not attempt a full parse, but uses a firat-order Markov model of language to assign word-class labels to words. CLAWS can be modified to detect grammatical errors, essentially by flagging unlikely word-class transitions in the input text. This may seem to be an intuitively implausible and theoretically inadequate model of natural language syntax, but nevertheless it can successfully pinpoint most grammatical errors in a text. Several modifications to CLAWS have been explored. The resulting system cannot detect all errors in typed documents; but then neither do far more complex systems, which attempt a full parse, requiting much greater computation.

Metadata

Item Type:	Book Section
Authors/Creators:	Atwell, ES
Keywords:	Computer software
Dates:	Published: 1986
Institution:	The University of Leeds
Academic Units:	The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds)
Depositing User:	Symplectic Publications
Date Deposited:	06 Jan 2015 12:21
Last Modified:	28 Jul 2015 18:33
Published Version:	http://dl.acm.org/citation.cfm?id=976865
Status:	Published
Publisher:	The University of Leeds
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:81938

CORE (COnnecting REpositories)

How to detect grammatical errors in a text without parsing it

Abstract

Metadata

Download not available

Export

Statistics