Roberts, A and Atwell, E (2002) Unsupervised grammar inference systems for natural language. University of Leeds, School of Computing research report 2002.20
Abstract
In recent years there have been significant advances in the field of Unsupervised Grammar Inference (UGI) for Natural Languages such as English or Dutch. This paper presents a broad range of UGI implementations, where we can begin to see how the theory has been put to practise. Several mature systems are emerging, built using complex models and capable of deriving natural language grammatical phenomena. The range of systems is classified into: models based on Categorical Grammar (GraSp, CLL, EMILE); Memory Based Learning Models (FAMBL, RISE); Evolutionary computing models (ILM, LAgts); and string-pattern searches (ABL, GB). An objectively measurable statistical comparison of performances of the systems reviewed is not yet feasible. However, their merits and shortfalls are discussed, as well as a look at what the future has in store for UGI.
Metadata
Item Type: | Book |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | Roberts, A and Atwell, E (c) 2002, University of Leeds. Reproduced with permission from the copyright holders. |
Keywords: | Grammar inference; grammar induction; categorial grammar; language learning; unsupervised learning; corpus; natural language |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 17 Dec 2014 12:14 |
Last Modified: | 16 Jan 2018 06:54 |
Published Version: | http://www.comp.leeds.ac.uk/research/pubs/reports/... |
Status: | Published |
Publisher: | University of Leeds, School of Computing research report 2002.20 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:81926 |