van Zaanen, M, Roberts, A and Atwell, ES (2004) A multilingual parallel parsed corpus as gold standard for grammatical inference evaluation. In: Proceedings of LREC'04 Workshop on The Amazing Utility of Parallel and Comparable Corpora. LREC 2004 Workshop on the Amazing Utility of Parallel and Comparable Corpora, 25 May 2004, Lisbon, Portugal. European Language Resources Association , 58 - 61.
Abstract
In this article we investigate how (computational) grammar inference systems are evaluated and how the evaluation procedure can be improved. First, we describe the currently used evaluation methods and look at the advantages and disadvantages of each method. The main problems of the methods are: the dependency on language experts, the influence of the annotation scheme of language data, and the language dependency of the evaluation. We then propose a new method that will allow for an evaluation independently of language and annotation scheme. This method requires (syntactically) structured corpora in multiple languages to test for language independency of the grammatical inference system and corpora structured using different annotation schemes to diminish the influence the annotation has on the evaluation.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | van Zaanen, M, Roberts, A and Atwell, ES (c) 2004, University of Leeds. Reproduced with permission from the copyright holders. |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 01 Dec 2014 12:16 |
Last Modified: | 16 Jan 2018 06:00 |
Published Version: | http://www.elra.info/ |
Status: | Published |
Publisher: | European Language Resources Association |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:81661 |