Demetriou, G, Atwell, E and Souter, C (2000) Using lexical semantic knowledge from machine readable dictionaries for domain independent language modelling. In: Gavrilidou, M, (ed.) Proceedings of LREC2000: Language Resources and Evaluation Conference, vol. 2. Second International Conference on Language Resources and Evaluation LREC-2000, 31 May - 02 Jun 2000, Athens, Greece. European Language Resources Association , 777 - 782. ISBN 2-9517408-6-7
Abstract
Machine Readable Dictionaries (MRDs) have been used in a variety of language processing tasks including word sense disambiguation, text segmentation, information retrieval and information extraction. In this paper we describe the utilization of semantic knowledge acquired from an MRD for language modelling tasks in relation to speech recognition applications. A semantic model of language has been derived using the dictionary definitions in order to compute the semantic association between the words. The model is capable of capturing phenomena of latent semantic dependencies between the words in texts and reducing the language ambiguity by a considerable factor. The results of experiments suggest that the semantic model can improve the word recognition rates in “noisy-channel” applications. This research provides evidence that limited or incomplete knowledge from lexical resources such as MRDs can be useful for domain independent language modelling.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | (c) 2000, European Language Resources Association. Reproduced with permission from the publisher. |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 08 Jan 2015 10:10 |
Last Modified: | 19 Dec 2022 13:29 |
Published Version: | http://www.lrec-conf.org/proceedings/lrec2000/ |
Status: | Published |
Publisher: | European Language Resources Association |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:82248 |