Demetriou, G., Skadina, I., Keskustalo, H., Karlgren, J., Deksne, D., Petrelli, D., Hansen, P., Gaizauskas, R. and Sanderson, M. (2004) Cross-lingual document retrieval categorisation and navigation based on distributed services. In: Proceedings of the First Baltic Conference. Human Language Technologies – the Baltic Perspective, April 21 and 22, 2004, Rīga, Latvia.
The widespread use of the Internet across countries has increased the need for access to document collections that are often written in languages different from a user’s native language. In this paper we describe Clarity, a Cross Language Information Retrieval (CLIR) system for English, Finnish, Swedish, Latvian and Lithuanian. Clarity is a fully-fledged retrieval system that supports the user during the whole process of query formulation, text retrieval and document browsing. We address four of the major aspects of Clarity: (i) the user-driven methodology that formed the basis for the iterative design cycle and framework in the project, (ii) the system architecture that was developed to support the interaction and coordination of Clarity’s distributed services, (iii) the data resources and methods for query translation, and (iv) the support for Baltic languages. Clarity is an example of a distributed CLIR system built with minimal translation resources and, to our knowledge, the only such system that currently supports Baltic languages.
|Institution:||The University of Sheffield|
|Academic Units:||The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield); The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)|
|Depositing User:||Repository Officer|
|Date Deposited:||23 Sep 2008 12:22|
|Last Modified:||08 Feb 2013 16:56|