Elliott, JR, Atwell, ES and Whyte, WS (2000) Increasing our ignorance of language: identifying language structure in an unknown signal. In: Proceedings of CoNLL-2000: Fourth Conference on Computational Natural Language Learning and the Second Learning Language in Logic Workshop, 13-14 Sep 2000, Lisbon, Portugal. Association for Computational Linguistics, pp. 25-30.
Abstract
This paper describes algorithms and software developed to characterise and detect generic intelligent language-like features in an input signal, using natural language learning techniques: looking for characteristic statistical "language-signatures" in test corpora. As a first step towards such species-independent language-detection, we present a suite of programs to analyse digital representations of a range of data, and use the results to extrapolate whether or not there are language-like structures which distinguish this data from other sources, such as music, images, and white noise. Outside our own immediate NLP sphere, generic communication techniques are of particular interest in the astronautical community, where two sessions are dedicated to SETI at their annual International conference with topics ranging from detecting ET technology to the ethics and logistics of message construction (Elliott and Atwell, 1999; Ollongren, 2000; Vakoch, 2000).
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: | Elliott, JR; Atwell, ES; Whyte, WS |
Copyright, Publisher and Additional Information: | (c) 2000, Association for Computational Linguistics. Reproduced with permission from the publisher. |
Dates: | Published: 2000 |
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 14 Jan 2015 11:35 |
Last Modified: | 19 Dec 2022 13:29 |
Published Version: | http://aclweb.org/anthology/W00-0705 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:82280 |