This is the latest version of this eprint.
Aletras, N., Baldwin, T., Lau, J. et al. (1 more author) (2015) Evaluating Topic Representations for Exploring Document Collections. Journal of the Association for Information Science and Technology. ISSN 2330-1635
Abstract
Topic models have been shown to be a useful way of representing the content of large document collections, for example, via visualization interfaces (topic browsers). These systems enable users to explore collections by way of latent topics. A standard way to represent a topic is using a term list; that is the top-n words with highest conditional probability within the topic. Other topic representations such as textual and image labels also have been proposed. However, there has been no comparison of these alternative representations. In this article, we compare 3 different topic representations in a document retrieval task. Participants were asked to retrieve relevant documents based on predefined queries within a fixed time limit, presenting topics in one of the following modalities: (a) lists of terms, (b) textual phrase labels, and (c) image labels. Results show that textual labels are easier for users to interpret than are term lists and image labels. Moreover, the precision of retrieved documents for textual and image labels is comparable to the precision achieved by representing topics using term lists, demonstrating that labeling methods are an effective alternative topic representation.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2015 The Authors. Journal of the Association for Information Science and Technology published by Wiley Periodicals, Inc. on behalf of ASIS&T. This is an open access article under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 08 Oct 2015 08:59 |
Last Modified: | 08 Oct 2015 08:59 |
Published Version: | http://dx.doi.org/10.1002/asi.23574 |
Status: | Published |
Publisher: | Wiley |
Refereed: | Yes |
Identification Number: | 10.1002/asi.23574 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:90654 |
Available Versions of this Item
-
Evaluating Topic Representations for Exploring Document Collections. (deposited 07 Oct 2015 13:36)
- Evaluating Topic Representations for Exploring Document Collections. (deposited 08 Oct 2015 08:59) [Currently Displayed]