Grech, D. and Clough, P. (2016) Investigating cluster stability when analyzing transaction logs. In: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries. Joint Conference on Digital Libraries 2016, June 19-23, 2016, Rutgers University Newark, NJ, USA. , pp. 115-118. ISBN 9781450342292
Abstract
© 2016 ACM.Data-driven approaches have become increasingly popular as a means for analyzing transaction logs from web search engines and digital libraries, for example using cluster analysis to identify common patterns of search and navigation behavior. However, steps must be taken to ensure that results are reliable and repeatable. Although clustering patterns of user interaction behavior has been previously explored, one aspect that has received less attention is cluster stability that can be used to aid cluster validation. In this paper we compute stability based on the Jaccard coefficient to investigate the cluster stability when using different subsets of transaction log data from WorldCat.org. Results provide insights into different types of search behaviors and highlight that clusters of varying degrees of stability will result from the clustering process. However, we show that additional investigation beyond the results of cluster stability is required to fully validate the resulting clusters.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2016 ACM This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in JCDL '16 Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries 2016 9781450342292 http://doi.acm.org/10.1145/2910896.2910923 |
Keywords: | Stability analysis; Clustering algorithms; Navigation, Web search; Engines, Libraries, Data mining |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 19 Jan 2017 10:44 |
Last Modified: | 14 Apr 2017 05:11 |
Published Version: | https://doi.org/10.1145/2910896.2910923 |
Status: | Published |
Refereed: | Yes |
Identification Number: | 10.1145/2910896.2910923 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:108416 |