Sanchan, N., Aker, A. and Bontcheva, K. orcid.org/0000-0001-6152-9600 (Submitted: 2017) Gold Standard Online Debates Summaries and First Experiments Towards Automatic Summarization of Online Debate Data. In: 18th International Conference on Intelligent Text Processing and Computational Linguistics, 17-23 Apr 2017, Budapest, Hungary. (Submitted)
Abstract
Usage of online textual media is steadily increasing. Daily, more and more news stories, blog posts and scientific articles are added to the online volumes. These are all freely accessible and have been employed extensively in multiple research areas, e.g. automatic text summarization, information retrieval, information extraction, etc. Meanwhile, online debate forums have recently become popular, but have remained largely unexplored. For this reason, there are no sufficient resources of annotated debate data available for conducting research in this genre. In this paper, we collected and annotated debate data for an automatic summarization task. Similar to extractive gold standard summary generation our data contains sentences worthy to include into a summary. Five human annotators performed this task. Inter-annotator agreement, based on semantic similarity, is 36% for Cohen's kappa and 48% for Krippendorff's alpha. Moreover, we also implement an extractive summarization system for online debates and discuss prominent features for the task of summarizing online debate data automatically.
Metadata
Item Type: | Conference or Workshop Item |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2017 The Author(s). For reuse permissions, please contact the Author(s). |
Keywords: | Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR) |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 06 Aug 2018 10:32 |
Last Modified: | 19 Dec 2022 13:50 |
Published Version: | https://arxiv.org/abs/1708.04592 |
Status: | Submitted |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:133572 |