Piotrkowicz, A orcid.org/0000-0002-7723-699X, Dimitrova, VG orcid.org/0000-0002-7001-0891 and Markert, K (2017) Automatic Extraction of News Values from Headline Text. In: Proceedings of the Student Research Workshop at the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL SRW 2017). EACL SRW 2017, 03-07 Apr 2017, Valencia, Spain. Association for Computational Linguistics , pp. 64-74. ISBN 978-1-945626-37-1
Abstract
Headlines play a crucial role in attracting audiences’ attention to online artefacts (e.g. news articles, videos, blogs). The ability to carry out an automatic, largescale analysis of headlines is critical to facilitate the selection and prioritisation of a large volume of digital content. In journalism studies news content has been extensively studied using manually annotated news values – factors used implicitly and explicitly when making decisions on the selection and prioritisation of news items. This paper presents the first attempt at a fully automatic extraction of news values from headline text. The news values extraction methods are applied on a large headlines corpus collected from The Guardian, and evaluated by comparing it with a manually annotated gold standard. A crowdsourcing survey indicates that news values affect people’s decisions to click on a headline, supporting the need for an automatic news values detection.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2017 Association for Computational Linguistics. This is an author produced version of a paper accepted for publication in Proceedings of EACL. |
Keywords: | text analytics; feature engineering; headlines; news popularity |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 23 Jan 2017 14:53 |
Last Modified: | 14 Aug 2017 06:40 |
Published Version: | https://www.aclweb.org/anthology/E/E17/#4000 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:110978 |