Helmy, M., Basaldella, M., Maddalena, E. et al. (2 more authors) (2017) Towards building a standard dataset for Arabic keyphrase extraction evaluation. In: 2016 International Conference on Asian Language Processing (IALP). 2016 International Conference on Asian Language Processing (IALP), 21-23 Nov 2016, Tainan ,Taiwan. IEEE ISBN 978-1-5090-0922-0
Abstract
Keyphrases are short phrases that best represent a document content. They can be useful in a variety of applications, including document summarization and retrieval models. In this paper, we introduce the first dataset of keyphrases for an Arabic document collection, obtained by means of crowdsourcing. We experimentally evaluate different crowdsourced answer aggregation strategies and validate their performances against expert annotations to evaluate the quality of our dataset. We report about our experimental results, the dataset features,
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2016 IEEE. This is an author produced version of a paper subsequently published in Asian Language Processing (IALP), 2016 International Conference on. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | Arabic Language Resources; Dataset; Keyphrase Extraction; Crowdsourcing |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 21 Nov 2016 16:38 |
Last Modified: | 19 Dec 2022 13:34 |
Published Version: | https://doi.org/10.1109/IALP.2016.7875927 |
Status: | Published |
Publisher: | IEEE |
Refereed: | Yes |
Identification Number: | 10.1109/IALP.2016.7875927 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:107611 |