Know thy corpus! Robust methods for digital curation of Web corpora

Sharov, S orcid.org/0000-0002-4877-0210 (2020) Know thy corpus! Robust methods for digital curation of Web corpora. In: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). 12th Conference on Language Resources and Evaluation (LREC 2020), 11-16 May 2020, Marseille. . ISBN 979-10-95546-34-4

Abstract

Metadata

Authors/Creators:
Copyright, Publisher and Additional Information: © The European Language Resources Association (ELRA), 2020 The LREC 2020 Proceedings are licensed under a Creative Commons Attribution Non-Commercial 4.0 International License.
Keywords: Validation of language resources, Text analytics, Language Modelling, Digital curation
Dates:
  • Accepted: 11 February 2020
  • Published: May 2020
Institution: The University of Leeds
Academic Units: The University of Leeds > Faculty of Arts, Humanities and Cultures (Leeds) > School of Languages Cultures & Societies (Leeds) > Translation Studies (Leeds)
Depositing User: Symplectic Publications
Date Deposited: 17 Apr 2020 13:10
Last Modified: 06 Feb 2021 09:02
Published Version: http://www.elra.info/en/lrec/proceedings/
Status: Published

Download

Export

Statistics