Improving tokenisation by alternative treatment of spaces

Gow-Smith, E., Madabushi, H.T., Scarton, C. orcid.org/0000-0002-0103-4072 et al. (1 more author) (Submitted: 2022) Improving tokenisation by alternative treatment of spaces. arXiv. (Submitted)

Abstract

Metadata

Authors/Creators:
Copyright, Publisher and Additional Information: © 2022 The Authors. Preprint available under a CC BY license (http://creativecommons.org/licenses/by/4.0).
Dates:
  • Submitted: 8 April 2022
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User: Symplectic Sheffield
Date Deposited: 25 Apr 2022 13:15
Last Modified: 25 Apr 2022 13:15
Status: Submitted
Identification Number: https://doi.org/10.48550/arXiv.2204.04058
Related URLs:

Download

Export

Statistics