Self-calibration for language model quantization and pruning

This is a preprint and may not have undergone formal peer review

Williams, M., Chrysostomou, G. and Aletras, N. orcid.org/0000-0003-4285-1965 (Submitted: 2024) Self-calibration for language model quantization and pruning. [Preprint - arXiv] (Submitted)

Abstract

Metadata

Item Type: Preprint
Authors/Creators:
Copyright, Publisher and Additional Information:

© 2024 The Author(s). For reuse permissions, please contact the Author(s).

Dates:
  • Submitted: 22 October 2024
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Funding Information:
Funder
Grant number
RESPONSIBLE AI UK
EP/Y009800/1
Depositing User: Symplectic Sheffield
Date Deposited: 10 Jan 2025 11:02
Last Modified: 10 Jan 2025 11:02
Status: Submitted
Identification Number: 10.48550/arXiv.2410.17170
Open Archives Initiative ID (OAI ID):

Export

Statistics