Confidence regulation neurons in language models

Stolfo, A., Wu, B., Gurnee, W. et al. (4 more authors) (2024) Confidence regulation neurons in language models. In: Advances in Neural Information Processing Systems. NeurIPS 2024: The Thirty-Eighth Annual Conference on Neural Information Processing Systems, 10-15 Dec 2024, Vancouver, Canada. NeurIPS

Metadata

Item Type: Proceedings Paper
Authors/Creators:
Copyright, Publisher and Additional Information:

© 2024 The Author(s)

Dates:
  • Published: 12 December 2024
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Funding Information:
Funder
Grant number
INNOVATE UK
10098112 TS/Y019806/1
Engineering and Physical Sciences Research Council
2803593
Depositing User: Symplectic Sheffield
Date Deposited: 23 Jan 2025 16:50
Last Modified: 23 Jan 2025 16:50
Published Version: https://neurips.cc/virtual/2024/poster/96903
Status: Published
Publisher: NeurIPS
Refereed: Yes
Related URLs:
Open Archives Initiative ID (OAI ID):

Export

Statistics