Mind the trade-off: debiasing NLU models without degrading the in-distribution performance

Utama, P.A., Moosavi, N.S. orcid.org/0000-0002-8332-307X and Gurevych, I. (2020) Mind the trade-off: debiasing NLU models without degrading the in-distribution performance. In: Jurafsky, D., Chai, J., Schluter, N. and Tetreault, J., (eds.) Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020). 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), 05-10 Jul 2020, Online. Association for Computational Linguistics , pp. 8717-8729. ISBN 9781952148255

Abstract

Models for natural language understanding (NLU) tasks often rely on the idiosyncratic biases of the dataset, which make them brittle against test cases outside the training distribution. Recently, several proposed debiasing methods are shown to be very effective in improving out-of-distribution performance. However, their improvements come at the expense of performance drop when models are evaluated on the in-distribution data, which contain examples with higher diversity. This seemingly inevitable trade-off may not tell us much about the changes in the reasoning and understanding capabilities of the resulting models on broader types of examples beyond the small subset represented in the out-of-distribution data. In this paper, we address this trade-off by introducing a novel debiasing method, called confidence regularization, which discourage models from exploiting biases while enabling them to receive enough incentive to learn from all the training examples. We evaluate our method on three NLU tasks and show that, in contrast to its predecessors, it improves the performance on out-of-distribution datasets (e.g., 7pp gain on HANS dataset) while maintaining the original in-distribution accuracy.

Metadata

Item Type:	Proceedings Paper
Authors/Creators:	Utama, P.A. Moosavi, N.S. https://orcid.org/0000-0002-8332-307X Gurevych, I.
Editors:	Jurafsky, D. Chai, J. Schluter, N. Tetreault, J.
Copyright, Publisher and Additional Information:	© 2020 Association for Computational Linguistics. Available under a Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
Dates:	Published (online): July 2020 Published: July 2020
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	07 Sep 2022 13:50
Last Modified:	07 Sep 2022 14:17
Status:	Published
Publisher:	Association for Computational Linguistics
Refereed:	Yes
Identification Number:	10.18653/v1/2020.acl-main.770
Related URLs:	Conference
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:190601

Download

Published Version

Filename: 2020.acl-main.770.pdf

Licence: CC-BY 4.0

CLICK TO DOWNLOAD

CORE (COnnecting REpositories)

Mind the trade-off: debiasing NLU models without degrading the in-distribution performance

Abstract

Metadata

Download

Published Version

Export

Statistics