Strategically efficient exploration in competitive multi-agent reinforcement learning

Loftin, R., Saha, A., Devlin, S. et al. (1 more author) (2021) Strategically efficient exploration in competitive multi-agent reinforcement learning. In: Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence (UAI 2021). 37th Conference on Uncertainty in Artificial Intelligence, 27-30 Jul 2021, Online. Proceedings of Machine Learning Research, 161 . ML Research Press , pp. 1587-1596.

Abstract

High sample complexity remains a barrier to the application of reinforcement learning (RL), particularly in multi-agent systems. A large body of work has demonstrated that exploration mechanisms based on the principle of optimism under uncertainty can significantly improve the sample efficiency of RL in single agent tasks. This work seeks to understand the role of optimistic exploration in non-cooperative multi-agent settings. We will show that, in zero-sum games, optimistic exploration can cause the learner to waste time sampling parts of the state space that are irrelevant to strategic play, as they can only be reached through cooperation between both players. To address this issue, we introduce a formal notion of strategically efficient exploration in Markov games, and use this to develop two strategically efficient learning algorithms for finite Markov games. We demonstrate that these methods can be significantly more sample efficient than their optimistic counterparts.

Metadata

Item Type:	Proceedings Paper
Authors/Creators:	Loftin, R. Saha, A. Devlin, S. Hofmann, K.
Copyright, Publisher and Additional Information:	© 2021 The Authors.
Dates:	Published: 30 July 2021
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	16 Feb 2024 13:33
Last Modified:	16 Feb 2024 13:33
Published Version:	https://proceedings.mlr.press/v161/loftin21a.html
Status:	Published
Publisher:	ML Research Press
Series Name:	Proceedings of Machine Learning Research
Refereed:	Yes
Related URLs:	arXiv URL Software or Code Conference
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:209128

CORE (COnnecting REpositories)

Strategically efficient exploration in competitive multi-agent reinforcement learning

Abstract

Metadata

Download

External copy

Export

Statistics