Luna Gutierrez, R and Leonetti, M orcid.org/0000-0002-3831-2400 (2021) Meta-Reinforcement Learning for Heuristic Planing. In: Proceedings of the International Conference on Automated Planning and Scheduling. International Conference on Automated Planning and Scheduling, 02-13 Aug 2021, held virtually from Guangzhou, China. , pp. 551-559. ISBN 978-1-57735-867-1
Abstract
Heuristic planning has a central role in classical planning applications and competitions. Thanks to this success, there has been an increasing interest in using Deep Learning to create high-quality heuristics in a supervised fashion, learning from optimal solutions of previously solved planning problems. Meta-Reinforcement learning is a fast growing research area concerned with learning, from many tasks, behaviours that can quickly generalize to new tasks from the same distribution of the training ones. We make a connection between meta-reinforcement learning and heuristic planning, showing that heuristic functions meta-learned from planning problems, in a given domain, can outperform both popular domain-independent heuristics, and heuristics learned by supervised learning. Furthermore, while most supervised learning algorithms rely on ad-hoc encodings of the state representation, our method uses as input a general PDDL 3.1 description. We evaluated our heuristic with an A* planner on six domains from the International Planning Competition and the FF Domain Collection, showing that the meta-learned heuristic leads to the expansion, on average, of fewer states than three popular heuristics used by the FastDownward planner, and a supervised-learned heuristic.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Keywords: | Learning Effective Heuristics And Other Forms Of Control Knowledge |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 21 Mar 2021 16:52 |
Last Modified: | 07 Jul 2021 09:20 |
Published Version: | https://ojs.aaai.org/index.php/ICAPS/article/view/... |
Status: | Published |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:172238 |