Learning image-based Receding Horizon Planning for manipulation in clutter

Abstract

The manipulation of an object into a desired location in a cluttered and restricted environment requires reasoning over the long-term consequences of an action while reacting locally to the multiple physics-based interactions. We present Visual Receding Horizon Planning (VisualRHP) in a framework which interleaves real-world execution with look-ahead planning to efficiently solve a short-horizon approximation to a multi-step sequential decision making problem. VisualRHP is guided by a learned heuristic that acts on an abstract colour-labelled image-based representation of the state. With this representation, the robot can generalize its behaviours to different environment setups, that is, different number and shape of objects, while also having transferable manipulation skills that can be applied to a multitude of real-world objects. We train the heuristic with imitation and reinforcement learning in discrete and continuous actions spaces. We detail the necessary changes made on existing learning algorithms to improve the stability of heuristic learning in environments with sparse rewards, and non-linear, non-continuous, dynamics. In a series of simulation and real-world experiments, we show the robot performing prehensile and non-prehensile actions in synergy to successfully manipulate a variety of real-world objects in real-time.

Metadata

Item Type:	Article
Authors/Creators:	Bejjani, W https://orcid.org/0000-0002-6129-2460 Leonetti, M https://orcid.org/0000-0002-3831-2400 Dogar, MR https://orcid.org/0000-0002-6896-5461
Copyright, Publisher and Additional Information:	© 2021 Elsevier B.V. All rights reserved. This is an author produced version of an article published in Robotics and Autonomous Systems. Uploaded in accordance with the publisher's self-archiving policy.
Keywords:	Manipulation in clutter; Physics-based manipulation; Heuristic learning; Receding Horizon Planning; Imitation and reinforcement learning; Abstract state representation
Dates:	Published: April 2021 Published (online): 23 January 2021 Accepted: 4 January 2021
Institution:	The University of Leeds
Academic Units:	The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds)
Funding Information:	Funder Grant number EPSRC (Engineering and Physical Sciences Research Council) EP/R031193/1
Depositing User:	Symplectic Publications
Date Deposited:	29 Jan 2021 15:33
Last Modified:	23 Jan 2022 01:38
Status:	Published
Publisher:	Elsevier
Identification Number:	10.1016/j.robot.2021.103730
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:170573

Download

Accepted Version

Filename: Learning Manipulation in Clutter.pdf

Licence: CC-BY-NC-ND 4.0

CLICK TO DOWNLOAD

[thumbnail of Learning Manipulation in Clutter.pdf]

CORE (COnnecting REpositories)

Learning image-based Receding Horizon Planning for manipulation in clutter

Abstract

Metadata

Download

Accepted Version

Export

Statistics