Khatibi, S, Teimouri, M and Rezaei, M orcid.org/0000-0003-3892-421X (2021) Real-time Active Vision for a Humanoid Soccer Robot Using Deep Reinforcement Learning. In: Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART. 13th International Conference on Agents and Artificial Intelligence (ICAART 2021), 04-06 Feb 2021, Online. SciTePress, pp. 742-751. ISBN: 978-989-758-484-8 ISSN: 2184-433X EISSN: UNSPECIFIED
Abstract
In this paper, we present an active vision method using a deep reinforcement learning approach for a humanoid soccer-playing robot. The proposed method adaptively optimises the viewpoint of the robot to acquire the most useful landmarks for self-localisation while keeping the ball into its viewpoint. Active vision is critical for humanoid decision-maker robots with a limited field of view. To deal with active vision problem, several probabilistic entropy-based approaches have previously been proposed which are highly dependent on the accuracy of the self-localisation model. However, in this research, we formulate the problem as an episodic reinforcement learning problem and employ a Deep Q-learning method to solve it. The proposed network only requires the raw images of the camera to move the robot’s head toward the best viewpoint. The model shows a very competitive rate of 80% success rate in achieving the best viewpoint. We implemented the proposed method on a humanoid robot simula ted in Webots simulator. Our evaluations and experimental show that the proposed method outperforms the entropy-based methods in the RoboCup context, in cases with high self-localisation errors.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | Ⓒ 2021 by SCITEPRESS – Science and Technology Publications, Lda. This is an open-access article distributed under the terms of the CC-BY-NC-ND 4.0 license (http://creativecommons.org/licenses/by-nc-nd/4.0). |
| Keywords: | Deep Reinforcement Learning; Active Vision; Deep Q-Network; Humanoid Robot; RoboCup |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Environment (Leeds) > Institute for Transport Studies (Leeds) > ITS: Safety and Technology (Leeds) |
| Depositing User: | Symplectic Publications |
| Date Deposited: | 14 Jan 2021 14:00 |
| Last Modified: | 19 Oct 2023 13:59 |
| Status: | Published |
| Publisher: | SciTePress |
| Identification Number: | 10.5220/0010237307420751 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:169894 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)