Mahdi, A.R. orcid.org/0000-0003-2152-8501, Rezaei, M. orcid.org/0000-0003-3892-421X and Merat, N. (Cover date: January/December 2026) PGR‐Net: A Pedestrian Gesture Recognition Model for Effective AV‐Human Interactions in Autonomous Vehicles. IET Intelligent Transport Systems, 20 (1). e70197. ISSN: 1751-956X
Abstract
Autonomous vehicles (AVs) and advanced driver assistance systems (ADAS) continue to advance, yet effective social coordination with human road users (HRUs) remains a key challenge. This study introduces the PGR‐Net model, a spatiotemporal deep learning (DL) approach for pedestrian gesture recognition (PGR) to bridge the gap in AV‐pedestrian communication. We created the PGR‐Net v1.0 dataset by remapping Jester gesture labels to AV‐relevant classes: Stop, Go, and Greeting/Thanking. Furthermore, a No Gesture class is defined via a sequential hand‐presence rule. The PGR‐Net fuses an R(2+1)D, a three‐dimensional convolutional neural network (3D‐CNN) architecture, and a spatiotemporal stream with hand‐pose landmarks, followed by recurrent neural network (RNN) encoders and self‐attention layers to emphasise gesture‐relevant frames. On the PGR‐Net v1.0 dataset, the PGR‐Netv2 achieves 88.29% accuracy and an absolute 12.56% improvement from the baseline R(2+1)D model. Qualitative tests on single images beyond the dataset indicate sensible generalisation and highlight the importance of short spatiotemporal context for PGR. These results suggest that hand‐augmented spatiotemporal modelling is a viable path toward a robust and AV‐relevant PGR for various traffic scenarios. We discuss current limitations due to the limited availability of PGR‐specific datasets and outline directions for broader in‐the‐wild data and context‐aware modelling to improve applicability.
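The reported figures can be unpacked with a short calculation. The sketch below assumes that the "absolute 12.56% improvement" means percentage points of accuracy, so the baseline R(2+1)D accuracy is implied rather than stated in the abstract. The function names are illustrative and do not come from the paper.

```python
def implied_baseline(model_acc_pct: float, absolute_gain_pct: float) -> float:
    """Baseline accuracy implied by a model accuracy and an absolute gain.

    Assumes the gain is expressed in percentage points.
    """
    return round(model_acc_pct - absolute_gain_pct, 2)


def relative_gain(model_acc_pct: float, absolute_gain_pct: float) -> float:
    """Gain expressed as a percentage of the implied baseline accuracy."""
    baseline = model_acc_pct - absolute_gain_pct
    return 100.0 * absolute_gain_pct / baseline


# Figures quoted in the abstract: 88.29% accuracy, 12.56 points absolute gain.
baseline = implied_baseline(88.29, 12.56)
print(baseline)  # 75.73
print(round(relative_gain(88.29, 12.56), 1))
```

Under this reading, the baseline would sit at about 75.73% accuracy, and the same gain expressed relative to that baseline is noticeably larger than 12.56%, which is why the "absolute" qualifier matters.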
Metadata
| Item Type: | Article |
|---|---|
| Authors/Creators: | Mahdi, A.R. (orcid.org/0000-0003-2152-8501); Rezaei, M. (orcid.org/0000-0003-3892-421X); Merat, N. |
| Copyright, Publisher and Additional Information: | © 2026 The Author(s). IET Intelligent Transport Systems published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
| Keywords: | computer vision; deep learning; human-AV interaction; pedestrian-AV interactions; pedestrian gesture recognition; pedestrian safety |
| Dates: | |
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Environment (Leeds) > Institute for Transport Studies (Leeds) |
| Date Deposited: | 26 Mar 2026 12:17 |
| Last Modified: | 26 Mar 2026 12:17 |
| Status: | Published |
| Publisher: | Institution of Engineering and Technology (IET) |
| Identification Number: | 10.1049/itr2.70197 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:239277 |
