GLoU-MiT: Lightweight Global-Local Mamba-Guided U-mix transformer for UAV-based pavement crack segmentation

Abstract

Unmanned Aerial Vehicles (UAVs) are increasingly recognized as a tool for routine pavement distress inspection because of their efficiency, flexibility, safety, and low-cost automation. However, images acquired by UAVs at high altitude pose particular challenges for deep learning-based semantic segmentation models, including minute crack details, blurred boundaries, and high levels of environmental noise. We propose GLoU-MiT, a lightweight segmentation model designed to address these difficulties in UAV-based pavement crack segmentation. The model integrates a U-shaped Mix Transformer architecture for efficient hierarchical feature extraction, a Global-Local Mamba-Guided Skip Connection for improved feature alignment and computational efficiency, and a Boundary/Semantic Deep Supervision Refinement Module to enhance segmentation precision in complex scenarios. Extensive experiments on the UAV-Crack500, CrackSC, and Crack500 datasets demonstrate that GLoU-MiT improves segmentation accuracy, particularly in low-contrast and complex background environments, making it a robust solution for UAV-based pavement crack inspection. Furthermore, inference speed and energy consumption evaluations on the Jetson Orin Nano (8 GB) show that the model achieves an excellent balance among accuracy, energy efficiency, and speed. The code will be released at: https://github.com/SHAN-JH/GLoU-MiT

Metadata

Item Type:	Article
Authors/Creators:	Shan, J.; Huang, Y. (ORCID: https://orcid.org/0000-0002-1220-6896); Jiang, W.; Yuan, D.; Guo, F.
Copyright, Publisher and Additional Information:	This is an author produced version of an article published in Advanced Engineering Informatics, made available under the terms of the Creative Commons Attribution License (CC-BY), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited.
Keywords:	Pavement crack, Vision mamba, Vision transformer, Semantic segmentation, Skip connection
Dates:	Accepted: 18 April 2025; Published (online): 27 April 2025; Published: May 2025
Institution:	The University of Leeds
Academic Units:	The University of Leeds > Faculty of Environment (Leeds) > Institute for Transport Studies (Leeds)
Depositing User:	Symplectic Publications
Date Deposited:	16 Jun 2025 11:00
Last Modified:	16 Jun 2025 11:00
Status:	Published
Publisher:	Elsevier
Identification Number:	10.1016/j.aei.2025.103384
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:227785
