Encoder–decoder with pyramid region attention for pixel‐level pavement crack recognition

Yao, Hui; Liu, Yanhao; Lv, Haotian; Huyan, Ju; You, Zhanping; Hou, Yue

doi:10.1111/mice.13128

Journal article 326 views 58 downloads

Encoder–decoder with pyramid region attention for pixel‐level pavement crack recognition

Hui Yao, Yanhao Liu, Haotian Lv, Ju Huyan, Zhanping You, Yue Hou

Computer-Aided Civil and Infrastructure Engineering, Volume: 39, Issue: 10, Pages: 1490 - 1506

Swansea University Author: Yue Hou

PDF | Version of Record

© 2023 The Authors. Distributed under the terms of a Creative Commons Attribution 4.0 International License (CC BY 4.0).
Download (5.99MB)

Check full text

DOI (Published version): 10.1111/mice.13128

Abstract

Timely and accurate extraction of pavement crack information is crucial to maintain service conditions and structural safety for infrastructures and reduce further road maintenance costs. Currently, deep learning techniques for automated pavement crack detection are far superior to traditional manua...

Full description

Published in:	Computer-Aided Civil and Infrastructure Engineering
ISSN:	1093-9687 1467-8667
Published:	Wiley 2024
Online Access:	Check full text
URI:	https://cronfa.swan.ac.uk/Record/cronfa65225

Abstract:	Timely and accurate extraction of pavement crack information is crucial to maintain service conditions and structural safety for infrastructures and reduce further road maintenance costs. Currently, deep learning techniques for automated pavement crack detection are far superior to traditional manual approaches in both speed and accuracy. However, existing deep learning models may easily lose crack details when processing images containing complex background textures or other noises. Although many studies have alleviated this challenge by introducing attention mechanisms, especially the non-local (NL) block, which has the ability to efficiently capture long-range dependencies to facilitate crack pixel capture, the huge computational cost of NL makes the inference time of the model too long, which is not conducive to practical implementation. In this study, a new module, namely, the pyramid region attention module (PRAM), was developed by combining the pyramid pooling module in the pyramid scene parsing network and optimized NL, which can achieve global multi-scale context integration and long-range dependencies capture at a relatively lower computational cost. By applying PRAM to deep skip connections in the modified U-Net, an effective crack segmentation model called CrackResU-Net was developed. The test results on the existing CrackForest dataset showed that CrackResU-Net not only achieved an F1 score of 0.9580 but also took only 25.89 ms to process an image with a resolution of 480 × 320, which had advantages in accuracy and speed, compared with several other state-of-the-art crack segmentation approaches. It was fully demonstrated that this approach could realize automatic fast and high-precision recognition of pavement cracks for engineering purposes.
College:	Faculty of Science and Engineering
Funders:	The authors appreciate the financial support from Hunan Expressway Group Co. Ltd and the Hunan Department of Transportation (No. 202152) in China. The authors also appreciate the funding support from the National Natural Science Foundation of China (No. 51778038), the China National Key R&D Program (2021YFB2600600, 2021YFB2600604, 2018YFB1600202), National Natural Science Foundation of China (grant number: 52108403).
Issue:	10
Start Page:	1490
End Page:	1506

Encoder–decoder with pyramid region attention for pixel‐level pavement crack recognition

Similar Items