TY - JOUR
T1 - Multi-scale attention-enhanced deep learning approach for detecting seven trunk pests and diseases in Shanghai’s urban plane trees
AU - Song, Tianyang
AU - Hu, Guohua
AU - Yu, Tianci
AU - Meng, Xing
AU - Zhang, Yanting
AU - Yang, Ruiqing
AU - Wang, Benyao
AU - Li, Xia
N1 - Publisher Copyright:
© 2025 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2025
Y1 - 2025
N2 - Urban street trees, particularly plane trees in Shanghai, are susceptible to pests and diseases like holes, decay, termite, and longicorn. Traditional manual inspections are labor-intensive and error-prone. This study introduces an enhanced YOLOv8-based detection framework to address multi-scale variability in pest and disease datasets. Key improvements include integrating the Selective Kernel Attention (SKA) mechanism for multi-scale feature enhancement, augmenting the Spatial Pyramid Pooling Fast (SPPF) module with average pooling to preserve subtle details, and employing the WIoUv3 loss function to improve localization accuracy. Trained on 3,983 annotated samples from Shanghai, the model achieved a 3.8% increase in mean Average Precision at a 50% Intersection over Union threshold (mAP50) and a significant reduction in missed detections compared to the baseline YOLOv8. Robustness was validated through experiments with varied parameters, repeated training sessions, and stratified sampling addressing class imbalance. This research demonstrates the model's potential for intelligent, automated monitoring of urban tree health.
AB - Urban street trees, particularly plane trees in Shanghai, are susceptible to pests and diseases like holes, decay, termite, and longicorn. Traditional manual inspections are labor-intensive and error-prone. This study introduces an enhanced YOLOv8-based detection framework to address multi-scale variability in pest and disease datasets. Key improvements include integrating the Selective Kernel Attention (SKA) mechanism for multi-scale feature enhancement, augmenting the Spatial Pyramid Pooling Fast (SPPF) module with average pooling to preserve subtle details, and employing the WIoUv3 loss function to improve localization accuracy. Trained on 3,983 annotated samples from Shanghai, the model achieved a 3.8% increase in mean Average Precision at a 50% Intersection over Union threshold (mAP50) and a significant reduction in missed detections compared to the baseline YOLOv8. Robustness was validated through experiments with varied parameters, repeated training sessions, and stratified sampling addressing class imbalance. This research demonstrates the model's potential for intelligent, automated monitoring of urban tree health.
KW - Urban green space
KW - deep learning
KW - street tree pests
KW - target detection
UR - https://www.scopus.com/pages/publications/105011849526
U2 - 10.1080/17538947.2025.2537321
DO - 10.1080/17538947.2025.2537321
M3 - 文章
AN - SCOPUS:105011849526
SN - 1753-8947
VL - 18
JO - International Journal of Digital Earth
JF - International Journal of Digital Earth
IS - 1
M1 - 2537321
ER -