TY - JOUR
T1 - Micro-YOLO+
T2 - Searching Optimal Methods for Compressing Object Detection Model Based on Speed, Size, Cost, and Accuracy
AU - Hu, Lining
AU - Zhang, Yuhang
AU - Zhao, Yang
AU - Wu, Tong
AU - Li, Yongfu
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
PY - 2022/9
Y1 - 2022/9
N2 - Convolutional neural networks play a great role in solving the problem of object detection. However, conventional object detection models, such as YOLO and SSD, are usually too large to be deployed on embedded devices due to their restricted resources and low power requirements. In this paper, several efficient methods are explored to balance model size, network accuracy, and inference speed. We explore elective lightweight convolutional layers to supplant the convolutional layers (Conv) in the YOLOv3-tiny network, including the depth-wise separable convolution (DSConv), the mobile inverted bottleneck convolution with squeeze and excitation block (MBConv) and the ghost module (GConv). Moreover, we explore the optimal hyper-parameters of the network and use the improved NMS algorithm, Cluster-NMS. Moreover, a new object detection model, Micro-YOLO+, which achieves a signification reduction in the number of parameters and computation cost while maintaining the performance is proposed. Our Micro-YOLO+ network reduces the number of parameters by 3.18× and multiply-accumulate operation (MAC) by 2.44× while increases the mAP evaluated on the COCO2014 dataset by 1.6%, compared to the original YOLOv3-tiny network.
AB - Convolutional neural networks play a great role in solving the problem of object detection. However, conventional object detection models, such as YOLO and SSD, are usually too large to be deployed on embedded devices due to their restricted resources and low power requirements. In this paper, several efficient methods are explored to balance model size, network accuracy, and inference speed. We explore elective lightweight convolutional layers to supplant the convolutional layers (Conv) in the YOLOv3-tiny network, including the depth-wise separable convolution (DSConv), the mobile inverted bottleneck convolution with squeeze and excitation block (MBConv) and the ghost module (GConv). Moreover, we explore the optimal hyper-parameters of the network and use the improved NMS algorithm, Cluster-NMS. Moreover, a new object detection model, Micro-YOLO+, which achieves a signification reduction in the number of parameters and computation cost while maintaining the performance is proposed. Our Micro-YOLO+ network reduces the number of parameters by 3.18× and multiply-accumulate operation (MAC) by 2.44× while increases the mAP evaluated on the COCO2014 dataset by 1.6%, compared to the original YOLOv3-tiny network.
KW - Convolutional neural network
KW - Model compression
KW - Object detection
UR - https://www.scopus.com/pages/publications/85134658241
U2 - 10.1007/s42979-022-01299-3
DO - 10.1007/s42979-022-01299-3
M3 - 文章
AN - SCOPUS:85134658241
SN - 2662-995X
VL - 3
JO - SN Computer Science
JF - SN Computer Science
IS - 5
M1 - 391
ER -