TY - GEN
T1 - Adaptive Object Tracking and Detection for Video Recognition in Mobile Edge Networks
AU - Shen, Yun
AU - Guo, Kun
AU - Wang, Xijun
AU - Gao, Ruifeng
AU - Rui, Yun
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Video recognition requires rapid and accurate object recognition across continuous frames, presenting significant challenges for computing-constrained devices such as traffic cameras. The advancements in mobile edge computing have enabled the deployment of high-accuracy neural networks on edge servers for object detection. Combined with lightweight object tracking algorithms on the local device, this approach emerges as a highly promising solution. However, determining when to perform edge detection or local tracking remains challenging. To address this, we first formulate a long-term optimization problem that accounts for the temporal correlation of frames and the dynamic nature of mobile edge networks. The achievable reward for each frame is defined as a weighted sum of recognition accuracy, handling delay, and waiting delay. Then, we propose the LTED-Ada, a deep reinforcement learning based algorithm that adaptively selects between local object tracking and edge object detection for video recognition. Finally, we conduct hardware-in-the-loop experiments to confirm LTED-Ada's superior performance across diverse frame rates and device requirements.
AB - Video recognition requires rapid and accurate object recognition across continuous frames, presenting significant challenges for computing-constrained devices such as traffic cameras. The advancements in mobile edge computing have enabled the deployment of high-accuracy neural networks on edge servers for object detection. Combined with lightweight object tracking algorithms on the local device, this approach emerges as a highly promising solution. However, determining when to perform edge detection or local tracking remains challenging. To address this, we first formulate a long-term optimization problem that accounts for the temporal correlation of frames and the dynamic nature of mobile edge networks. The achievable reward for each frame is defined as a weighted sum of recognition accuracy, handling delay, and waiting delay. Then, we propose the LTED-Ada, a deep reinforcement learning based algorithm that adaptively selects between local object tracking and edge object detection for video recognition. Finally, we conduct hardware-in-the-loop experiments to confirm LTED-Ada's superior performance across diverse frame rates and device requirements.
KW - Mobile edge computing
KW - object detection
KW - object tracking
KW - video recognition
UR - https://www.scopus.com/pages/publications/105017781630
U2 - 10.1109/ICCC65529.2025.11149285
DO - 10.1109/ICCC65529.2025.11149285
M3 - 会议稿件
AN - SCOPUS:105017781630
T3 - 2025 IEEE/CIC International Conference on Communications in China:Shaping the Future of Integrated Connectivity, ICCC 2025
BT - 2025 IEEE/CIC International Conference on Communications in China:Shaping the Future of Integrated Connectivity, ICCC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 IEEE/CIC International Conference on Communications in China, ICCC 2025
Y2 - 10 August 2025 through 13 August 2025
ER -