跳到主要导航 跳到搜索 跳到主要内容

ProEqBEV: Product Group Equivariant BEV Network for 3D Object Detection in Road Scenes of Autonomous Driving

  • East China Normal University
  • Information Engineering University
  • Technical University of Munich

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

With the rapid development of autonomous driving systems, 3D object detection based on Bird's Eye View (BEV) in road scenes has witnessed great progress over the past few years. As a road scene exhibits a part-whole hierarchy between the within objects and the scene itself, simple parts (e.g., roads, lane lines, vehicles and pedestrians) can be assembled into progressively more complex shapes to form a BEV representation of the whole road scene. Therefore, a BEV often has multiple levels of freedom on motion, i.e., the rotation and the moving shift of the whole BEV, and the random movements of objects (e.g., pedestrians and vehicles) inside the BEV. However, most of the current single-sensor or multi-sensor fusion-based BEV object detection methods have not yet taken into account capturing such multi-level motion in a BEV. To address this problem, we propose a product group equivariant object detection network framework that is equivariant with respect to multiple levels of symmetry groups based on multi-sensor fusion. The proposed framework extracts local equivariant features of objects in point clouds, while global equivariant features are extracted in both point clouds and images. Furthermore, the network learns diverse rotation-equivariant features and mitigates a significant amount of detection errors caused by rotations of BEV and objects inside a BEV, thereby further enhancing the performance of object detection. The experiment results show that the network architecture significantly improves object detection on mAP and NDS, respectively. In addition, in order to demonstrate the effectiveness of the proposed local-multi-global equivariant components, we conduct sufficient ablation experiments. The results show that the individual components are indispensable for the object detection performance improvement of the overall network architecture.

源语言英语
主期刊名2024 IEEE International Conference on Robotics and Automation, ICRA 2024
出版商Institute of Electrical and Electronics Engineers Inc.
16178-16184
页数7
ISBN(电子版)9798350384574
DOI
出版状态已出版 - 2024
活动2024 IEEE International Conference on Robotics and Automation, ICRA 2024 - Yokohama, 日本
期限: 13 5月 202417 5月 2024

出版系列

姓名Proceedings - IEEE International Conference on Robotics and Automation
ISSN(印刷版)1050-4729

会议

会议2024 IEEE International Conference on Robotics and Automation, ICRA 2024
国家/地区日本
Yokohama
时期13/05/2417/05/24

指纹

探究 'ProEqBEV: Product Group Equivariant BEV Network for 3D Object Detection in Road Scenes of Autonomous Driving' 的科研主题。它们共同构成独一无二的指纹。

引用此