跳到主要导航 跳到搜索 跳到主要内容

Sequential Viewpoint Selection and Grasping with Partial Observability Reinforcement Learning

  • Weiwen Chen*
  • , Yun Hua
  • , Bo Jin
  • , Jun Zhu
  • , Quanbo Ge
  • , Xiangfeng Wang
  • *此作品的通讯作者
  • East China Normal University
  • Nanjing University of Information Science & Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Despite the success of vision-based object grasping due to deep learning development, fixed-view object grasping methods still face information loss with limited performance. Recently some rule-based or heuristic-based methods have begun to sequentially consider multiple views to improve the perceptibility of the environment, which shows better performance. However, their sequence lengths are too short, or their viewpoint selection is myopic and ignores the long-term effect. This paper models sequential viewpoints selection as a Markov Decision Process. The Sequential Decided Multi-View Grasping (SDMVG) method is proposed based on reinforcement learning, and an RNN-based policy is introduced. Considering long-term return, SDMVG can generate viewpoints sequence which achieves most information gain. Numerical experiments show SDMVG can achieve 10% accuracy improvement compared with rule-or heuristic-based baselines on Multi-View GraspNet Benchmark. Moreover, SDMVG approaches the global optimum with only 1/40 wall time compared with the brute-force method.

源语言英语
主期刊名Proceedings - 2022 37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022
出版商Institute of Electrical and Electronics Engineers Inc.
1125-1129
页数5
ISBN(电子版)9781665465366
DOI
出版状态已出版 - 2022
活动37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022 - Beijing, 中国
期限: 19 11月 202220 11月 2022

出版系列

姓名Proceedings - 2022 37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022

会议

会议37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022
国家/地区中国
Beijing
时期19/11/2220/11/22

指纹

探究 'Sequential Viewpoint Selection and Grasping with Partial Observability Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此