Sequential Viewpoint Selection and Grasping with Partial Observability Reinforcement Learning

Weiwen Chen, Yun Hua, Bo Jin, Jun Zhu, Quanbo Ge, Xiangfeng Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Despite the success of vision-based object grasping due to deep learning development, fixed-view object grasping methods still face information loss with limited performance. Recently some rule-based or heuristic-based methods have begun to sequentially consider multiple views to improve the perceptibility of the environment, which shows better performance. However, their sequence lengths are too short, or their viewpoint selection is myopic and ignores the long-term effect. This paper models sequential viewpoints selection as a Markov Decision Process. The Sequential Decided Multi-View Grasping (SDMVG) method is proposed based on reinforcement learning, and an RNN-based policy is introduced. Considering long-term return, SDMVG can generate viewpoints sequence which achieves most information gain. Numerical experiments show SDMVG can achieve 10% accuracy improvement compared with rule-or heuristic-based baselines on Multi-View GraspNet Benchmark. Moreover, SDMVG approaches the global optimum with only 1/40 wall time compared with the brute-force method.

Original languageEnglish
Title of host publicationProceedings - 2022 37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1125-1129
Number of pages5
ISBN (Electronic)9781665465366
DOIs
StatePublished - 2022
Event37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022 - Beijing, China
Duration: 19 Nov 202220 Nov 2022

Publication series

NameProceedings - 2022 37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022

Conference

Conference37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022
Country/TerritoryChina
CityBeijing
Period19/11/2220/11/22

Keywords

  • Object Grasping
  • Reinforcement
  • Robotic

Fingerprint

Dive into the research topics of 'Sequential Viewpoint Selection and Grasping with Partial Observability Reinforcement Learning'. Together they form a unique fingerprint.

Cite this