跳到主要导航 跳到搜索 跳到主要内容

ENHANCING REINFORCEMENT LEARNING VIA CAUSALLY CORRECT INPUT IDENTIFICATION AND TARGETED INTERVENTION

  • Jiwei Shen
  • , Hu Lu
  • , Hao Zhang
  • , Shujing Lyu*
  • , Yue Lu
  • *此作品的通讯作者
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Causal confusion, characterized by the learning of spurious correlations, detrimentally affects the generalization and effectiveness of reinforcement learning (RL) algorithms, especially in environments without latent confounders often encountered in robot autonomous navigation tasks. This study addresses this gap by developing a causal structure within a Partially Observable Markov Decision Process (POMDP). Subsequently, we introduce a targeted intervention that mitigates the influence of spurious correlations by isolating causally significant state variables and discarding irrelevant inputs. Testing in three real-world scenarios confirms the approach's feasibility and superiority in enhancing the RL algorithms' performance and generalization ability, signifying a promising step towards more robust online RL frameworks.

源语言英语
主期刊名2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
8095-8099
页数5
ISBN(电子版)9798350344851
DOI
出版状态已出版 - 2024
活动2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, 韩国
期限: 14 4月 202419 4月 2024

出版系列

姓名ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN(印刷版)1520-6149

会议

会议2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
国家/地区韩国
Seoul
时期14/04/2419/04/24

指纹

探究 'ENHANCING REINFORCEMENT LEARNING VIA CAUSALLY CORRECT INPUT IDENTIFICATION AND TARGETED INTERVENTION' 的科研主题。它们共同构成独一无二的指纹。

引用此