ENHANCING REINFORCEMENT LEARNING VIA CAUSALLY CORRECT INPUT IDENTIFICATION AND TARGETED INTERVENTION

  • Jiwei Shen
  • , Hu Lu
  • , Hao Zhang
  • , Shujing Lyu*
  • , Yue Lu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Causal confusion, characterized by the learning of spurious correlations, detrimentally affects the generalization and effectiveness of reinforcement learning (RL) algorithms, especially in environments without latent confounders often encountered in robot autonomous navigation tasks. This study addresses this gap by developing a causal structure within a Partially Observable Markov Decision Process (POMDP). Subsequently, we introduce a targeted intervention that mitigates the influence of spurious correlations by isolating causally significant state variables and discarding irrelevant inputs. Testing in three real-world scenarios confirms the approach's feasibility and superiority in enhancing the RL algorithms' performance and generalization ability, signifying a promising step towards more robust online RL frameworks.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages8095-8099
Number of pages5
ISBN (Electronic)9798350344851
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Korea, Republic of
Duration: 14 Apr 202419 Apr 2024

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149

Conference

Conference2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
Country/TerritoryKorea, Republic of
CitySeoul
Period14/04/2419/04/24

Keywords

  • Reinforcement learning
  • causal confusion
  • causal inference
  • navigation
  • targeted intervention

Fingerprint

Dive into the research topics of 'ENHANCING REINFORCEMENT LEARNING VIA CAUSALLY CORRECT INPUT IDENTIFICATION AND TARGETED INTERVENTION'. Together they form a unique fingerprint.

Cite this