TY - GEN
T1 - ENHANCING REINFORCEMENT LEARNING VIA CAUSALLY CORRECT INPUT IDENTIFICATION AND TARGETED INTERVENTION
AU - Shen, Jiwei
AU - Lu, Hu
AU - Zhang, Hao
AU - Lyu, Shujing
AU - Lu, Yue
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Causal confusion, characterized by the learning of spurious correlations, detrimentally affects the generalization and effectiveness of reinforcement learning (RL) algorithms, especially in environments without latent confounders often encountered in robot autonomous navigation tasks. This study addresses this gap by developing a causal structure within a Partially Observable Markov Decision Process (POMDP). Subsequently, we introduce a targeted intervention that mitigates the influence of spurious correlations by isolating causally significant state variables and discarding irrelevant inputs. Testing in three real-world scenarios confirms the approach's feasibility and superiority in enhancing the RL algorithms' performance and generalization ability, signifying a promising step towards more robust online RL frameworks.
AB - Causal confusion, characterized by the learning of spurious correlations, detrimentally affects the generalization and effectiveness of reinforcement learning (RL) algorithms, especially in environments without latent confounders often encountered in robot autonomous navigation tasks. This study addresses this gap by developing a causal structure within a Partially Observable Markov Decision Process (POMDP). Subsequently, we introduce a targeted intervention that mitigates the influence of spurious correlations by isolating causally significant state variables and discarding irrelevant inputs. Testing in three real-world scenarios confirms the approach's feasibility and superiority in enhancing the RL algorithms' performance and generalization ability, signifying a promising step towards more robust online RL frameworks.
KW - Reinforcement learning
KW - causal confusion
KW - causal inference
KW - navigation
KW - targeted intervention
UR - https://www.scopus.com/pages/publications/85195427792
U2 - 10.1109/ICASSP48485.2024.10446642
DO - 10.1109/ICASSP48485.2024.10446642
M3 - 会议稿件
AN - SCOPUS:85195427792
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 8095
EP - 8099
BT - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
Y2 - 14 April 2024 through 19 April 2024
ER -