TY - GEN
T1 - Invariant Adversarial Imitation Learning From Visual Inputs
AU - Zhang, Haoran
AU - Tian, Yinghong
AU - Yuan, Liang
AU - Lu, Yue
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Generalization across environments is critical when using imitation learning algorithms in real-world applications. In this paper, we propose an invariant model-based adversarial imitation learning (IMAIL) method to improve generalization. IMAIL develops a variational dynamics model providing rich auxiliary objectives for efficiently learning compact state representations. The latent representations are then regularized using mutual information constraints, guaranteeing that they are insensitive to environmental changes. Based on such representations, we utilize model-based adversarial imitation learning to mimic expert behavior in the latent space. As a result, the learned policies are well generalized in unseen environments. We conduct experiments with several vision-based control tasks to demonstrate the performance of IMAIL. Experimental results show that IMAIL significantly outperforms existing baselines and successfully achieves expert-level performance in all unseen test environments.
AB - Generalization across environments is critical when using imitation learning algorithms in real-world applications. In this paper, we propose an invariant model-based adversarial imitation learning (IMAIL) method to improve generalization. IMAIL develops a variational dynamics model providing rich auxiliary objectives for efficiently learning compact state representations. The latent representations are then regularized using mutual information constraints, guaranteeing that they are insensitive to environmental changes. Based on such representations, we utilize model-based adversarial imitation learning to mimic expert behavior in the latent space. As a result, the learned policies are well generalized in unseen environments. We conduct experiments with several vision-based control tasks to demonstrate the performance of IMAIL. Experimental results show that IMAIL significantly outperforms existing baselines and successfully achieves expert-level performance in all unseen test environments.
UR - https://www.scopus.com/pages/publications/85177576175
U2 - 10.1109/ICASSP49357.2023.10096070
DO - 10.1109/ICASSP49357.2023.10096070
M3 - 会议稿件
AN - SCOPUS:85177576175
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
BT - ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Y2 - 4 June 2023 through 10 June 2023
ER -