Invariant Adversarial Imitation Learning From Visual Inputs

  • Haoran Zhang
  • , Yinghong Tian*
  • , Liang Yuan
  • , Yue Lu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Generalization across environments is critical when using imitation learning algorithms in real-world applications. In this paper, we propose an invariant model-based adversarial imitation learning (IMAIL) method to improve generalization. IMAIL develops a variational dynamics model providing rich auxiliary objectives for efficiently learning compact state representations. The latent representations are then regularized using mutual information constraints, guaranteeing that they are insensitive to environmental changes. Based on such representations, we utilize model-based adversarial imitation learning to mimic expert behavior in the latent space. As a result, the learned policies are well generalized in unseen environments. We conduct experiments with several vision-based control tasks to demonstrate the performance of IMAIL. Experimental results show that IMAIL significantly outperforms existing baselines and successfully achieves expert-level performance in all unseen test environments.

Original languageEnglish
Title of host publicationICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728163277
DOIs
StatePublished - 2023
Event48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023 - Rhodes Island, Greece
Duration: 4 Jun 202310 Jun 2023

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2023-June
ISSN (Print)1520-6149

Conference

Conference48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Country/TerritoryGreece
CityRhodes Island
Period4/06/2310/06/23

Fingerprint

Dive into the research topics of 'Invariant Adversarial Imitation Learning From Visual Inputs'. Together they form a unique fingerprint.

Cite this