跳到主要导航 跳到搜索 跳到主要内容

Adversarial Discriminative Feature Separation for Generalization in Reinforcement Learning

  • Yong Liu
  • , Chunwei Wu
  • , Xidong Xi
  • , Yan Li
  • , Guitao Cao*
  • , Wenming Cao
  • , Hong Wang
  • *此作品的通讯作者
  • East China Normal University
  • Shenzhen University
  • Shanghai Research Institute of Microwave Equipment

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Imporving the generalization ability of an agent is an important and challenging task in deep reinforcement learning (RL). Procedually generated environment is an important benchmark for testing generalization in deep RL. In this benchmark, each game consists of multiple levels, each level is an algorithmically created environment instance with a unique configuration of its factors of variation. Existing methods (e.g., regularization, data augmentation) for improving the generalization of RL agent do not learn well the invariant representation among multiple levels. Besides, existing methods for learning invariant representations in RL using adversarial training can only learn invariant information across two levels. To solve this problem, we propose Adversarial Discriminative Feature Separate (ADFS). First, ADFS design a new discriminator for distinguishing whether two observations belong to the same level. Thus, the policy encoder is encouraged to learn invariant information between multiple levels. Second, it separates the representation of observation into level-invariant features and level-discriminative features, so that correction of the optimization direction of the discriminator. The discriminative features are learned by reducing the similarity of specific features intra-levels and increasing that of inter-levels, respectively. Experimental results demonstrate that our method is quite competitive with existing state-of-the-art methods on Procgen Benchmark.

源语言英语
主期刊名2022 International Joint Conference on Neural Networks, IJCNN 2022 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9781728186719
DOI
出版状态已出版 - 2022
活动2022 International Joint Conference on Neural Networks, IJCNN 2022 - Padua, 意大利
期限: 18 7月 202223 7月 2022

出版系列

姓名Proceedings of the International Joint Conference on Neural Networks
ISSN(印刷版)2161-4393
ISSN(电子版)2161-4407

会议

会议2022 International Joint Conference on Neural Networks, IJCNN 2022
国家/地区意大利
Padua
时期18/07/2223/07/22

指纹

探究 'Adversarial Discriminative Feature Separation for Generalization in Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此