跳到主要导航 跳到搜索 跳到主要内容

InCLET: Large Language Model In-context Learning can Improve Embodied Instruction-following

  • Peng Yuan Wang
  • , Jing Cheng Pang
  • , Chen Yang Wang
  • , Xuhui Liu
  • , Tian Shuo Liu
  • , Si Hang Yang
  • , Hong Qian
  • , Yang Yu*
  • *此作品的通讯作者
  • Nanjing University
  • Polixir Technologies

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Natural language-conditioned reinforcement learning (NLC-RL) empowers embodied agent to complete various tasks following human instruction. However, the unbounded natural language examples still introduce much complexity for the agent that solves concrete RL tasks, which can distract policy learning from completing the task. Consequently, extracting effective task representation from human instruction emerges as the critical component of NLC-RL. While previous methods have attempted to address this issue by learning task-related representation using large language models (LLMs), they highly rely on pre-collected task data and require extra training procedure. In this study, we uncover the inherent capability of LLMs to generate task representations and present a novel method, in-context learning embedding as task representation (InCLET). InCLET is grounded on a foundational finding that LLM in-context learning using trajectories can greatly help represent tasks. We thus firstly employ LLM to imagine task trajectories following the natural language instruction, then use in-context learning of LLM to generate task representations, and finally aggregate and project into a compact low-dimensional task representation. This representation is then used to train a human instruction-following agent. We conduct experiments on various embodied control environments and results show that InCLET creates effective task representations. Furthermore, this representation can significantly improve the RL training efficiency, compared to the baseline methods.

源语言英语
主期刊名Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025
编辑Yevgeniy Vorobeychik, Sanmay Das, Ann Nowe
出版商International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
2134-2142
页数9
ISBN(电子版)9798400714269
出版状态已出版 - 2025
活动24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025 - Detroit, 美国
期限: 19 5月 202523 5月 2025

出版系列

姓名Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
ISSN(印刷版)1548-8403
ISSN(电子版)1558-2914

会议

会议24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025
国家/地区美国
Detroit
时期19/05/2523/05/25

指纹

探究 'InCLET: Large Language Model In-context Learning can Improve Embodied Instruction-following' 的科研主题。它们共同构成独一无二的指纹。

引用此