Automatic discovery and transfer of MAXQ hierarchies in a complex system

Hongbing Wang*, Wenya Li, Xuan Zhou

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Reinforcement learning has been an important category of machine learning approaches exhibiting self-learning and online learning characteristics. Using reinforcement learning, an agent can learn its behaviors through trial-and-error interactions with a dynamic environment and finally come up with an optimal strategy. Reinforcement learning suffers the curse of dimensionality, though there has been significant progress to overcome this issue in recent years. MAXQ is one of the most common approaches for reinforcement learning. To function properly, MAXQ requires a decomposition of the agent's task into a task hierarchy. Previously, the decomposition can only be done manually. In this paper, we propose a mechanism for automatic subtask discovery. The mechanism applies clustering to automatically construct task hierarchy required by MAXQ, such that MAXQ can be fully automated. We present the design of our mechanism, and demonstrate its effectiveness through theoretical analysis and an extensive experimental evaluation.

Original languageEnglish
Title of host publicationProceedings - 2012 IEEE 24th International Conference on Tools with Artificial Intelligence, ICTAI 2012
Pages1157-1162
Number of pages6
DOIs
StatePublished - 2012
Externally publishedYes
Event2012 IEEE 24th International Conference on Tools with Artificial Intelligence, ICTAI 2012 - Athens, Greece
Duration: 7 Nov 20129 Nov 2012

Publication series

NameProceedings - International Conference on Tools with Artificial Intelligence, ICTAI
Volume1
ISSN (Print)1082-3409

Conference

Conference2012 IEEE 24th International Conference on Tools with Artificial Intelligence, ICTAI 2012
Country/TerritoryGreece
CityAthens
Period7/11/129/11/12

Keywords

  • Clustering
  • MAXQ
  • Reinforcement Learning
  • System of Systems

Fingerprint

Dive into the research topics of 'Automatic discovery and transfer of MAXQ hierarchies in a complex system'. Together they form a unique fingerprint.

Cite this