Learning Options From Demonstrations: A Pac-Man Case Study

Marco Tamassia, Fabio Zambetta, William Raffe, Florian Mueller, Xiaodong Li

Year of publication: 2017
Citations: 7

Abstract

Reinforcement learning (RL) is a machine learning paradigm behind many successes in games, robotics, and control applications. RL agents improve through trial-and-error, therefore undergoing a learning phase during which they perform suboptimally. Research effort has been put into optimizing behavior during this period, to reduce its duration and to maximize after-learning performance. We introduce a novel algorithm that extracts useful information from expert demonstrations (traces of interactions with the target environment) and uses it to improve performance. The algorithm detects unexpected decisions made by the expert and infers what goal the expert was pursuing. Goals are then used to bias decisions while learning. Our experiments in the video game Pac-Man provide statistically significant evidence that our method can improve final performance compared to a state-of-the-art approach.

Keywords

Reinforcement learning, Computer science, Artificial intelligence, Machine learning, Duration (music), Robotics, Control (management), Robot
