Supervised Actor-Critic Reinforcement Learning

Michael T. Rosenstein, Andrew G. Barto, Jennie Si, Andy Barto, Warren B. Powell, Donald C. Wunsch

Year: 2012
Citations: 69

Abstract

Editor’s Summary: Chapter?? introduced policy gradients as a way to improve on stochastic search of the policy space when learning. This chapter presents supervised actor-critic reinforcement learning as another method for improving the effectiveness of learning. With this approach, a supervisor adds structure to a learning problem and supervised learning makes that structure part of an actor-critic framework for reinforcement learning. Theoretical background and a detailed algorithm description are provided, along with several examples that contain enough detail to make them easy to understand and possible to duplicate. These examples also illustrate the use of two kinds of supervisors: a feedback controller that is easily designed yet sub-optimal, and a human operator providing intermittent control of a simulated robotic arm. 1.1

Keywords

Reinforcement learningArtificial intelligenceReinforcementPsychologyComputer scienceSocial psychology

Supervised Actor-Critic Reinforcement Learning

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory