Controlled Use of Subgoals in Reinforcement Learning

Junichi Murata

发表年份: 2008
引用次数: 4
访问权限: 开放获取

摘要

Reinforcement learning A learning agent observes the state of its environment, chooses an action based on its current policy and executes the action. Responding to the action, the environment transitions to a new state, and a reword is given to the agent when applicable. The reward indicates how good or how bad the new state is, and the agent uses it to improve its policy so that it can obtain more rewards. Since reinforcement learning (abbreviated as RL hereafter) requires no other information, e.g. a model of environment, than the perceived states and rewards, it can be applied to a class of problems where the environment is complex or uncertain. The applications of RL include control of multi-legged robots (Kimura et al.,

关键词

Reinforcement learningComputer scienceAction (physics)Artificial intelligenceClass (philosophy)State (computer science)ElevatorA priori and a posterioriRobotHuman–computer interaction

Controlled Use of Subgoals in Reinforcement Learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory