首页 /研究 /Controlled Use of Subgoals in Reinforcement Learning
LOCOMOTION

Controlled Use of Subgoals in Reinforcement Learning

Junichi Murata

发表年份
2008
引用次数
4
访问权限
开放获取

摘要

Reinforcement learning A learning agent observes the state of its environment, chooses an action based on its current policy and executes the action. Responding to the action, the environment transitions to a new state, and a reword is given to the agent when applicable. The reward indicates how good or how bad the new state is, and the agent uses it to improve its policy so that it can obtain more rewards. Since reinforcement learning (abbreviated as RL hereafter) requires no other information, e.g. a model of environment, than the perceived states and rewards, it can be applied to a class of problems where the environment is complex or uncertain. The applications of RL include control of multi-legged robots (Kimura et al.,

关键词

Reinforcement learningComputer scienceAction (physics)Artificial intelligenceClass (philosophy)State (computer science)ElevatorA priori and a posterioriRobotHuman–computer interaction

相关论文

查看 LOCOMOTION 分类全部论文