首页 /研究 /Learning to Predict Consequences as a Method of Knowledge Transfer in Reinforcement Learning

LEARNING

Learning to Predict Consequences as a Method of Knowledge Transfer in Reinforcement Learning

Eric Chalmers, Edgar Bermudez-Contreras, Brandon Robertson, Artur Luczak, Aaron J. Gruber

发表年份: 2017
引用次数: 53

摘要

The reinforcement learning (RL) paradigm allows agents to solve tasks through trial-and-error learning. To be capable of efficient, long-term learning, RL agents should be able to apply knowledge gained in the past to new tasks they may encounter in the future. The ability to predict actions' consequences may facilitate such knowledge transfer. We consider here domains where an RL agent has access to two kinds of information: agent-centric information with constant semantics across tasks, and environment-centric information, which is necessary to solve the task, but with semantics that differ between tasks. For example, in robot navigation, environment-centric information may include the robot's geographic location, while agent-centric information may include sensor readings of various nearby obstacles. We propose that these situations provide an opportunity for a very natural style of knowledge transfer, in which the agent learns to predict actions' environmental consequences using agent-centric information. These predictions contain important information about the affordances and dangers present in a novel environment, and can effectively transfer knowledge from agent-centric to environment-centric learning systems. Using several example problems including spatial navigation and network routing, we show that our knowledge transfer approach can allow faster and lower cost learning than existing alternatives.

关键词

Computer scienceReinforcement learningAffordanceRobotHuman–computer interactionArtificial intelligenceRobot learningSemantics (computer science)Knowledge transferTransfer of learning

Learning to Predict Consequences as a Method of Knowledge Transfer in Reinforcement Learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory