首页 /研究 /Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control
LEARNING

Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

Sanket Kamthe, Marc Peter Deisenroth

发表年份
2017
引用次数
74

摘要

Trial-and-error based reinforcement learning
\n(RL) has seen rapid advancements in recent
\ntimes, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A
\nlarge number of interactions may be impractical in many real-world applications, such as
\nrobotics, and many practical systems have to
\nobey limitations in the form of state space
\nor control constraints. To reduce the number
\nof system interactions while simultaneously
\nhandling constraints, we propose a modelbased RL framework based on probabilistic
\nModel Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs)
\nto incorporate model uncertainty into longterm predictions, thereby, reducing the impact of model errors. We then use MPC to
\nfind a control sequence that minimises the
\nexpected long-term cost. We provide theoretical guarantees for first-order optimality in
\nthe GP-based transition models with deterministic approximate inference for long-term
\nplanning. We demonstrate that our approach
\ndoes not only achieve state-of-the-art data
\nefficiency, but also is a principled way for RL
\nin constrained environments.

关键词

Reinforcement learningProbabilistic logicComputer scienceArtificial intelligenceModel predictive controlTerm (time)Machine learningState spaceInferenceRobotics

相关论文

查看 LEARNING 分类全部论文