首页 /研究 /Practical Reinforcement Learning For MPC: Learning from sparse objectives in under an hour on a real robot

LEARNING

Practical Reinforcement Learning For MPC: Learning from sparse objectives in under an hour on a real robot

Napat Karnchanachari, Miguel de la Iglesia Valls, David Hoeller, Marco Hutter

发表年份: 2020
引用次数: 14
访问权限: 开放获取

摘要

Model Predictive Control (MPC) is a powerful control technique that handles constraints, takes the system's dynamics into account, and optimizes for a given cost function. In practice, however, it often requires an expert to craft and tune this cost function and find trade-offs between different state penalties to satisfy simple high level objectives. In this paper, we use Reinforcement Learning and in particular value learning to approximate the value function given only high level objectives, which can be sparse and binary. Building upon previous works, we present improvements that allowed us to successfully deploy the method on a real world unmanned ground vehicle. Our experiments show that our method can learn the cost function from scratch and without human intervention, while reaching a performance level similar to that of an expert-tuned MPC. We perform a quantitative comparison of these methods with standard MPC approaches both in simulation and on the real robot.

关键词

Reinforcement learningComputer scienceArtificial intelligenceFunction (biology)ScratchModel predictive controlBellman equationRobotMachine learningControl (management)

Practical Reinforcement Learning For MPC: Learning from sparse objectives in under an hour on a real robot

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory