Practical Reinforcement Learning For MPC: Learning from sparse objectives in under an hour on a real robot
Napat Karnchanachari, Miguel I. Valls, David Hoeller, Marco Hutter
- Year: 2020
- Citations: 7
- Access: Open access
Abstract
Model Predictive Control (MPC) is a powerful control technique that handles constraints, takes the system's dynamics into account, and optimizes for a given cost function. In practice, however, it often requires an expert to craft and tune this cost function and find trade-offs between different state penalties to satisfy simple high level objectives. In this paper, we use Reinforcement Learning and in particular value learning to approximate the value function given only high level objectives, which can be sparse and binary. Building upon previous works, we present improvements that allowed us to successfully deploy the method on a real world unmanned ground vehicle. Our experiments show that our method can learn the cost function from scratch and without human intervention, while reaching a performance level similar to that of an expert-tuned MPC. We perform a quantitative comparison of these methods with standard MPC approaches both in simulation and on the real robot.
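The idea of pairing a short-horizon MPC with a learned value function can be illustrated on a toy problem. The sketch below is not the paper's implementation: it assumes a hypothetical 1D chain of ten states, a binary cost that is zero only at the goal, and plain tabular value iteration standing in for the paper's value learning. All names (`sparse_cost`, `learn_value`, `mpc_action`, `rollout`) and constants are illustrative assumptions.

```python
import itertools

# Hypothetical toy problem: 10 states on a line, goal at state 7.
N_STATES, GOAL, GAMMA = 10, 7, 0.9
ACTIONS = (-1, 0, 1)


def step(s, a):
    """Deterministic dynamics: move left/right, clipped to the chain."""
    return min(max(s + a, 0), N_STATES - 1)


def sparse_cost(s):
    """Binary high-level objective: 0 at the goal, 1 everywhere else."""
    return 0.0 if s == GOAL else 1.0


def learn_value(sweeps=200):
    """Tabular value iteration (a stand-in for learned value approximation)."""
    V = [0.0] * N_STATES
    for _ in range(sweeps):
        V = [sparse_cost(s) + GAMMA * min(V[step(s, a)] for a in ACTIONS)
             for s in range(N_STATES)]
    return V


def mpc_action(s, V, horizon=2):
    """Enumerate short action sequences; use V as the terminal cost."""
    best_cost, best_first = float("inf"), 0
    for seq in itertools.product(ACTIONS, repeat=horizon):
        x, total = s, 0.0
        for k, a in enumerate(seq):
            total += GAMMA ** k * sparse_cost(x)
            x = step(x, a)
        total += GAMMA ** horizon * V[x]  # value function closes the horizon
        if total < best_cost:
            best_cost, best_first = total, seq[0]
    return best_first


def rollout(s, V, max_steps=20):
    """Closed-loop MPC: apply only the first action, then replan."""
    path = [s]
    while s != GOAL and len(path) <= max_steps:
        s = step(s, mpc_action(s, V))
        path.append(s)
    return path


V = learn_value()
print(rollout(0, V))
```

With a horizon of 2, the binary cost alone cannot distinguish action sequences when the goal is more than two steps away; in this sketch the terminal value term is what supplies a direction to move in, which is the role the abstract assigns to the learned value function.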