首页 /研究 /Adaptive Probabilistic Trajectory Optimization via Efficient Approximate Inference

LEARNING

Adaptive Probabilistic Trajectory Optimization via Efficient Approximate Inference

Yunpeng Pan, Xinyan Yan, Evangelos A. Theodorou, Byron Boots

发表年份: 2016
引用次数: 7
访问权限: 开放获取

摘要

Robotic systems must be able to quickly and robustly make decisions when operating in uncertain and dynamic environments. While Reinforcement Learning (RL) can be used to compute optimal policies with little prior knowledge about the environment, it suffers from slow convergence. An alternative approach is Model Predictive Control (MPC), which optimizes policies quickly, but also requires accurate models of the system dynamics and environment. In this paper we propose a new approach, adaptive probabilistic trajectory optimization, that combines the benefits of RL and MPC. Our method uses scalable approximate inference to learn and updates probabilistic models in an online incremental fashion while also computing optimal control policies via successive local approximations. We present two variations of our algorithm based on the Sparse Spectrum Gaussian Process (SSGP) model, and we test our algorithm on three learning tasks, demonstrating the effectiveness and efficiency of our approach.

关键词

Computer scienceProbabilistic logicGaussian processInferenceTrajectoryReinforcement learningScalabilityConvergence (economics)Approximate inferenceMathematical optimization

Adaptive Probabilistic Trajectory Optimization via Efficient Approximate Inference

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory