POMDP-lite for robust robot planning under uncertainty

Min Chen, Emilio Frazzoli, David Hsu, Wee Sun Lee

Year of publication: 2016
Citations: 48

Abstract

The partially observable Markov decision process (POMDP) provides a principled general model for planning under uncertainty. However, solving a general POMDP is computationally intractable in the worst case. This paper introduces POMDP-lite, a subclass of POMDPs in which the hidden state variables are constant or only change deterministically. We show that a POMDP-lite is equivalent to a set of fully observable Markov decision processes indexed by a hidden parameter and is useful for modeling a variety of interesting robotic tasks. We develop a simple model-based Bayesian reinforcement learning algorithm to solve POMDP-lite models. The algorithm performs well on large-scale POMDP-lite models with up to 10^20 states and outperforms the state-of-the-art general-purpose POMDP algorithms. We further show that the algorithm is near-Bayesian-optimal under suitable conditions.
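
The equivalence stated in the abstract, a set of fully observable MDPs indexed by a hidden parameter, suggests that uncertainty can be tracked as a belief over that parameter alone. The following is a minimal sketch of such a Bayesian belief update, not the paper's algorithm: the function name `update_belief`, the array layout of `transitions`, and the two-model toy problem are all assumptions made for illustration.

```python
import numpy as np

def update_belief(belief, transitions, s, a, s_next):
    """Bayes update of a belief over a hidden parameter theta.

    Hypothetical layout (an assumption, not from the paper):
    transitions[theta, s, a, s_next] is the probability of moving from
    state s to s_next under action a in the MDP indexed by theta.
    """
    # Likelihood of the observed transition under each candidate MDP.
    likelihood = transitions[:, s, a, s_next]
    posterior = belief * likelihood
    total = posterior.sum()
    if total == 0.0:
        raise ValueError("observed transition has zero probability under every model")
    return posterior / total

# Toy example: two candidate MDPs, two states, one action.
T = np.array([
    [[[0.9, 0.1]], [[0.5, 0.5]]],  # theta = 0
    [[[0.1, 0.9]], [[0.5, 0.5]]],  # theta = 1
])
belief = np.array([0.5, 0.5])  # uniform prior over theta

# Observing the transition 0 -> 1 shifts belief toward theta = 1.
posterior = update_belief(belief, T, s=0, a=0, s_next=1)
print(posterior)
```

Because the state itself is observed, only the belief over the hidden parameter needs updating after each step, which is what keeps this update a simple reweighting of a fixed set of models.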

Keywords

Partially observable Markov decision process; Computer science; Reinforcement learning; Markov decision process; Observable; Artificial intelligence; Bayesian probability; Machine learning; Set (abstract data type); Markov process
