首页 /研究 /POMDP-lite for robust robot planning under uncertainty
LEARNING

POMDP-lite for robust robot planning under uncertainty

Min Chen, Emilio Frazzoli, David Hsu, Wee Sun Lee

发表年份
2016
引用次数
48

摘要

The partially observable Markov decision process (POMDP) provides a principled general model for planning under uncertainty. However, solving a general POMDP is computationally intractable in the worst case. This paper introduces POMDP-lite, a subclass of POMDPs in which the hidden state variables are constant or only change deterministically. We show that a POMDP-lite is equivalent to a set of fully observable Markov decision processes indexed by a hidden parameter and is useful for modeling a variety of interesting robotic tasks. We develop a simple model-based Bayesian reinforcement learning algorithm to solve POMDP-lite models. The algorithm performs well on large-scale POMDP-lite models with up to 1020 states and outperforms the state-of-the-art general-purpose POMDP algorithms. We further show that the algorithm is near-Bayesian-optimal under suitable conditions.

关键词

Partially observable Markov decision processComputer scienceReinforcement learningMarkov decision processObservableArtificial intelligenceBayesian probabilityMachine learningSet (abstract data type)Markov process

相关论文

查看 LEARNING 分类全部论文