首页 /研究 /A formal framework for robot learning and control under model uncertainty
LEARNING

A formal framework for robot learning and control under model uncertainty

Robin Jaulmes, Joëlle Pineau, Doina Precup

发表年份
2007
引用次数
22

摘要

While the partially observable Markov decision process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and stationary model of the environment. In this paper, we study the problem of finding an optimal policy for controlling a robot in a partially observable domain, where the model is not perfectly known, and may change over time. We present an algorithm called MEDUSA which incrementally learns a POMDP model using queries, while still optimizing a reward function. We demonstrate effectiveness of the approach for a simple scenario, where a robot seeking a person has minimal a priori knowledge of its own sensor model, as well as where the person is located.

关键词

Partially observable Markov decision processA priori and a posterioriComputer scienceRobotMarkov decision processObservableDomain (mathematical analysis)Markov processProcess (computing)Artificial intelligence

相关论文

查看 LEARNING 分类全部论文