首页 /研究 /Learning Control Under Uncertainty: A Probabilistic Value-Iteration Approach

LEARNING

Learning Control Under Uncertainty: A Probabilistic Value-Iteration Approach

Bastian Bischoff, Duy Nguyen-Tuong, Heiner Markert, Alois Knoll

发表年份: 2013
引用次数: 3
访问权限: 开放获取

摘要

Abstract. In this paper, we introduce a probabilistic version of the well-studied Value-Iteration approach, i.e. Probabilistic Value-Iteration (PVI). The PVI approach can handle continuous states and actions in an episodic Reinforcement Learning (RL) setting, while using Gaussian Processes to model the state uncertainties. We further show, how the approach can be efficiently realized making it suitable for learning with large data. The proposed PVI is evaluated on a benchmark problem, as well as on a real robot for learning a control task. A comparison of PVI with two state-of-the-art RL algorithms shows that the proposed approach is competitive in performance while being efficient in learning. 1

关键词

Probabilistic logicReinforcement learningBenchmark (surveying)Computer scienceArtificial intelligenceGaussian processIterative learning controlTask (project management)Machine learningState (computer science)

Learning Control Under Uncertainty: A Probabilistic Value-Iteration Approach

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory