首页 /研究 /Integrating POMDP and reinforcement learning for a two layer simulated robot architecture

LEARNING

Integrating POMDP and reinforcement learning for a two layer simulated robot architecture

Larry D. Pyeatt, Adele E. Howe

发表年份: 1999
引用次数: 12

摘要

Two layer control systems are common in robot architectures. The lower level is designed to provide fast, fine grained control while the higher level plans longer term sequences of actions to achieve some goal. Our approach uses reinforcement learning (RL) for the low level and Partially Observable Markov Decision Process (POMDP) planning for the high level. Because both levels can adapt their behavior within the scope of their tasks, the combination is expected to be robust to degradations in sensor and actuator failures and so to enhance overall system reliability. We implemented our architecture for use in the Khepera robot simulator. In a set of experiments, we show that good performance can be difficult to achieve with hand coded low level control and that performance of our RL/POMDP system degrades slowly with increasing sensor and actuator failure. 1 Introduction Two layer control systems are common in robot architectures. Two layer control is a pragmatic solution to different...

关键词

Reinforcement learningCitationPartially observable Markov decision processComputer scienceArchitectureState (computer science)Artificial intelligenceWorld Wide WebOperations researchEngineering

Integrating POMDP and reinforcement learning for a two layer simulated robot architecture

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory