首页 /研究 /A State Space Filter for Reinforcement Learning in Partially Observable Markov Decision Processes

LEARNING

A State Space Filter for Reinforcement Learning in Partially Observable Markov Decision Processes

Masato Nagayoshi, Hajime Murao, Hisashi Tamaki

发表年份: 2009
引用次数: 2
访问权限: 开放获取

摘要

This paper presents a technique for reinforcement learning to deal with both discrete and continuous state space systems in POMDPs while keeping the state space of an agent compact. First, in our computational model for MDP environments, a concept of “ state space filtering ” has been introduced and constructed to make the state space of an agent smaller properly by referring to “ entropy ” calculated based on the state-action mapping. The model is extended to be applicable in POMDP environments by introducing the mechanism of utilizing effectively of history information. The extended model is capable of being dealt with a continuous state space as well as a discrete state space by the extended model. Here, the mechanism of adjusting the amount of history information is also introduced so that the state space of an agent should be compact. Moreover, some computational experiments with a robot navigation problem with a continuous state space have been carried out. The potential and the effectiveness of the extended approach have been confirmed through these experiments.

关键词

Reinforcement learningPartially observable Markov decision processState spaceMarkov decision processObservableState (computer science)Computer scienceSpace (punctuation)Markov chainEntropy (arrow of time)

A State Space Filter for Reinforcement Learning in Partially Observable Markov Decision Processes

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory