Online Sensor Selection in Reinforcement Learning Environment

Kôichirô Ishikawa, Akito Sakurai, Tsutomu Fujinami, Susumu Kunifuji

发表年份: 2005
引用次数: 2
访问权限: 开放获取

摘要

More sensors do not necessarily result in more appropriate state descriptions, so that a mobile robot has to select an appropriate set of sensors besides learning a state-action function in a reinforcement learning environment. We present a multi-armed bandit formulation of the problem and apply it to mobile robot navigation task. We modified the reinforcement comparison method to suit our problem and build a system where the selection of optimal set of sensors and the learning of state-action functions are done simultaneously. Our approach is evaluated on a Khepera robot simulator and the results reveal that our approach works well as an integrated learning system to identify the best set of sensors and reduce learning time.

关键词

Reinforcement learningAction selectionComputer scienceSet (abstract data type)Artificial intelligenceTask (project management)Mobile robotRobotSelection (genetic algorithm)Robot learning

Online Sensor Selection in Reinforcement Learning Environment

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory