Online Sensor Selection in Reinforcement Learning Environment
Kôichirô Ishikawa, Akito Sakurai, Tsutomu Fujinami, Susumu Kunifuji
- 发表年份
- 2005
- 引用次数
- 2
- 访问权限
- 开放获取
摘要
More sensors do not necessarily result in more appropriate state descriptions, so that a mobile robot has to select an appropriate set of sensors besides learning a state-action function in a reinforcement learning environment. We present a multi-armed bandit formulation of the problem and apply it to mobile robot navigation task. We modified the reinforcement comparison method to suit our problem and build a system where the selection of optimal set of sensors and the learning of state-action functions are done simultaneously. Our approach is evaluated on a Khepera robot simulator and the results reveal that our approach works well as an integrated learning system to identify the best set of sensors and reduce learning time.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002