首页 /研究 /Multilayer Perceptrons with Radial Basis Functions as Value Functions in Reinforcement Learning.
LEARNING

Multilayer Perceptrons with Radial Basis Functions as Value Functions in Reinforcement Learning.

Victor Uc Cetina

发表年份
2008
引用次数
6

摘要

Abstract. Using multilayer perceptrons (MLPs) to approximate the state-action value function in reinforcement learning (RL) algorithms could become a nightmare due to the constant possibility of unlearning past experiences. Moreover, since the target values in the training examples are bootstraps values, this is, estimates of other estimates, the chances to get stuck in a local minimum are increased. These problems occur very often in the mountain car task, as showed by Boyan and Moore [2]. In this paper we present empirical evidence showing that MLPs augmented with one layer of radial basis functions (RBFs) can avoid these problems. Our experimental testbeds are the mountain car task and a robot control problem. 1

关键词

PerceptronReinforcement learningComputer scienceRadial basis functionArtificial intelligenceTask (project management)Bellman equationConstant (computer programming)Function (biology)Basis (linear algebra)

相关论文

查看 LEARNING 分类全部论文