Fuzzy Q-learning: a new approach for fuzzy dynamic programming
H.R. Berenji
- 发表年份
- 1994
- 引用次数
- 86
摘要
Fuzzy reinforcement learning (FRL) involves "jump starting" reinforcement learning with fuzzy logic rules. By using FRL, prior domain knowledge, which may be very approximate and imprecise, can be expressed in terms of fuzzy rules and refined later through the learning process. In this paper, we develop a new algorithm called fuzzy Q-learning (or FQ-Learning) which extends Watkin's Q-learning method. It can be used for decision processes in which the goals and/or the constraints, but not necessarily the system under control, are fuzzy in nature. An example of a fuzzy constraint is: "the weight of object A must not be substantially heavier than w" where w is a specified weight. Similarly, an example of a fuzzy goal is: "the robot must be in the vicinity of door k". We show that FQ-learning provides an alternative solution to this problem which is simpler than the Bellman-Zadeh's fuzzy dynamic programming approach. We apply the algorithm to a multistage decision making problem and a navigation task.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002