Non-Markovian policies in sequential decision problems
Csaba Szepesvári
- Publication year
- 1998
- Citations
- 3
Abstract
In this article we prove the validity of the Bellman Optimality Equation and related results for sequential decision problems with a general recursive structure. The characteristic feature of our approach is that non-Markovian policies are also taken into account. The theory is motivated … The theory of sequential decision problems is an important mathematical tool for studying some problems of cybernetics, e.g. control of robots. Consider for example the robot shown in Figure 1. This robot, called Khepera, is equipped with eight infra-red sensors, six in the front and two at the back, the infra-red …
Keywords
Markov process, Computer science, Feature (linguistics), Markov decision process, Mathematical optimization, Decision problem, Artificial intelligence, Mathematics, Algorithm, Statistics