首页 /研究 /Non-Markovian policies in sequential decision problems
OTHER

Non-Markovian policies in sequential decision problems

Csaba Szepesvári

发表年份
1998
引用次数
3

摘要

In this article we prove the validity of the Dellman Optimality Equation a.nd related results for sequential decision problems with a general recursive structure. The characteristic feature of our approach is that also non-Markovian policies are taken into account. The theory is motivated The theory of sequential decision problems is an important mathematical tool for studying some problems of cybernetics, e.g. control of robots. Consider for example the robot shmvn in Figure 1. This robot, called Khepera1, is equipped with eight infra-red sensors, six in the front and two at the back, the infra-red

关键词

Markov processComputer scienceFeature (linguistics)Markov decision processMathematical optimizationDecision problemArtificial intelligenceMathematicsAlgorithmStatistics

相关论文

查看 OTHER 分类全部论文