Non-Markovian policies in sequential decision problems

Csaba Szepesvári

Year: 1998
Citations: 3

Abstract

In this article we prove the validity of the Dellman Optimality Equation a.nd related results for sequential decision problems with a general recursive structure. The characteristic feature of our approach is that also non-Markovian policies are taken into account. The theory is motivated The theory of sequential decision problems is an important mathematical tool for studying some problems of cybernetics, e.g. control of robots. Consider for example the robot shmvn in Figure 1. This robot, called Khepera1, is equipped with eight infra-red sensors, six in the front and two at the back, the infra-red

Keywords

Markov processComputer scienceFeature (linguistics)Markov decision processMathematical optimizationDecision problemArtificial intelligenceMathematicsAlgorithmStatistics

Non-Markovian policies in sequential decision problems

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control