OTHER
Non-Markovian policies in sequential decision problems
Csaba Szepesvári
- Year
- 1998
- Citations
- 3
Abstract
In this article we prove the validity of the Dellman Optimality Equation a.nd related results for sequential decision problems with a general recursive structure. The characteristic feature of our approach is that also non-Markovian policies are taken into account. The theory is motivated The theory of sequential decision problems is an important mathematical tool for studying some problems of cybernetics, e.g. control of robots. Consider for example the robot shmvn in Figure 1. This robot, called Khepera1, is equipped with eight infra-red sensors, six in the front and two at the back, the infra-red
Keywords
Markov processComputer scienceFeature (linguistics)Markov decision processMathematical optimizationDecision problemArtificial intelligenceMathematicsAlgorithmStatistics
Related papers
OTHER
📊 26,957 cites
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
PERCEPTION
📊 22,245 cites
Artificial intelligence: a modern approach
1995
OTHER
Open access📊 20,501 cites
Fractional Differential Equations
Igor Podlubný
2025
OTHER
📊 18,993 cites
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991