Home /Research /Optimal Policies for Partially Observable Markov Decision Processes

OTHER

Optimal Policies for Partially Observable Markov Decision Processes

Anthony R. Cassandra

Year: 1994
Citations: 68

Abstract

this paper, we will be exploring a specific model-based scheme in which decisions need to be made. Even when we cannot assume we have the model, we can use techniques, [2], that allow us to approximate the model and then apply these model-based schemes. The many problems associated with such models will be outlined in a subsequent section. Throughout this discussion, the term agent will refer to the automated process that has to make decisions. A convenient example of an agent is that of an autonomous robot trying to survive in a real world environment. However, the agent can simply be a computer program such as one that does medical diagnosis. In this case, the model of the world might be based upon statistics and medical research.

Keywords

Partially observable Markov decision processObservableMarkov decision processComputer scienceWitnessMathematical optimizationMarkov processMarkov chainAlgorithmMarkov model

Optimal Policies for Partially Observable Markov Decision Processes

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control