Home /Research /Optimal Policies for Partially Observable Markov Decision Processes
OTHER

Optimal Policies for Partially Observable Markov Decision Processes

Anthony R. Cassandra

Year
1994
Citations
68

Abstract

this paper, we will be exploring a specific model-based scheme in which decisions need to be made. Even when we cannot assume we have the model, we can use techniques, [2], that allow us to approximate the model and then apply these model-based schemes. The many problems associated with such models will be outlined in a subsequent section. Throughout this discussion, the term agent will refer to the automated process that has to make decisions. A convenient example of an agent is that of an autonomous robot trying to survive in a real world environment. However, the agent can simply be a computer program such as one that does medical diagnosis. In this case, the model of the world might be based upon statistics and medical research.

Keywords

Partially observable Markov decision processObservableMarkov decision processComputer scienceWitnessMathematical optimizationMarkov processMarkov chainAlgorithmMarkov model

Related papers

Browse all OTHER papers