Home /Research /Smoothed Sarsa: Reinforcement learning for robot delivery tasks
LEARNING

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

Deepak Ramachandran, Rakesh Gupta

Year
2009
Citations
15

Abstract

Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to find a sequence of actions in real time to accomplish these tasks efficiently. We introduce a novel reinforcement learning algorithm called Smoothed Sarsa that learns a good policy for these delivery tasks by delaying the backup reinforcement step until the uncertainty in the state estimate improves. The state space is modeled by a Dynamic Bayesian Network and updated using a Region-based Particle Filter. We take advantage of the fact that only discrete (topological) representations of entity locations are needed for decision-making, to make the tracking and decision making more efficient. Our experiments show that policy search leads to faster task completion times as well as higher total reward compared to a manually crafted policy. Smoothed Sarsa learns a policy orders of magnitude faster than previous policy search algorithms. We demonstrate our results on the Player/Stage simulator and on the Pioneer robot.

Keywords

Reinforcement learningComputer scienceMarkov decision processTask (project management)State spaceParticle filterArtificial intelligenceRobotBackupState (computer science)

Related papers

Browse all LEARNING papers