首页 /研究 /Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics

LEARNING

Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics

Tomoki Nishi, Prashant Doshi, Michael R. James, Danil Prokhorov

发表年份: 2017
访问权限: 开放获取

摘要

In many robotic applications, some aspects of the system dynamics can be modeled accurately while others are difficult to obtain or model. We present a novel reinforcement learning (RL) method for continuous state and action spaces that learns with partial knowledge of the system and without active exploration. It solves linearly-solvable Markov decision processes (L-MDPs), which are well suited for continuous state and action spaces, based on an actor-critic architecture. Compared to previous RL methods for L-MDPs and path integral methods which are model based, the actor-critic learning does not need a model of the uncontrolled dynamics and, importantly, transition noise levels; however, it requires knowing the control dynamics for the problem. We evaluate our method on two synthetic test problems, and one real-world problem in simulation and using real traffic data. Our experiments demonstrate improved learning and policy performance.

关键词

cs.AI

Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics

摘要

关键词

相关论文

The Organization of Behavior

Fractional Brownian Motions, Fractional Noises and Applications

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

A guide to deep learning in healthcare