Multi-task Reinforcement Learning with a Planning Quasi-Metric

Vincent Micheli, Karthigan Sinnathamby, François Fleuret

发表年份: 2020
访问权限: 开放获取

摘要

We introduce a new reinforcement learning approach combining a planning quasi-metric (PQM) that estimates the number of steps required to go from any state to another, with task-specific "aimers" that compute a target state to reach a given goal. This decomposition allows the sharing across tasks of a task-agnostic model of the quasi-metric that captures the environment's dynamics and can be learned in a dense and unsupervised manner. We achieve multiple-fold training speed-up compared to recently published methods on the standard bit-flip problem and in the MuJoCo robotic arm simulator.

关键词

cs.LGstat.ML

Multi-task Reinforcement Learning with a Planning Quasi-Metric

摘要

关键词

相关论文

The Organization of Behavior

Fractional Brownian Motions, Fractional Noises and Applications

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

A guide to deep learning in healthcare