首页 /研究 /Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

LEARNING

Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

Vibhavari Dasagi, Robert Lee, Jake Bruce, Jürgen Leitner

发表年份: 2019
访问权限: 开放获取

摘要

Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many of these algorithms, but generating robot experience in the real world is expensive, especially when each task requires a lengthy online training procedure. Off-policy algorithms can in principle learn arbitrary tasks from a diverse enough fixed dataset. In this work, we evaluate popular exploration methods by generating robotics datasets for the purpose of learning to solve tasks completely offline without any further interaction in the real world. We present results on three popular continuous control tasks in simulation, as well as continuous control of a high-dimensional real robot arm. Code documenting all algorithms, experiments, and hyper-parameters is available at https://github.com/qutrobotlearning/batchlearning.

关键词

cs.LGcs.ROstat.ML

Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

摘要

关键词

相关论文

The Organization of Behavior

Fractional Brownian Motions, Fractional Noises and Applications

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

A guide to deep learning in healthcare