Imaginary Hindsight Experience Replay: Curious Model-based Learning for\n Sparse Reward Tasks
Robert McCarthy
- 发表年份
- 2021
- 引用次数
- 7
- 访问权限
- 开放获取
摘要
Model-based reinforcement learning is a promising learning strategy for\npractical robotic applications due to its improved data-efficiency versus\nmodel-free counterparts. However, current state-of-the-art model-based methods\nrely on shaped reward signals, which can be difficult to design and implement.\nTo remedy this, we propose a simple model-based method tailored for\nsparse-reward multi-goal tasks that foregoes the need for complicated reward\nengineering. This approach, termed Imaginary Hindsight Experience Replay,\nminimises real-world interactions by incorporating imaginary data into policy\nupdates. To improve exploration in the sparse-reward setting, the policy is\ntrained with standard Hindsight Experience Replay and endowed with\ncuriosity-based intrinsic rewards. Upon evaluation, this approach provides an\norder of magnitude increase in data-efficiency on average versus the\nstate-of-the-art model-free method in the benchmark OpenAI Gym Fetch Robotics\ntasks.\n
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002