Sparse Gaussian Processes-based Black-Box Data-efficient Policy Search for Robotics
Chunyan Rong, Jingyi Huang, André Rosendo
- 发表年份
- 2021
- 引用次数
- 3
摘要
While searching for solutions for Reinforcement Learning problems Policy Search algorithms crawl through a Search Space to efficiently find a successful policy. In contrast to Deep learning methods, which rely on hundreds of trials to tune weights, such efficient methods rely on very few trials to find a solution. In this work we introduce a variant of the method Black-box Data Efficient Policy Search for Robotics, which originally relied on a Gaussian Process method to learn a dynamics model. Our algorithm is called Black-DROPS-SGP and it adopts a more efficient Gaussian Process called Sparse Gaussian Processes, and we prove the superiority of our method by performing a comparison with Deep PILCO and Black-DROPS, two highly efficient Policy Search algorithms. We use cart-pole systems with one, two and three rotating joints to assess how our method fares when dealing with different complexities, and our results show that our method can find the optimal solution with virtually the same number of trials as the current state-of-the-art, but requires much less computational time between iterations to find this solution. In fields where trials are expensive (e.g. drug discovery, robotics) a higher number of trials can lead to broken robots before new behaviors are learnt, or too many patients injected with an ineffective drug before a cure is found.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002