首页 /研究 /Bayesian Optimization with Automatic Prior Selection for Data-Efficient\n Direct Policy Search
LOCOMOTION

Bayesian Optimization with Automatic Prior Selection for Data-Efficient\n Direct Policy Search

Rémi Pautrat, Konstantinos Chatzilygeroudis, Jean-Baptiste Mouret

发表年份
2017
引用次数
34
访问权限
开放获取

摘要

One of the most interesting features of Bayesian optimization for direct\npolicy search is that it can leverage priors (e.g., from simulation or from\nprevious tasks) to accelerate learning on a robot. In this paper, we are\ninterested in situations for which several priors exist but we do not know in\nadvance which one fits best the current situation. We tackle this problem by\nintroducing a novel acquisition function, called Most Likely Expected\nImprovement (MLEI), that combines the likelihood of the priors and the expected\nimprovement. We evaluate this new acquisition function on a transfer learning\ntask for a 5-DOF planar arm and on a possibly damaged, 6-legged robot that has\nto learn to walk on flat ground and on stairs, with priors corresponding to\ndifferent stairs and different kinds of damages. Our results show that MLEI\neffectively identifies and exploits the priors, even when there is no obvious\nmatch between the current situations and the priors.\n

关键词

Prior probabilityLeverage (statistics)Computer scienceArtificial intelligenceMachine learningExploitBayesian probabilityBayesian optimizationTask (project management)Selection (genetic algorithm)

相关论文

查看 LOCOMOTION 分类全部论文