首页 /研究 /Adaptive Exploration through Covariance Matrix Adaptation Enables Developmental Motor Learning
LEARNING

Adaptive Exploration through Covariance Matrix Adaptation Enables Developmental Motor Learning

Freek Stulp, Pierre‐Yves Oudeyer

发表年份
2012
引用次数
5
访问权限
开放获取

摘要

Abstract The “Policy Improvement with Path Integrals” (PI 2 ) [25] and “Covariance Matrix Adaptation - Evolutionary Strategy” [8] are considered to be state-of-the-art in direct reinforcement learning and stochastic optimization respectively. We have recently shown that incorporating covariance matrix adaptation into PI 2 – which yields the PI CMA 2 algorithm – enables adaptive exploration by continually and autonomously reconsidering the exploration/exploitation trade-off. In this article, we provide an overview of our recent work on covariance matrix adaptation for direct reinforcement learning [22–24], highlight its relevance to developmental robotics, and conduct further experiments to analyze the results. We investigate two complementary phenomena from developmental robotics. First, we demonstrate PI CMA 2 ’s ability to adapt to slowly or abruptly changing tasks due to its continual and adaptive exploration. This is an important component of life-long skill learning in dynamic environments. Second, we show on a reaching task PI CMA 2 how subsequently releases degrees of freedom from proximal to more distal limbs as learning progresses. A similar effect is observed in human development, where it is known as ‘proximodistal maturation’.

关键词

CMA-ESAdaptation (eye)Reinforcement learningArtificial intelligenceCovariance matrixComputer scienceRoboticsCovarianceEvolution strategyTask (project management)

相关论文

查看 LEARNING 分类全部论文