Home /Research /Adaptive Exploration through Covariance Matrix Adaptation Enables Developmental Motor Learning
LEARNING

Adaptive Exploration through Covariance Matrix Adaptation Enables Developmental Motor Learning

Freek Stulp, Pierre‐Yves Oudeyer

Year
2012
Citations
5
Access
Open access

Abstract

Abstract The “Policy Improvement with Path Integrals” (PI 2 ) [25] and “Covariance Matrix Adaptation - Evolutionary Strategy” [8] are considered to be state-of-the-art in direct reinforcement learning and stochastic optimization respectively. We have recently shown that incorporating covariance matrix adaptation into PI 2 – which yields the PI CMA 2 algorithm – enables adaptive exploration by continually and autonomously reconsidering the exploration/exploitation trade-off. In this article, we provide an overview of our recent work on covariance matrix adaptation for direct reinforcement learning [22–24], highlight its relevance to developmental robotics, and conduct further experiments to analyze the results. We investigate two complementary phenomena from developmental robotics. First, we demonstrate PI CMA 2 ’s ability to adapt to slowly or abruptly changing tasks due to its continual and adaptive exploration. This is an important component of life-long skill learning in dynamic environments. Second, we show on a reaching task PI CMA 2 how subsequently releases degrees of freedom from proximal to more distal limbs as learning progresses. A similar effect is observed in human development, where it is known as ‘proximodistal maturation’.

Keywords

CMA-ESAdaptation (eye)Reinforcement learningArtificial intelligenceCovariance matrixComputer scienceRoboticsCovarianceEvolution strategyTask (project management)

Related papers

Browse all LEARNING papers