首页 /研究 /Using dimensionality reduction to exploit constraints in reinforcement learning

LEARNING

Using dimensionality reduction to exploit constraints in reinforcement learning

Sebastian Bitzer, Matthew Howard, Sethu Vijayakumar

发表年份: 2010
引用次数: 42

摘要

Reinforcement learning in the high-dimensional, continuous spaces typical in robotics, remains a challenging problem. To overcome this challenge, a popular approach has been to use demonstrations to find an appropriate initialisation of the policy in an attempt to reduce the number of iterations needed to find a solution. Here, we present an alternative way to incorporate prior knowledge from demonstrations of individual postures into learning, by extracting the inherent problem structure to find an efficient state representation. In particular, we use probabilistic, nonlinear dimensionality reduction to capture latent constraints present in the data. By learning policies in the learnt latent space, we are able to solve the planning problem in a reduced space that automatically satisfies task constraints. As shown in our experiments, this reduces the exploration needed and greatly accelerates the learning. We demonstrate our approach for learning a bimanual reaching task on the 19-DOF KHR-1HV humanoid.

关键词

Reinforcement learningComputer scienceArtificial intelligenceDimensionality reductionExploitProbabilistic logicTask (project management)Curse of dimensionalityMachine learningReduction (mathematics)

Using dimensionality reduction to exploit constraints in reinforcement learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory