首页 /研究 /Control Policy with Autocorrelated Noise in Reinforcement Learning for Robotics

LEARNING

Control Policy with Autocorrelated Noise in Reinforcement Learning for Robotics

Paweł Wawrzyński

发表年份: 2015
引用次数: 41
访问权限: 开放获取

摘要

Direct application of reinforcement learning in robotics rises the issue of discontinuity of control signal. Consecutive actions are selected independently on random, which often makes them excessively far from one another. Such control is hardly ever appropriate in robots, it may even lead to their destruction. This paper considers a control policy in which consecutive actions are modified by autocorrelated noise. That policy generally solves the aforementioned problems and it is readily applicable in robots. In the experimental study it is applied to three robotic learning control tasks: Cart-Pole SwingUp, Half-Cheetah, and a walking humanoid.

关键词

Computer scienceReinforcement learningNoise (video)RoboticsArtificial intelligenceAutocorrelationControl (management)Speech recognitionRobotStatistics

Control Policy with Autocorrelated Noise in Reinforcement Learning for Robotics

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory