Training Musculoskeletal Arm Play Taichi with Deep Reinforcement Learning
Haoran Xu, Xiang Ma, Leiyang Xu, Qiang Wang
- Year
- 2020
- Citations
- 3
Abstract
Musculoskeletal robot arm driven by pneumatic artificial muscle actuators is secure, lightweight and compliant, which makes it an ideal solution for prosthetics and rehabilitation equipment. However, there exists conflicts between anatomical accuracy and engineering feasibility, and the nonlinearity of muscle actuators brings difficulty in accurate mathematical modeling. To overcome these problems, in this paper, we propose an optimized musculoskeletal arm design of ten muscles and four degrees-offreedom, employ Deep Deterministic Policy Gradient (DDPG) to train a data driven controller, and combine the off-policy reinforcement learning algorithm with Hindsight Experience Replay (HER) to deal with sparse rewards situation. TaiChi motion sequence is acquired using Perception Neuron and the trajectory tracking experiments are carried out in simulation platform. Experiment results demonstrate that, the controlled musculoskeletal arm is able to track the TaiChi trajectory and learn well in sparse rewards situation.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002