首页 /研究 /Concurrent Training of a Control Policy and a State Estimator for Dynamic and Robust Legged Locomotion

LOCOMOTION

Concurrent Training of a Control Policy and a State Estimator for Dynamic and Robust Legged Locomotion

Gwanghyeon Ji, Juhyeok Mun, Hyeongjun Kim, Jemin Hwangbo

发表年份: 2022
访问权限: 开放获取

摘要

In this paper, we propose a locomotion training framework where a control policy and a state estimator are trained concurrently. The framework consists of a policy network which outputs the desired joint positions and a state estimation network which outputs estimates of the robot's states such as the base linear velocity, foot height, and contact probability. We exploit a fast simulation environment to train the networks and the trained networks are transferred to the real robot. The trained policy and state estimator are capable of traversing diverse terrains such as a hill, slippery plate, and bumpy road. We also demonstrate that the learned policy can run at up to 3.75 m/s on normal flat ground and 3.54 m/s on a slippery plate with the coefficient of friction of 0.22.

关键词

cs.ROcs.LGeess.SY

Concurrent Training of a Control Policy and a State Estimator for Dynamic and Robust Legged Locomotion

摘要

关键词

相关论文

Trust Region Policy Optimization

Legged Robots That Balance

Being there: putting brain, body, and world together again

Small-scale soft-bodied robot with multimodal locomotion