Multi-performance index reinforcement learning training of beaver-like robot
Gang Chen, Hanhan Xue, Xianghui Meng, Zhi-Han Zhao, Zhen Liu
- 发表年份
- 2025
- 引用次数
- 9
摘要
Abstract The underwater environment is characterized by its inherent complexity and dynamics, leading to substantial interference with the precision of underwater measurement data. To enhance the precision of underwater data measurements, underwater robotic platforms necessitate improved motion and stability characteristics. As amphibian mammals, beavers possess excellent amphibious abilities and a wide range of environmental adaptability. Based on the observation of biological morphology and hind limb fin structure of beavers, this study analyses their swimming mechanism and designs a beaver-like robot. This study introduces an efficient control algorithm designed for a beaver-like robot platform. The algorithm integrates reinforcement learning with conservative Q-learning, model-based policy optimization and deep Q-network methods to facilitate offline training of the robot. A training weight allocation system is employed to enhance adaptability across diverse swimming conditions in the complex underwater setting. Simulating the robot’s underwater environment, the algorithm has demonstrated effective training in both speed and stability. The pitch angle is successfully stabilized between −0.245 and 0.305 rad, while the robot’s speed reaches up to 0.38 m·s −1 .
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002