Improved QT-Opt Algorithm for Robotic Arm Grasping Based on Offline Reinforcement Learning

Zhang Haojun, Sheng Zeng, Y. R. Hou, Haojie Huang, Zhezhuang Xu

Publication year: 2025
Citations: 1
Access: Open access

Abstract

Reinforcement learning plays a crucial role in robotic arm grasping, offering a promising route to intelligent and adaptive grasping strategies. Because of distribution shift and local optima in action selection, traditional online reinforcement learning struggles to make use of existing grasping datasets, which leads to low sample efficiency. This study proposes an improved QT-Opt algorithm for robotic arm grasping based on offline reinforcement learning. The improved algorithm uses Particle Swarm Optimization (PSO) to identify the highest-value action within the robotic arm's action space. In addition, a regularization term is introduced into the value iteration process so that a conservative Q-function is learned, enabling precise estimation of the robotic arm's action values. Experimental results indicate that the improved QT-Opt algorithm achieves higher average grasping success rates when trained on multiple offline grasping datasets and is more stable throughout training.

Keywords

Reinforcement learning; Robotic arm; Computer science; Reinforcement; Artificial intelligence; Algorithm; Engineering; Structural engineering
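
Illustrative sketch

The abstract describes using PSO to find the highest-value action in the arm's action space. The following is a minimal sketch of that idea only, not the authors' implementation: `pso_argmax`, `toy_q`, the swarm size, the inertia and acceleration coefficients, and the two-dimensional box-shaped action space are all assumptions made for illustration. In practice `toy_q` would be replaced by the learned Q-network evaluated at the current state.

```python
import random


def pso_argmax(q_fn, low, high, n_particles=30, n_iters=50,
               inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Approximate argmax_a q_fn(a) over the box [low, high] with basic PSO.

    All hyperparameters are illustrative defaults, not values from the paper.
    """
    rng = random.Random(seed)
    dim = len(low)

    # Start particles uniformly inside the action bounds, with zero velocity.
    pos = [[rng.uniform(low[d], high[d]) for d in range(dim)]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]

    # Each particle remembers its own best action; the swarm shares a global best.
    best_pos = [p[:] for p in pos]
    best_val = [q_fn(p) for p in pos]
    g = max(range(n_particles), key=lambda i: best_val[i])
    g_pos, g_val = best_pos[g][:], best_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity blends momentum, pull to personal best, pull to global best.
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (g_pos[d] - pos[i][d]))
                # Move, then clamp so the action stays inside the valid range.
                pos[i][d] = min(high[d], max(low[d], pos[i][d] + vel[i][d]))
            val = q_fn(pos[i])
            if val > best_val[i]:
                best_val[i] = val
                best_pos[i] = pos[i][:]
                if val > g_val:
                    g_val = val
                    g_pos = pos[i][:]
    return g_pos, g_val


def toy_q(action):
    # Hypothetical stand-in for Q(s, a) at a fixed state; its maximum is at (0.3, -0.2).
    return -((action[0] - 0.3) ** 2 + (action[1] + 0.2) ** 2)


best_action, best_value = pso_argmax(toy_q, low=[-1.0, -1.0], high=[1.0, 1.0])
print(best_action, best_value)
```

Because the search only queries `q_fn`, it needs no gradient of the Q-function with respect to the action. The conservative regularization term mentioned in the abstract is not shown here, since the abstract does not give its form.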
