首页 /研究 /Research on Apple Object Detection and Localization Method Based on Improved YOLOX and RGB-D Images
PERCEPTION

Research on Apple Object Detection and Localization Method Based on Improved YOLOX and RGB-D Images

Tiantian Hu, Wenbo Wang, Jinan Gu, Zilin Xia, Jian Zhang, Bo Wang

发表年份
2023
引用次数
34
访问权限
开放获取

摘要

The vision-based fruit recognition and localization system is the basis for the automatic operation of agricultural harvesting robots. Existing detection models are often constrained by high complexity and slow inference speed, which do not meet the real-time requirements of harvesting robots. Here, a method for apple object detection and localization is proposed to address the above problems. First, an improved YOLOX network is designed to detect the target region, with a multi-branch topology in the training phase and a single-branch structure in the inference phase. The spatial pyramid pooling layer (SPP) with serial structure is used to expand the receptive field of the backbone network and ensure a fixed output. Second, the RGB-D camera is used to obtain the aligned depth image and to calculate the depth value of the desired point. Finally, the three-dimensional coordinates of apple-picking points are obtained by combining two-dimensional coordinates in the RGB image and depth value. Experimental results show that the proposed method has high accuracy and real-time performance: F1 is 93%, mean average precision (mAP) is 94.09%, detection speed can reach 167.43 F/s, and the positioning errors in X, Y, and Z directions are less than 7 mm, 7 mm, and 5 mm, respectively.

关键词

Artificial intelligenceComputer visionRGB color modelComputer scienceObject detectionRobotPyramid (geometry)Pattern recognition (psychology)Mathematics

相关论文

查看 PERCEPTION 分类全部论文