Precision citrus segmentation and stem picking point localization using improved YOLOv8n-seg algorithm
Han Li, Zirui Yin, Zhijiang Zuo, Junfeng Zhang
- 发表年份
- 2025
- 引用次数
- 2
- 访问权限
- 开放获取
摘要
Introduction: Due to the small size of citrus stems, their color similarity to the background, and their variable position relative to the fruit, accurately locating picking points using robots in natural environments presents significant challenges. Methods: To address this issue, this study proposes a method for segmenting citrus fruits and stems based on an improved YOLOv8n-seg model, combined with geometric constraints for stem matching to achieve accurate localization of picking points. First, all standard convolutions in the model are replaced with GhostConv to reduce the number of model parameters. Furthermore, a convolutional block attention module (CBAM) and a small-object detection layer are introduced to enhance the model's feature representation and segmentation accuracy for small objects. Then, by incorporating the positional relationship between the fruit and the stem, constraints are defined to match the target stem, and an algorithm is designed to determine the optimal picking point. Results: Experimental results show that the improved YOLOv8n-seg model achieves recall rates of 90.91% for fruits and stems, a mean average precision (mAP50) of 94.43%, and an F1-score of 93.51%. The precision rates for fruit and stem segmentation are 96.04% and 97.12%, respectively. The average detection rate of picking points reaches 88.38%, with an average localization time of 373.25 milliseconds under GPU support, demonstrating high real-time performance. Compared with other models, the improved YOLOv8n-seg model shows significantly better performance. Discussion: This study confirms the reliability and effectiveness of the proposed citrus picking point localization method and lays a technical foundation for the automated harvesting of citrus fruits.
关键词
相关论文
Artificial intelligence: a modern approach
1995
Are we ready for autonomous driving? The KITTI vision benchmark suite
Andreas Geiger, P Lenz, R. Urtasun
2012
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martı́n Abadi, Ashish Agarwal, Paul Barham 等 20 位作者
2016
Vision meets robotics: The KITTI dataset
Andreas Geiger, Philip Lenz, Christoph Stiller 等 4 位作者
2013