Home /Research /Robust detection of dense small tea shoots across cultivars under occlusion and bud–leaf similarity for intelligent selective harvesting
OTHER

Robust detection of dense small tea shoots across cultivars under occlusion and bud–leaf similarity for intelligent selective harvesting

Decheng Liu, Pengfei Wang, Zhi Zhang, Yongzong Lu, Baijuan Wang, Yongguang Hu

Year
2025
Citations
2

Abstract

• YOLOv7-LEES detects dense, small, and shaded tea shoots in real time (116.3 FPS) for field-robot deployment by integrating Efficient Channel Attention, Explicit Visual Center schemes, SIoU, and the lightweight RepNCSPELAN4 (–12.9 % parameters, –8.3 % computational cost). • YOLOv7-LEES demonstrates exceptional performance in tea shoot detection, achieving the highest precision (92.0 %) and recall (82.2 %) compared to other mainstream models. It maintains strong performance across various cultivars, weather, and lighting conditions, making it highly suitable for real-world automated tea harvesting applications. • In scenarios involving occlusion, overlap, and bud-leaf similarity, the YOLOv7-LEES model achieves a recall rate increase of 15.1 %, 14.8 %, and 20.9 %, respectively, while maintaining 93.9 % mAP and 110.8 FPS in high-density conditions, showcasing exceptional detection accuracy and speed. Accurate and robust recognition of dense small tea shoots in complex fields is critical for automated picking, but occlusion and bud–leaf similarity degrade detection accuracy. To address these challenges, the YOLOv7-LEES model is proposed. The backbone integrates Efficient Channel Attention (ECA) to improve recall on dense small objects. The Neck network incorporates the Explicit Visual Centre (EVC) to capture complementary global and local features, increasing sensitivity and reducing misclassification under occlusion, while RepNet Cross-Stage Partial Attention ELAN4 Network (RepNCSPELAN4) reduces computation cost and parameters. SCYLLA-IoU (SIoU) loss with angle cost reduces localisation uncertainty and speeds convergence. YOLOv7-LEES achieves 92.0 % precision, 82.2 % recall, 90.3 % mAP 0.5 , and 88.0 % F1 score, with 35.5 M parameters and 116.3 FPS. Ablation studies show that EVC increases mAP 0.5SS by 9.8 %, ECA enhances R all by 2.3 %, and SIoU improves P SS by 4.9 %, while RepNCSPELAN4 reduces parameters by 12.9 % and computational costs by 8.3 %. Compared to the classical two-stage detectors (Faster R-CNN) and the most recent single-stage detector (YOLOv11m and YOLOv12m), YOLOv7-LEES offers advantages in both detection accuracy and speed. On a multi-cultivar dataset, YOLOv7-LEES achieves 93.35 % mAP 0.5 and demonstrates consistent performance across varying weather conditions. It robust to bud overlap, occlusion by old leaves, and similarity between buds and leaves, demonstrating adaptability from sparse to dense shoot distributions, achieving 82.0 % mAP 0.5 for small and 93.9 % mAP 0.5 for large quantities. YOLOv7-LEES achieves robust dense tea-shoot detection under field conditions, supporting precise and efficient tea-picking automation.

Keywords

ShootSimilarity (geometry)Pattern recognition (psychology)Channel (broadcasting)ComputationPrecision and recall

Related papers

Browse all OTHER papers