RT-DETR Optimization with Efficiency-Oriented Backbone and Adaptive Scale Fusion for Precise Pomegranate Detection
Jun Yuan, Jing Fan, Hui Liu, W. C. Yan, Donghan Li, Hongtao Liu, Dongyan Huang
- 发表年份
- 2025
- 引用次数
- 2
摘要
To develop a high-performance detection system for automated harvesting on resource-limited edge devices, we introduce FSA-DETR-P, a lightweight detection framework that addresses challenges such as illumination inconsistency, occlusion, and scale variation in complex orchard environments. Unlike traditional computationally intensive architectures, this model optimizes real-time detection transformers by integrating an efficient backbone for fast feature extraction, a simplified aggregation structure to minimize complexity, and an adaptive mechanism for multi-scale feature fusion. The optimized backbone improves early-stage texture extraction while reducing computational demands. The streamlined aggregation design enhances multi-level interactions without losing spatial detail, and the adaptive fusion module strengthens the detection of small, partially occluded, or ambiguous fruits. We created a domain-specific pomegranate dataset, expanded to 13,840 images with a rigorous 8:1:1 split for training, validation, and testing. The results show that the pruned and optimized model achieves a Mean Average Precision (mAP50) of 0.928 and mAP50–95 of 0.632 with reduced parameters (13.73 M) and lower computational costs (34.6 GFLOPs). It operates at 24.6 FPS on an NVIDIA Jetson Orin Nano, indicating a strong balance between accuracy and deployability, making it well-suited for orchard monitoring and robotic harvesting in real-world applications.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Fractional Differential Equations
Igor Podlubný
2025
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
Genetic Programming: On the Programming of Computers by Means of Natural Selection
John R. Koza
1992