首页 /研究 /RT-DETR Optimization with Efficiency-Oriented Backbone and Adaptive Scale Fusion for Precise Pomegranate Detection
OTHER

RT-DETR Optimization with Efficiency-Oriented Backbone and Adaptive Scale Fusion for Precise Pomegranate Detection

Jun Yuan, Jing Fan, Hui Liu, W. C. Yan, Donghan Li, Hongtao Liu, Dongyan Huang

发表年份
2025
引用次数
2

摘要

To develop a high-performance detection system for automated harvesting on resource-limited edge devices, we introduce FSA-DETR-P, a lightweight detection framework that addresses challenges such as illumination inconsistency, occlusion, and scale variation in complex orchard environments. Unlike traditional computationally intensive architectures, this model optimizes real-time detection transformers by integrating an efficient backbone for fast feature extraction, a simplified aggregation structure to minimize complexity, and an adaptive mechanism for multi-scale feature fusion. The optimized backbone improves early-stage texture extraction while reducing computational demands. The streamlined aggregation design enhances multi-level interactions without losing spatial detail, and the adaptive fusion module strengthens the detection of small, partially occluded, or ambiguous fruits. We created a domain-specific pomegranate dataset, expanded to 13,840 images with a rigorous 8:1:1 split for training, validation, and testing. The results show that the pruned and optimized model achieves a Mean Average Precision (mAP50) of 0.928 and mAP50–95 of 0.632 with reduced parameters (13.73 M) and lower computational costs (34.6 GFLOPs). It operates at 24.6 FPS on an NVIDIA Jetson Orin Nano, indicating a strong balance between accuracy and deployability, making it well-suited for orchard monitoring and robotic harvesting in real-world applications.

关键词

FusionFeature extractionComputational complexity theoryPattern recognition (psychology)Edge detectionEnhanced Data Rates for GSM EvolutionScale (ratio)Feature (linguistics)

相关论文

查看 OTHER 分类全部论文