PERCEPTION

Monocular 3D Object Localization using 2D Estimates for Industrial Robot Vision System

Thanh Nguyen Canh, Du Trinh Ngoc, Xiem HoangVan

Year of publication: 2025
Citations: 1
Access: Open access

Abstract

3D object localization has emerged as one of the main challenges in machine vision. In this paper, we propose a novel 3D object localization method that combines deep learning-based object detection, image post-processing, and pose estimation algorithms. Our approach involves 3D calibration methods tailored for low-cost industrial robotics systems and requires only a single 2D image as input. First, object detection is performed using the You Only Look Once (YOLO) model, followed by an R-CNN model that segments the object into two distinct parts: the top face and the remainder. Next, the center of the top face serves as the initial position estimate, which is then refined with a novel calibration algorithm. Experimental results demonstrate an 87.65% reduction in localization error compared to existing methods.
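
As a rough illustration of the initialization step only, the sketch below takes the centroid of a top-face segmentation mask and back-projects it to a 3D point. It is not the paper's calibration algorithm, which the abstract does not describe. It assumes an ideal pinhole camera with known intrinsics (`fx`, `fy`, `cx`, `cy`) and a known depth to the top face; the function names and all numeric values are hypothetical.

```python
import numpy as np


def mask_centroid(mask):
    """Return the (u, v) pixel centroid of a non-empty binary mask."""
    vs, us = np.nonzero(mask)  # row indices (v) and column indices (u)
    if us.size == 0:
        raise ValueError("mask is empty")
    return float(us.mean()), float(vs.mean())


def backproject(u, v, fx, fy, cx, cy, depth):
    """Back-project pixel (u, v) to camera coordinates at a known depth.

    Assumes an ideal pinhole model with no lens distortion.
    """
    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    return np.array([x, y, depth])


# Toy stand-in for a segmented top face: rows 1..3, columns 2..5.
mask = np.zeros((6, 8), dtype=bool)
mask[1:4, 2:6] = True

u, v = mask_centroid(mask)
# Hypothetical intrinsics and depth, chosen only for the example.
point = backproject(u, v, fx=100.0, fy=100.0, cx=4.0, cy=3.0, depth=0.5)
```

In a real system the mask would come from the segmentation model, and the resulting point would only be a starting estimate for a subsequent refinement step.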

Keywords

Object (grammar), Initialization, Calibration, Industrial robot, Object detection, Face (sociological concept), Pose, Robotics, Machine vision
