首页 /研究 /Pointing-Based Object Recognition
HRI

Pointing-Based Object Recognition

Lukáš Hajdúch, Viktor Kocur

发表年份
2026
访问权限
开放获取

摘要

This paper presents a comprehensive pipeline for recognizing objects targeted by human pointing gestures using RGB images. As human-robot interaction moves toward more intuitive interfaces, the ability to identify targets of non-verbal communication becomes crucial. Our proposed system integrates several existing state-of-the-art methods, including object detection, body pose estimation, monocular depth estimation, and vision-language models. We evaluate the impact of 3D spatial information reconstructed from a single image and the utility of image captioning models in correcting classification errors. Experimental results on a custom dataset show that incorporating depth information significantly improves target identification, especially in complex scenes with overlapping objects. The modularity of the approach allows for deployment in environments where specialized depth sensors are unavailable.

关键词

cs.CV

相关论文

查看 HRI 分类全部论文