首页 /研究 /Gaze2Act：基于注视条件的视觉-语言-行动策略用于交互式机器人操作

HRI

Gaze2Act：基于注视条件的视觉-语言-行动策略用于交互式机器人操作

Kuangji Zuo, Gen Li, Bofan Lyu, Yanshuo Lu, Boyu Ma, Shijia Han, Xinyu Zhou, Xichen Yuan, Chuhao Zhou, Jiaqi Bai, Geng Li, Jianfei Yang

发表年份: 2026
访问权限: 开放获取

摘要

本文提出Gaze2Act框架，通过人类注视作为动态意图信号，结合跨视角语义匹配将第一人称注视映射到机器人视角，实现粗到细的目标指定。在Unitree G1人形机器人上的16项真实任务中，该方法在意图准确率和任务成功率上均达到最先进水平。

关键词

human gazeVLAintent specificationinteractive manipulationhumanoid

相关论文

HRI

📊 3,196 引用

The Uncanny Valley [From the Field]

Masahiro Mori, Karl F. MacDorman, Norri Kageki

2012

HRI

开放获取📊 3,034 引用

Measurement Instruments for the Anthropomorphism, Animacy, Likeability, Perceived Intelligence, and Perceived Safety of Robots

Christoph Bartneck, Dana Kulić, Elizabeth A. Croft 等 4 位作者

2008

📄 PDF 详情 →

HRI

📊 1,925 引用

The development of Honda humanoid robot

Kazuo Hirai, Masato Hirose, Y. Haikawa 等 4 位作者

2002

HRI

📊 1,914 引用

A Meta-Analysis of Factors Affecting Trust in Human-Robot Interaction

Peter A. Hancock, Deborah R. Billings, Kristin E. Schaefer 等 6 位作者

2011

查看 HRI 分类全部论文