首页 /研究 /From text to motion: grounding GPT-4 in a humanoid robot “Alter3”
OTHER

From text to motion: grounding GPT-4 in a humanoid robot “Alter3”

Takahide Yoshida, Atsushi Masumori, Takashi Ikegami

发表年份
2025
引用次数
9
访问权限
开放获取

摘要

This paper introduces Alter3, a humanoid robot that demonstrates spontaneous motion generation through the integration of GPT-4, a cutting-edge Large Language Model (LLM). This integration overcomes the challenge of applying LLMs to direct robot control, which typically struggles with the hardware-specific nuances of robotic operation. By translating linguistic descriptions of human actions into robotic movements via programming, Alter3 can autonomously perform a diverse range of actions, such as adopting a "selfie" pose or simulating a "ghost." This approach not only shows Alter3's few-shot learning capabilities but also its adaptability to verbal feedback for pose adjustments without manual fine-tuning. This research advances the field of humanoid robotics by bridging linguistic concepts with physical embodiment and opens new avenues for exploring spontaneity in humanoid robots.

关键词

Humanoid robotComputer scienceMotion (physics)Artificial intelligenceComputer visionRobotHuman–computer interaction

相关论文

查看 OTHER 分类全部论文