Home /Research /From text to motion: grounding GPT-4 in a humanoid robot “Alter3”
OTHER

From text to motion: grounding GPT-4 in a humanoid robot “Alter3”

Takahide Yoshida, Atsushi Masumori, Takashi Ikegami

Year
2025
Citations
9
Access
Open access

Abstract

This paper introduces Alter3, a humanoid robot that demonstrates spontaneous motion generation through the integration of GPT-4, a cutting-edge Large Language Model (LLM). This integration overcomes the challenge of applying LLMs to direct robot control, which typically struggles with the hardware-specific nuances of robotic operation. By translating linguistic descriptions of human actions into robotic movements via programming, Alter3 can autonomously perform a diverse range of actions, such as adopting a "selfie" pose or simulating a "ghost." This approach not only shows Alter3's few-shot learning capabilities but also its adaptability to verbal feedback for pose adjustments without manual fine-tuning. This research advances the field of humanoid robotics by bridging linguistic concepts with physical embodiment and opens new avenues for exploring spontaneity in humanoid robots.

Keywords

Humanoid robotComputer scienceMotion (physics)Artificial intelligenceComputer visionRobotHuman–computer interaction

Related papers

Browse all OTHER papers