首页 /研究 /Embodied large language models enable robots to complete complex tasks in unpredictable environments

OTHER

Embodied large language models enable robots to complete complex tasks in unpredictable environments

Ruaridh Mon-Williams, Gen Li, Ran Long, Wenqian Du, Christopher G. Lucas

发表年份: 2025
引用次数: 67
访问权限: 开放获取

摘要

Completing complex tasks in unpredictable settings challenges robotic systems, requiring a step change in machine intelligence. Sensorimotor abilities are considered integral to human intelligence. Thus, biologically inspired machine intelligence might usefully combine artificial intelligence with robotic sensorimotor capabilities. Here we report an embodied large-language-model-enabled robot (ELLMER) framework, utilizing GPT-4 and a retrieval-augmented generation infrastructure, to enable robots to complete long-horizon tasks in unpredictable settings. The method extracts contextually relevant examples from a knowledge base, producing action plans that incorporate force and visual feedback and enabling adaptation to changing conditions. We tested ELLMER on a robot tasked with coffee making and plate decoration; these tasks consist of a sequence of sub-tasks from drawer opening to pouring, each benefiting from distinct feedback types and methods. We show that the ELLMER framework allows the robot to complete the tasks. This demonstration marks progress towards scalable, efficient and 'intelligent robots' able to complete complex tasks in uncertain environments.

关键词

Embodied cognitionRobotComputer scienceHuman–computer interactionArtificial intelligence

Embodied large language models enable robots to complete complex tasks in unpredictable environments

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control