首页 /研究 /Embodied large language models enable robots to complete complex tasks in unpredictable environments
OTHER

Embodied large language models enable robots to complete complex tasks in unpredictable environments

Ruaridh Mon-Williams, Gen Li, Ran Long, Wenqian Du, Christopher G. Lucas

发表年份
2025
引用次数
67
访问权限
开放获取

摘要

Completing complex tasks in unpredictable settings challenges robotic systems, requiring a step change in machine intelligence. Sensorimotor abilities are considered integral to human intelligence. Thus, biologically inspired machine intelligence might usefully combine artificial intelligence with robotic sensorimotor capabilities. Here we report an embodied large-language-model-enabled robot (ELLMER) framework, utilizing GPT-4 and a retrieval-augmented generation infrastructure, to enable robots to complete long-horizon tasks in unpredictable settings. The method extracts contextually relevant examples from a knowledge base, producing action plans that incorporate force and visual feedback and enabling adaptation to changing conditions. We tested ELLMER on a robot tasked with coffee making and plate decoration; these tasks consist of a sequence of sub-tasks from drawer opening to pouring, each benefiting from distinct feedback types and methods. We show that the ELLMER framework allows the robot to complete the tasks. This demonstration marks progress towards scalable, efficient and 'intelligent robots' able to complete complex tasks in uncertain environments.

关键词

Embodied cognitionRobotComputer scienceHuman–computer interactionArtificial intelligence

相关论文

查看 OTHER 分类全部论文