Language-Conditioned Offline RL for Multi-Robot Navigation

Steven Morad, Ajay Shankar, Jan Blumenkamp, Amanda Prorok

发表年份: 2025
引用次数: 3

摘要

We present a method for synthesizing navigation policies for multi-robot teams that interpret and follow natural language instructions. We condition these policies on embeddings from pretrained Large Language Models (LLMs), and train them via offline reinforcement learning with as little as 20 minutes of randomly-collected real-world data. Experiments on a team of five real robots show that these policies generalize well to unseen commands, indicating an understanding of the LLM latent space. Our method requires no simulators or environment models, and produces low-latency control policies that can be deployed directly to real robots without finetuning. We provide videos of our experiments at https://sites.google.com/view/llm-marl.

关键词

Computer scienceRobotMobile robotArtificial intelligenceHuman–computer interactionComputer visionReal-time computing

Language-Conditioned Offline RL for Multi-Robot Navigation

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory