首页 /研究 /Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models

PERCEPTION

Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models

Nils Blank, Moritz Reuss, Marcel Rühle, Ömer Erdinç Yağmurlu, Fabian Wenzel, Oier Mees, Rudolf Lioutikov

发表年份: 2024
访问权限: 开放获取

摘要

A central challenge towards developing robots that can relate human language to their perception and actions is the scarcity of natural language annotations in diverse robot datasets. Moreover, robot policies that follow natural language instructions are typically trained on either templated language or expensive human-labeled instructions, hindering their scalability. To this end, we introduce NILS: Natural language Instruction Labeling for Scalability. NILS automatically labels uncurated, long-horizon robot data at scale in a zero-shot manner without any human intervention. NILS combines pretrained vision-language foundation models in order to detect objects in a scene, detect object-centric changes, segment tasks from large datasets of unlabelled interaction data and ultimately label behavior datasets. Evaluations on BridgeV2, Fractal, and a kitchen play dataset show that NILS can autonomously annotate diverse robot demonstrations of unlabeled and unstructured datasets while alleviating several shortcomings of crowdsourced human annotations, such as low data quality and diversity. We use NILS to label over 115k trajectories obtained from over 430 hours of robot data. We open-source our auto-labeling code and generated annotations on our website: http://robottasklabeling.github.io.

关键词

cs.ROcs.AIcs.CVcs.LG

Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models

摘要

关键词

相关论文

如何缓解越野环境中语义分割的分布偏移

基于点云配准的非破坏性高分辨率涂层厚度三维扫描测量

基于原型模糊推理与证据融合的不确定性引导工业机器人可进化识别框架

迈向智能机器人时代：用于高级感知系统的多模态柔性触觉传感器