droidlet: modular, heterogenous, multi-modal agents

Anurag Pratik, Soumith Chintala, Kavya Srinet, Dhiraj Gandhi, Rebecca Qian, Yuxuan Sun, Ryan Drew, Sara Elkafrawy, Anoushka Tiwari, Tucker Hart, Mary Williamson, Abhinav Gupta, Arthur Szlam

发表年份: 2021
访问权限: 开放获取

摘要

In recent years, there have been significant advances in building end-to-end Machine Learning (ML) systems that learn at scale. But most of these systems are: (a) isolated (perception, speech, or language only); (b) trained on static datasets. On the other hand, in the field of robotics, large-scale learning has always been difficult. Supervision is hard to gather and real world physical interactions are expensive. In this work we introduce and open-source droidlet, a modular, heterogeneous agent architecture and platform. It allows us to exploit both large-scale static datasets in perception and language and sophisticated heuristics often used in robotics; and provides tools for interactive annotation. Furthermore, it brings together perception, language and action onto one platform, providing a path towards agents that learn from the richness of real world interactions.

关键词

cs.ROcs.AI

droidlet: modular, heterogenous, multi-modal agents

摘要

关键词

相关论文

如何缓解越野环境中语义分割的分布偏移

基于原型模糊推理与证据融合的不确定性引导工业机器人可进化识别框架

基于点云配准的非破坏性高分辨率涂层厚度三维扫描测量

迈向智能机器人时代：用于高级感知系统的多模态柔性触觉传感器