首页 /研究 /SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Models
LEARNING

SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Models

Delin Qu, Haoming Song, Qizhi Chen, Yuanqi Yao, Xinyi Ye, Jiayuan Gu, Zhigang Wang, Yan Ding, Bin Zhao, Dong Wang

发表年份
2025
引用次数
15
访问权限
开放获取

摘要

and real-world setups, where the pre-learned action grids are re-discretized to capture robot-specific spatial action movements of new setups.The superior results from extensive evaluations demonstrate the exceptional in-distribution generalization and out-of-distribution adaptation capability, highlighting the crucial benefit of the proposed spatial-aware representations for generalist robot policy learning.All the details and codes are opensourced.

关键词

Representation (politics)Field (mathematics)Perspective (graphical)Feature (linguistics)Set (abstract data type)

相关论文

查看 LEARNING 分类全部论文