首页 /研究 /SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Models

LEARNING

SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Models

Delin Qu, Haoming Song, Qizhi Chen, Yuanqi Yao, Xinyi Ye, Jiayuan Gu, Zhigang Wang, Yan Ding, Bin Zhao, Dong Wang

发表年份: 2025
引用次数: 15
访问权限: 开放获取

摘要

and real-world setups, where the pre-learned action grids are re-discretized to capture robot-specific spatial action movements of new setups.The superior results from extensive evaluations demonstrate the exceptional in-distribution generalization and out-of-distribution adaptation capability, highlighting the crucial benefit of the proposed spatial-aware representations for generalist robot policy learning.All the details and codes are opensourced.

关键词

Representation (politics)Field (mathematics)Perspective (graphical)Feature (linguistics)Set (abstract data type)

SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Models

摘要

关键词

相关论文

Self-Organizing Maps

Machine learning a probabilistic perspective

The Organization of Behavior

The spread of true and false news online