首页 /研究 /History-Enhanced 3D Scene Graph Reasoning From RGB-D Sequences
OTHER

History-Enhanced 3D Scene Graph Reasoning From RGB-D Sequences

Mingtao Feng, Chan Kit Yan, Zijie Wu, Weisheng Dong, Yaonan Wang, Ajmal Mian

发表年份
2025
引用次数
28

摘要

3D scene graph has emerged as a powerful high-level representation of the environment, and is considered a prerequisite for long-term autonomous robotic operations. However, building rich representations from RGB-D sequences remains a challenging problem. Existing methods ignore the semantic gap between linguistic and geometric feature spaces or neglect the importance of historical context in incrementally captured data. This limits the learning of visual-textual correspondence and the capability of relationship prediction. To address these problems, we propose a history-enhanced 3D scene graph reasoning framework that incrementally builds a consistent 3D semantic scene graph from an RGB-D image sequence. Specifically, we first introduce a cross-domain unified feature representation module to describe the object instances and their relationships distinctly. Next, we build a one-hot candidate matrix-enabled recurrent mechanism to reason the 3D scene graph, combining the perceived global and local history information. Finally, we design history-aware supervised semantics contrastive learning to optimize the scene-specific global history features. Extensive experiments on the 3DSSG dataset show the effectiveness of the proposed method in this challenging task, outperforming state-of-the-art approaches. Our code will be available at <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/cbyan1003/HE-3DSGR</uri>.

关键词

Computer scienceArtificial intelligenceComputer visionRGB color modelGraphTheoretical computer science

相关论文

查看 OTHER 分类全部论文