A Multimodality Scene Graph Generation Approach for Robust Human–Robot Collaborative Assembly Visual Relationship Representation
Jianhao Lv, Rong Zhang, Xinyu Li, Shimin Liu, Tianyuan Liu, Qi Zhang, Jinsong Bao
- Publication year
- 2023
- Citations
- 11
Abstract
Human–robot collaborative assembly requires comprehensive perception of the working scenario in order to support as many forms of assembly collaboration as possible. Nevertheless, existing works have focused on physical entities (e.g., via object detection and pose estimation) while ignoring the importance of interactive relationships. This research gap makes it difficult to capture the cues needed for decision-making, especially in complicated assembly tasks. Furthermore, inadequate relative position characteristics and indescribable object influence remain challenging for visual relationship representation. To address these gaps, a multimodality scene graph generation approach is proposed to describe abstract visual relationships more robustly. A novel heat modality is presented to better represent relative spatial characteristics. Three strategies are developed for adapting different baselines in the multimodality feature encoder module. Experimental results demonstrate the generality and strong performance of the approach on multimodality scene graph generation tasks in human–robot collaborative assembly scenarios.