PERCEPTION

Deep Object Detector With Attentional Spatiotemporal LSTM for Space Human–Robot Interaction

Jiahui Yu, Hongwei Gao, Yongquan Chen, Dalin Zhou, Jinguo Liu, Zhaojie Ju

Year of publication: 2022
Citations: 34

Abstract

Global temporal information and local semantic information are essential cues for high-performance online object detection in videos. However, despite promising detection accuracy in most cases, most state-of-the-art approaches have the following two limitations: ineffective background/scale suppression and inadequate mining of temporal information between frames. Many existing works focus on temporal information learning based on a single frame. In this article, we propose an attentional global–local information learning network; this is one of the first attempts to fully use both types of information between frames. Attention maps are used to transfer temporal contexts between frames, which also effectively alleviates the adverse effects of scale changes. Furthermore, empowered by a detailed framework, the proposed detector effectively uses multilevel feature extraction. Given these contributions, the proposed detector achieves state-of-the-art performance on challenging benchmarks. Finally, practical experiments are conducted on a space human–robot interaction platform.

Keywords

Computer science, Artificial intelligence, Frame (networking), Robot, Object (grammar), Detector, Focus (optics), Scale (ratio), Transfer of learning, Space (punctuation)
