Home /Research /Temporal Prompt Learning With Depth Memory for Video Mirror Detection
OTHER

Temporal Prompt Learning With Depth Memory for Video Mirror Detection

Zhaohu Xing, Ye Tian, Xin Yang, Sixiang Chen, Huazhu Fu, Yan Nei Law, Lei Zhu

Year
2025
Citations
1

Abstract

Mirror detection in dynamic scenes plays a crucial role in ensuring safety for various applications, such as drone tracking and robot navigation. However, current mirror detection models often fail in areas with mirrors that have a similar visual and color appearance to their surrounding objects. They also struggle to generalize well in complex cases, primarily due to limited annotated datasets. In this work, we propose a novel temporal prompt learning network with depth memory (TPD-Net) to address these critical challenges. Our approach includes several key components. First, we introduce a Temporal Prompt Generator (TPG) to learn temporal prompt features. Then, we devise Multi-layer Depth-aware Adaptor (MDA) modules to progressively adapt prompt features from the TPG, thereby learning mirror-related features by embedding temporal depth information as guidance. Moreover, we further refine these mirror-related features by constructing a depth memory and a Depth Memory Read module to read the temporal depths stored in the memory, boosting video mirror detection. Experimental results on a benchmark dataset show that our TPD-Net significantly outperforms 22 state-of-the-art methods in video mirror detection tasks. Our code, models, and results are publicly available at <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/ge-xing/TPDNet</uri>.

Keywords

Boosting (machine learning)EmbeddingKey (lock)Benchmark (surveying)Tracking (education)Generator (circuit theory)Deep learningRobot

Related papers

Browse all OTHER papers