Home /Research /LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection

PERCEPTION

LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection

Zhengyi Liu, Longzhen Wang, Xianyong Fang, Zhengzheng Tu, Linbo Wang

Year: 2024
Access: Open access

Abstract

A light field camera can reconstruct 3D scenes using captured multi-focus images that contain rich spatial geometric information, enhancing applications in stereoscopic photography, virtual reality, and robotic vision. In this work, a state-of-the-art salient object detection model for multi-focus light field images, called LFSamba, is introduced to emphasize four main insights: (a) Efficient feature extraction, where SAM is used to extract modality-aware discriminative features; (b) Inter-slice relation modeling, leveraging Mamba to capture long-range dependencies across multiple focal slices, thus extracting implicit depth cues; (c) Inter-modal relation modeling, utilizing Mamba to integrate all-focus and multi-focus images, enabling mutual enhancement; (d) Weakly supervised learning capability, developing a scribble annotation dataset from an existing pixel-level mask dataset, establishing the first scribble-supervised baseline for light field salient object detection.https://github.com/liuzywen/LFScribble

Keywords

cs.CV

LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection

Abstract

Keywords

Related papers

How to Relieve Distribution Shifts in Semantic Segmentation for Off-Road Environments

Point cloud registration for non-destructive, high-resolution coating thickness measurement from 3D scans

Uncertainty-guided evolvable recognition framework for industrial robots via prototype-based fuzzy inference and evidence fusion

Toward the intelligent robotics era: Multimodal flexible haptic sensors for advanced perception systems