Home /Research /From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction

SURGICAL

From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction

Ayberk Acar, Mariana Magnus Smith, Lidia Al-Zogbi, Tanner Watts, Fangjie Li, Hao Li, Nural Yilmaz, Paul Maria Scheikl, Jesse F. d’Almeida, Susheela Sharma, Lauren Branscombe, Tayfun Efe Ertop, Robert J. Webster, İpek Oğuz, Alan Kuntz, Axel Krieger, Jie Ying Wu

Year: 2025
Citations: 2

Abstract

Surgical automation requires precise guidance and understanding of the scene. Current methods in the literature rely on bulky depth cameras to create maps of the anatomy; however, this does not translate well to space-limited clinical applications. Monocular cameras are small and allow minimally invasive surgeries in tight spaces, but additional processing is required to generate 3D scene understanding. We propose a 3D mapping pipeline that uses only RGB images to create segmented point clouds of the target anatomy. To ensure the most accurate reconstruction, we compare different structure from motion algorithms’ performance on mapping the central airway obstructions, and test the pipeline on a downstream task of tumor resection. In several metrics, including post-procedure percentage tissue charring, our pipeline performs comparably to RGB-D cameras and, in some cases, even surpasses their downstream task performance. These promising results demonstrate that automation guidance can be achieved in minimally invasive procedures with monocular cameras. This study is a step toward the complete autonomy of surgical robots.

Keywords

MonocularPipeline (software)3D reconstructionAutomationPhotogrammetryTask (project management)Point cloudRGB color model

From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction

Abstract

Keywords

Related papers

A new optimizer using particle swarm theory

3D is here: Point Cloud Library (PCL)

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

Computer and Robot Vision