Dense Prediction Transformer for Scale Estimation in Monocular Visual Odometry
André O. Françani, Marcos R. O. A. Maximo
- Year
- 2022
- Access
- Open access
Abstract
Monocular visual odometry consists of the estimation of the position of an agent through images of a single camera, and it is applied in autonomous vehicles, medical robots, and augmented reality. However, monocular systems suffer from the scale ambiguity problem due to the lack of depth information in 2D frames. This paper contributes by showing an application of the dense prediction transformer model for scale estimation in monocular visual odometry systems. Experimental results show that the scale drift problem of monocular systems can be reduced through the accurate estimation of the depth map by this model, achieving competitive state-of-the-art performance on a visual odometry benchmark.
Keywords
Related papers
Robotics in Plastic Surgery
Vijay Kumar, Sandhya Pandey
Clinical Journal of Plastic & Reconstructive Surgery · 2026
SurfSurg6D: Geometry Consistent Dense Correspondence for Textureless Surgical Instrument Pose Estimation
Daiyun Shen, Shuojue Yang, Chang Han Low +4 more
2026
EndoGSim: Physics-Aware 4D Dynamic Endoscopic Scene Simulations via MLLM-Guided Gaussian Splatting
Changjing Liu, Yiming Huang, Long Bai +2 more
2026
Retroperitoneal Robot-Assisted Nephroureterectomy: Technical Description and Single Center Experience.
Kawashima A, Ishizuya Y, Yamamoto Y +9 more
Asian journal of endoscopic surgery · 2026