PERCEPTION

Diffusion-Based Image Augmentation for Semantic Segmentation in Outdoor Robotics

Peter Mortimer, Mirko Maehlisch

Publication year
2025
Access
Open access

Abstract

The performance of learning-based perception algorithms suffers when they are deployed in out-of-distribution and underrepresented environments. Outdoor robots are particularly susceptible to rapid changes in visual scene appearance caused by dynamic lighting, seasonality, and weather effects, which produce scenes that are underrepresented in the training data of the learning-based perception system. In this conceptual paper, we focus on preparing our autonomous vehicle for deployment in snow-filled environments. We propose a novel method for diffusion-based image augmentation to more closely represent the deployment environment in our training data. Diffusion-based image augmentations rely on the public availability of vision foundation models trained on internet-scale datasets. These augmentations allow us to control the semantic distribution of the ground surfaces in the training data and to fine-tune our model for its deployment environment. We employ open-vocabulary semantic segmentation models to filter out augmentation candidates that contain hallucinations. We believe that diffusion-based image augmentations can be extended to many environments beyond snow surfaces, such as sandy environments and volcanic terrain.
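The abstract does not spell out how hallucinated candidates are detected, so the following is only a minimal sketch of one plausible filter, not the authors' method. It assumes that each augmented image has already been segmented by some open-vocabulary model into a grid of class ids, and that a candidate is acceptable when (a) the non-ground pixels keep their original labels and (b) most of the original ground pixels are now predicted as snow. The function names, the class ids, and both thresholds are hypothetical.

```python
from typing import Sequence, Set

Mask = Sequence[Sequence[int]]  # 2D grid of class ids (hypothetical encoding)


def non_ground_agreement(original: Mask, predicted: Mask, ground_ids: Set[int]) -> float:
    """Fraction of originally non-ground pixels whose predicted class is unchanged."""
    total = 0
    matches = 0
    for row_o, row_p in zip(original, predicted):
        for o, p in zip(row_o, row_p):
            if o in ground_ids:
                continue  # ground pixels are expected to change, so skip them
            total += 1
            if o == p:
                matches += 1
    # With no non-ground pixels there is nothing that could have been corrupted.
    return matches / total if total else 1.0


def snow_fraction(original: Mask, predicted: Mask, ground_ids: Set[int], snow_id: int) -> float:
    """Fraction of originally ground pixels that are predicted as snow."""
    total = 0
    snow = 0
    for row_o, row_p in zip(original, predicted):
        for o, p in zip(row_o, row_p):
            if o in ground_ids:
                total += 1
                if p == snow_id:
                    snow += 1
    # With no ground pixels there was nothing to augment, so report 0.0.
    return snow / total if total else 0.0


def keep_candidate(
    original: Mask,
    predicted: Mask,
    ground_ids: Set[int],
    snow_id: int,
    min_agreement: float = 0.9,      # assumed threshold, not from the paper
    min_snow_fraction: float = 0.5,  # assumed threshold, not from the paper
) -> bool:
    """Accept an augmented image only if both checks pass."""
    return (
        non_ground_agreement(original, predicted, ground_ids) >= min_agreement
        and snow_fraction(original, predicted, ground_ids, snow_id) >= min_snow_fraction
    )


# Toy example with hypothetical ids: 0 = sky, 1 = ground, 2 = tree, 3 = snow.
original = [[0, 0, 2], [1, 1, 2]]
good = [[0, 0, 2], [3, 3, 2]]  # ground became snow, everything else unchanged
bad = [[0, 2, 2], [3, 1, 0]]   # non-ground regions changed (hallucination)

print(keep_candidate(original, good, {1}, 3))  # True
print(keep_candidate(original, bad, {1}, 3))   # False
```

In practice the two thresholds would need tuning against manually inspected candidates, and a per-class score such as IoU may be a better fit than raw pixel agreement when class sizes are very unbalanced.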

Keywords

cs.CV
