Home /Research /Multi-modal Variational Faster R-CNN for Improved Visual Object Detection in Manufacturing
PERCEPTION

Multi-modal Variational Faster R-CNN for Improved Visual Object Detection in Manufacturing

Panagiotis Mouzenidis, Antonios Louros, Dimitrios Konstantinidis, Kosmas Dimitropoulos, Petros Daras, Theofilos Mastos

Year
2021
Citations
6

Abstract

Visual object detection is a critical task for a variety of industrial applications, such as robot navigation, quality control and product assembling. Modern industrial environments require AI-based object detection methods that can achieve high accuracy, robustness and generalization. To this end, we propose a novel object detection approach that can process and fuse information from RGB-D images for the accurate detection of industrial objects. The proposed approach utilizes a novel Variational Faster R-CNN algorithm that aims to improve the robustness and generalization ability of the original Faster R-CNN algorithm by employing a VAE encoder-decoder network and a very powerful attention layer. Experimental results on two object detection datasets, namely the well-known RGB-D Washington dataset and the QCONPASS dataset of industrial objects that is first presented in this paper, verify the significant performance improvement achieved when the proposed approach is employed.

Keywords

Computer scienceRobustness (evolution)Artificial intelligenceObject detectionComputer visionRGB color modelGeneralizationEncoderPattern recognition (psychology)Industrial robot

Related papers

Browse all PERCEPTION papers