Home /Research /Deep Active Cross-Modal Visuo-Tactile Transfer Learning for Robotic Object Recognition
PERCEPTION

Deep Active Cross-Modal Visuo-Tactile Transfer Learning for Robotic Object Recognition

Prajval Kumar Murali, Cong Wang, Dongheui Lee, Ravinder Dahiya, Mohsen Kaboli

Year
2022
Citations
23
Access
Open access

Abstract

We propose for the first time, a novel deep active visuo-tactile cross-modal full-fledged framework for object recognition by autonomous robotic systems. Our proposed network <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">xAVTNet</i> is actively trained with labelled point clouds from a vision sensor with one robot and tested with an active tactile perception strategy to recognise objects never touched before using another robot. We propose a novel visuo-tactile loss ( <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">VTLoss</i> ) to minimise the discrepancy between the visual and tactile domains for unsupervised domain adaptation. Our framework leverages the strengths of deep neural networks for cross-modal recognition along with active perception and active learning strategies for increased efficiency by minimising redundant data collection. Our method is extensively evaluated on a real robotic system and compared against baselines and other state-of-art approaches. We demonstrate clear outperformance in recognition accuracy compared to the state-of-art visuo-tactile cross-modal recognition method.

Keywords

Artificial intelligenceComputer scienceActive perceptionModalDeep learningComputer visionCognitive neuroscience of visual object recognitionAdaptation (eye)Transfer of learningTactile sensor

Related papers

Browse all PERCEPTION papers