Voice Expression System of Visual Environment for a Guide Dog Robot

Reiya Ichikawa, Bin Zhang, Hun‐ok Lim

Year: 2022
Citations: 7

Abstract

This paper proposes a voice expression system that describes the visual environment for a guide dog robot. The system enables the robot to recognize objects and scenes in front of it using a CNN (convolutional neural network) and to generate captions of those scenes using an LSTM (long short-term memory) network. The robot then expresses the recognized scene by voice, which is generated by speech synthesis. The guide dog robot can thus guide a visually impaired person safely to the desired destination while also entertaining the user by conveying various visual information through voice. The system consists of three components: object recognition, scene captioning, and speech synthesis. The effectiveness of the system is confirmed through experiments conducted with our guide dog robot.
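The three stages named in the abstract (recognition, captioning, speech synthesis) can be pictured as a simple chain. The following is only a minimal sketch of that data flow, not the authors' implementation: every name in it (`VoiceExpressionPipeline`, `stub_cnn`, `stub_captioner`, `stub_tts`) is hypothetical, and the stubs are toy stand-ins for a real CNN, LSTM captioner, and speech synthesizer.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Hypothetical type alias: a camera frame as rows of pixel intensities in [0, 1].
Image = Sequence[Sequence[float]]


@dataclass
class VoiceExpressionPipeline:
    """Chains the three stages described in the abstract (names are assumptions)."""
    extract_features: Callable[[Image], List[float]]  # stands in for the CNN
    generate_caption: Callable[[List[float]], str]    # stands in for the LSTM captioner
    synthesize: Callable[[str], bytes]                # stands in for speech synthesis

    def describe(self, frame: Image) -> bytes:
        features = self.extract_features(frame)   # image -> feature vector
        caption = self.generate_caption(features)  # features -> sentence
        return self.synthesize(caption)            # sentence -> audio bytes


def stub_cnn(frame: Image) -> List[float]:
    # Toy feature extractor: mean intensity of each row.
    return [sum(row) / len(row) for row in frame]


def stub_captioner(features: List[float]) -> str:
    # Toy captioner: picks a sentence from the average feature value.
    mean = sum(features) / len(features)
    return "a bright scene ahead" if mean > 0.5 else "a dark scene ahead"


def stub_tts(text: str) -> bytes:
    # Toy synthesizer: returns the text as UTF-8 bytes instead of audio samples.
    return text.encode("utf-8")


pipeline = VoiceExpressionPipeline(stub_cnn, stub_captioner, stub_tts)
audio = pipeline.describe([[0.9, 0.8], [0.7, 0.6]])
```

Keeping each stage behind a plain callable means a trained model or a speech engine could replace a stub without changing `describe`; whether the actual system is organized this way is not stated in the abstract.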

Keywords

Computer science, Robot, Computer vision, Artificial intelligence, Convolutional neural network, Expression (computer science), Object (grammar), Social robot, Mobile robot, Speech recognition
