PERCEPTION

Underwater Image Processing and Object Detection Based on Deep CNN Method

Fenglei Han, Jingzheng Yao, Haitao Zhu, Chunhui Wang

Year published
2020
Citations
171
Access
Open access

Abstract

Underwater exploration is central to the development and utilization of deep-sea resources, and autonomous underwater operation is increasingly important as a way to avoid the dangerous high-pressure deep-sea environment. Intelligent computer vision is the most important technology for such autonomous operation. Because underwater scenes are weakly illuminated and yield low-quality images, image enhancement is a necessary preprocessing step for underwater vision. In this paper, a combination of the max-RGB method and the shades of gray method is applied to enhance underwater images, and a CNN (Convolutional Neural Network) method for weakly illuminated underwater images is then proposed, which is trained to learn the mapping that yields the illumination map. After image processing, a deep CNN method is proposed to perform underwater detection and classification. Based on the characteristics of underwater vision, two improved schemes are applied to modify the deep CNN structure. In the first scheme, a 1 × 1 convolution kernel is applied to the 26 × 26 feature map, and a downsampling layer is then added to resize the output to 13 × 13. In the second scheme, a downsampling layer is added first and a convolution layer is then inserted in the network; the result is combined with the last output to perform the detection. Comparison with Fast RCNN, Faster RCNN, and the original YOLO V3 shows that scheme 2 is better at detecting underwater objects. The detection speed is about 50 FPS (frames per second), and the mAP (mean Average Precision) is about 90%. The program is deployed on an underwater robot; the real-time detection results show that detection and classification are accurate and fast enough to assist the robot in carrying out underwater operations.
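
As a rough illustration of two ideas named in the abstract, the sketch below uses NumPy only. It is not the authors' implementation. The abstract does not say how max-RGB and shades of gray are combined, so the code simply exposes both through one Minkowski-norm illuminant estimate (the standard formulation, in which max-RGB is the limit p → ∞). The names `estimate_illuminant`, `white_balance`, and `scheme1_shapes`, the default `p=6`, the choice of 2 × 2 max pooling as the downsampling layer, and the channel counts are all assumptions made for the example.

```python
import numpy as np

def estimate_illuminant(img, p=6.0):
    """Per-channel Minkowski p-norm mean of an (H, W, 3) image in [0, 1].

    This is the shades-of-gray estimate; p = np.inf gives max-RGB.
    """
    flat = img.reshape(-1, 3).astype(np.float64)
    if np.isinf(p):
        return flat.max(axis=0)
    return np.mean(flat ** p, axis=0) ** (1.0 / p)

def white_balance(img, p=6.0):
    """Scale each channel so the estimated illuminant becomes neutral."""
    illum = estimate_illuminant(img, p)
    gains = illum.mean() / np.maximum(illum, 1e-8)  # guard against zero division
    return np.clip(img * gains, 0.0, 1.0)

def scheme1_shapes(feat, weights):
    """1x1 convolution followed by 2x2 max pooling with stride 2.

    feat:    (H, W, C_in) feature map, H and W even (e.g. 26 x 26)
    weights: (C_in, C_out) kernel of the 1x1 convolution
    The pooling type is an assumption; the abstract only says "downsampling".
    """
    mixed = feat @ weights  # a 1x1 convolution is a per-pixel channel mix
    h, w, c = mixed.shape
    return mixed.reshape(h // 2, 2, w // 2, 2, c).max(axis=(1, 3))

if __name__ == "__main__":
    # Synthetic image with a color cast: the same pattern scaled per channel.
    base = np.linspace(0.1, 1.0, 64).reshape(8, 8, 1)
    cast = base * np.array([0.4, 0.9, 0.7])
    print(estimate_illuminant(white_balance(cast, p=np.inf), p=np.inf))

    feat = np.ones((26, 26, 4))
    print(scheme1_shapes(feat, np.ones((4, 2))).shape)
```

Calling `white_balance(img, p=np.inf)` applies the max-RGB correction and a finite `p` applies shades of gray. `scheme1_shapes` only demonstrates how a 26 × 26 map can be brought to 13 × 13 so that it can be merged with a 13 × 13 output.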

Keywords

Convolutional neural network, Underwater, Artificial intelligence, Computer science, Algorithm, RGB color model, Deep learning, Computer vision, Geology
