Underwater Image Processing and Object Detection Based on Deep CNN Method
Fenglei Han, Jingzheng Yao, Haitao Zhu, Chunhui Wang
- 发表年份
- 2020
- 引用次数
- 171
- 访问权限
- 开放获取
摘要
Due to the importance of underwater exploration in the development and utilization of deep-sea resources, underwater autonomous operation is more and more important to avoid the dangerous high-pressure deep-sea environment. For underwater autonomous operation, the intelligent computer vision is the most important technology. In an underwater environment, weak illumination and low-quality image enhancement, as a preprocessing procedure, is necessary for underwater vision. In this paper, a combination of max-RGB method and shades of gray method is applied to achieve the enhancement of underwater vision, and then a CNN (Convolutional Neutral Network) method for solving the weakly illuminated problem for underwater images is proposed to train the mapping relationship to obtain the illumination map. After the image processing, a deep CNN method is proposed to perform the underwater detection and classification, according to the characteristics of underwater vision, two improved schemes are applied to modify the deep CNN structure. In the first scheme, a <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M1"><mml:mn>1</mml:mn><mml:mo>∗</mml:mo><mml:mn>1</mml:mn></mml:math> convolution kernel is used on the <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M2"><mml:mn>26</mml:mn><mml:mo>∗</mml:mo><mml:mn>26</mml:mn></mml:math> feature map, and then a downsampling layer is added to resize the output to equal <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M3"><mml:mn>13</mml:mn><mml:mo>∗</mml:mo><mml:mn>13</mml:mn></mml:math>. In the second scheme, a downsampling layer is added firstly, and then the convolution layer is inserted in the network, the result is combined with the last output to achieve the detection. Through comparison with the Fast RCNN, Faster RCNN, and the original YOLO V3, scheme 2 is verified to be better in detecting underwater objects. The detection speed is about 50 FPS (Frames per Second), and mAP (mean Average Precision) is about 90%. The program is applied in an underwater robot; the real-time detection results show that the detection and classification are accurate and fast enough to assist the robot to achieve underwater working operation.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002