PERCEPTION

Improving the model of object detection on aerial photographs and video in unmanned aerial systems

Vadym Slyusar, Mykhailo Protsenko, Anton Chernukha, Vasyl Melkin, Oleh Biloborodov, Mykola Samoilenko, Olena Oleksandrivna Kravchenko, Halyna Kalynychenko, Anton Rohovyi, Mykhaylo Soloshchuk

Year: 2022
Citations: 16
Access: Open access

Abstract

This paper considers a neural network model for detecting objects in aerial photographs and video in unmanned aerial systems. Developing artificial intelligence and computer vision for unmanned systems (drones, robots) requires better models for detecting and recognizing objects in images and video streams. In unmanned aircraft systems, an operator processes video and aerial photography manually, which is difficult given the large number of videos and photographs, so automating this process is advisable. An analysis of neural network models showed that YOLOv5x (USA) is the most suitable base model for object detection on aerial photographs and video. This base model is trained on the Microsoft COCO dataset (USA), which contains more than 200,000 images across 80 categories. To improve YOLOv5x, the network was further trained on the VisDrone 2021 image set (China) with the following training parameters chosen as optimal: the SGD optimization algorithm, an initial learning rate (step) of 0.0005, and 25 epochs. The resulting object detection model for aerial photographs and video is named VisDroneYOLOv5x. Its effectiveness was studied on aerial photographs and videos from the VisDrone 2021 set, with accuracy (precision), sensitivity (recall), and the estimate of average accuracy (average precision) as the main indicators. Using a convolutional neural network made it possible to automate object detection on aerial photographs and video in unmanned aerial systems.
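The evaluation indicators named in the abstract can be illustrated with a minimal Python sketch. It assumes they correspond to the precision, recall, and average precision commonly used for object detectors; the function names, the all-point interpolation of the precision-recall curve, and the toy detections below are illustrative assumptions, not the authors' evaluation protocol. Matching detections to ground-truth boxes (for example by an IoU threshold) is assumed to have been done already.

```python
from typing import List, Tuple


def precision_recall(tp: int, fp: int, fn: int) -> Tuple[float, float]:
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def average_precision(
    detections: List[Tuple[float, bool]], num_ground_truth: int
) -> float:
    """Area under the precision-recall curve for one class.

    Each detection is (confidence, is_true_positive). The true-positive
    flag is assumed to come from an earlier matching step that is not
    shown here.
    """
    if num_ground_truth == 0 or not detections:
        return 0.0

    # Rank detections from most to least confident.
    ranked = sorted(detections, key=lambda d: d[0], reverse=True)

    precisions: List[float] = []
    recalls: List[float] = []
    tp = fp = 0
    for _, is_tp in ranked:
        if is_tp:
            tp += 1
        else:
            fp += 1
        precisions.append(tp / (tp + fp))
        recalls.append(tp / num_ground_truth)

    # Replace each precision by the maximum precision at any higher recall,
    # so the curve is non-increasing (all-point interpolation).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Sum rectangle areas between consecutive recall values.
    ap = 0.0
    prev_recall = 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


# Toy example: three detections of one class, two ground-truth objects.
toy_detections = [(0.9, True), (0.8, False), (0.7, True)]
toy_ap = average_precision(toy_detections, num_ground_truth=2)
toy_precision, toy_recall = precision_recall(tp=2, fp=1, fn=0)
```

Averaging `average_precision` over all object classes would give a mean average precision; whether the paper reports it at a single IoU threshold or over a range of thresholds is not stated in the abstract.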

Keywords

Artificial intelligence, Computer science, Computer vision, Aerial photography, Object detection, Set (abstract data type), Artificial neural network, Object (grammar), Process (computing), Aerial image
