Incremental multi-view object detection from a moving camera

Takashi Konno, Ayako Amma, Asako Kanezaki

Year of publication: 2021
Citations: 6

Abstract

Object detection in a single image is a challenging problem due to clutter, occlusions, and a large variety of viewing locations. This task can benefit from integrating multi-frame information captured by a moving camera. In this paper, we propose a method to incrementally integrate object detection scores extracted from multiple frames captured from different viewpoints. For each frame, we run an efficient end-to-end object detector that outputs object bounding boxes, each of which is associated with scores for categories and poses. The scores of detected objects are then stored in grid locations in 3D space. After observing multiple frames, the object scores stored in each grid location are integrated based on the best object pose hypothesis. This strategy requires object categories and poses to be consistent across multiple frames, and thus it significantly suppresses misdetections. The performance of the proposed method is evaluated on our newly created multi-class object dataset captured in robot simulation and real environments, as well as on a public benchmark dataset.
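
The integration step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `ScoreGrid`, the cell size, the use of a simple sum to accumulate scores, the discretization of pose into yaw bins, and the assumption that each detection already comes with a 3D position and a known camera yaw bin are all hypothetical choices made for illustration.

```python
import numpy as np
from collections import defaultdict


class ScoreGrid:
    """Hypothetical 3D grid that accumulates per-frame (category, pose) scores."""

    def __init__(self, num_classes, num_pose_bins, cell_size=0.1):
        self.num_classes = num_classes
        self.num_pose_bins = num_pose_bins
        self.cell_size = cell_size  # assumed edge length of a grid cell, in metres
        # Each cell holds a (num_classes, num_pose_bins) score table in world-frame poses.
        self.cells = defaultdict(lambda: np.zeros((num_classes, num_pose_bins)))

    def cell_index(self, position):
        # Quantize a 3D point to integer grid coordinates.
        idx = np.floor(np.asarray(position, dtype=float) / self.cell_size)
        return tuple(int(i) for i in idx)

    def add_detection(self, position, scores, camera_yaw_bin):
        # `scores` are indexed by pose relative to the camera (assumption).
        # Shifting by the camera's yaw bin expresses them as world-frame poses,
        # so scores from different viewpoints vote for the same hypothesis.
        scores = np.asarray(scores, dtype=float)
        world_scores = np.roll(scores, camera_yaw_bin, axis=1)
        self.cells[self.cell_index(position)] += world_scores

    def best_hypothesis(self, position):
        # Pick the (category, world pose) pair with the highest accumulated score.
        acc = self.cells[self.cell_index(position)]
        cls, pose = np.unravel_index(np.argmax(acc), acc.shape)
        return int(cls), int(pose), float(acc[cls, pose])


# Usage: two frames observe the same cell from different viewpoints.
grid = ScoreGrid(num_classes=2, num_pose_bins=4)

frame1 = np.zeros((2, 4))
frame1[0, 1] = 0.6  # class 0, relative pose 1
frame1[1, 2] = 0.7  # spurious class 1 response, relative pose 2
grid.add_detection((0.42, 1.03, 0.27), frame1, camera_yaw_bin=0)

frame2 = np.zeros((2, 4))
frame2[0, 0] = 0.6  # class 0, relative pose 0 -> world pose 1 after the shift
frame2[1, 3] = 0.5  # spurious class 1 response -> world pose 0, inconsistent with frame 1
grid.add_detection((0.45, 1.01, 0.22), frame2, camera_yaw_bin=1)

result = grid.best_hypothesis((0.42, 1.03, 0.27))
print(result[0], result[1])  # class 0, world pose 1
```

In this toy example the spurious class 1 response has the highest single-frame score, but its implied world-frame pose differs between the two frames, so its evidence is split across pose bins. The class 0 responses agree on one world-frame pose and therefore win after accumulation, which is the kind of cross-frame consistency the abstract refers to.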

Keywords

Artificial intelligence, Computer vision, Computer science, Object detection, Benchmark (surveying), Object (grammar), Bounding overwatch, Grid, Frame (networking), Cognitive neuroscience of visual object recognition
