PERCEPTION

A Demonstration of FPGA-Based You Only Look Once Version2 (YOLOv2)

Hiroki Nakahara, Masayuki Shimoda, Shimpei Sato

Publication year
2018
Citations
18

Abstract

We implement the YOLO (You Only Look Once) object detector, which offers both high speed and high accuracy, on an FPGA. YOLO is based on a deep convolutional neural network (CNN), which dominates both the performance and the area of the implementation. It is widely used in embedded systems such as robotics, autonomous driving, security, and drones, all of which require high performance and low power consumption. Frame object detection consists of two problems that must both be solved within a real-time frame rate: a regression problem to spatially separated bounding boxes, and the associated classification of the objects. We use a binary (1-bit) precision CNN for feature extraction and a half-precision (16-bit) CNN for both classification and localization. We implement a pipeline-based architecture for the mixed-precision YOLOv2 on the Xilinx Inc. ZCU102 board, which has a Xilinx Inc. Zynq UltraScale+ MPSoC. The implemented object detector achieved 35.71 frames per second (FPS), which is faster than the standard video speed (29.9 FPS). Compared with a CPU and a GPU, the FPGA-based accelerator was superior in power-performance efficiency. Our method is suitable as a frame object detector for embedded vision systems.
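
The abstract does not say how the binary layers are computed. As a rough illustration only, the sketch below shows the XNOR-popcount dot product that binarized CNNs commonly use, together with a round-trip through IEEE 754 half precision as a stand-in for the 16-bit part. The function names, the sign convention (zero maps to +1), and the sample values are all assumptions made for this example and are not taken from the paper.

```python
import struct

def binarize(values):
    """Map real values to +1/-1 by sign (0 maps to +1, an assumed convention)."""
    return [1 if v >= 0 else -1 for v in values]

def pack_bits(signs):
    """Pack +1/-1 values into an int: bit i is 1 for +1 and 0 for -1."""
    word = 0
    for i, s in enumerate(signs):
        if s == 1:
            word |= 1 << i
    return word

def xnor_popcount_dot(a_word, b_word, n):
    """Dot product of two packed +1/-1 vectors of length n.

    XNOR marks the positions where the signs agree. Each agreement
    contributes +1 and each disagreement -1, so dot = 2 * agreements - n.
    """
    mask = (1 << n) - 1
    agreements = bin(~(a_word ^ b_word) & mask).count("1")
    return 2 * agreements - n

def to_half(x):
    """Round a Python float to IEEE 754 half precision (16 bit) and back."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

# Hypothetical activations and weights, chosen only for illustration.
activations = [0.7, -1.2, 0.1, -0.3, 2.0, -0.5, 0.0, 1.1]
weights = [-0.4, -0.9, 0.6, 0.2, 1.5, -0.1, -0.8, 0.3]

a = binarize(activations)
w = binarize(weights)

# Reference result with ordinary multiply-accumulate on the +1/-1 values.
reference = sum(x * y for x, y in zip(a, w))

# Same result using only XOR, NOT, and a bit count on packed words.
fast = xnor_popcount_dot(pack_bits(a), pack_bits(w), len(a))

# Half precision keeps roughly three decimal digits of a value like 0.1.
rounded = to_half(0.1)
```

Replacing multiply-accumulate with bitwise operations is one reason binary layers tend to map to small amounts of FPGA logic, while 16-bit arithmetic keeps more numeric range where the outputs are real-valued, as with box coordinates.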

Keywords

Field-programmable gate array, Computer science, Frame rate, Convolutional neural network, Object detection, Artificial intelligence, Frame (networking), Feature extraction, Detector, Feature (linguistics)
