Home /Research /Scalable FPGA-Based Convolutional Neural Network Accelerator for Embedded Systems

PERCEPTION

Scalable FPGA-Based Convolutional Neural Network Accelerator for Embedded Systems

Jingyuan Zhao, Zhendong Yin, Yanlong Zhao, Mingyang Wu, Mingdong Xu

Year: 2019
Citations: 5

Abstract

Convolutional neural network(CNN) and related deep learning algorithms represent the state-of-art ability in several computer vision tasks, such as image classification and video analysis. However, model complexity and massive computational operations restrain CNN deployment on embedded systems with limited computing capability and low power budgets including smartphones, drones and robots. In this paper, we propose a scalable FPGA-based CNN hardware accelerator for embedded systems based on 8-bit fixed point approximation of a hardware-friendly CNN model named SqueezeNet v1.1. Proposed FPGA accelerator is implemented with OpenCL framework and achieves 1.9x energy efficiency compared to previous work. Besides, performance and resource occupation of proposed FPGA accelerator are capable of being customized to fit in different FPGA platforms by changing a single hyperparameter.

Keywords

Computer scienceField-programmable gate arrayConvolutional neural networkScalabilityFPGA prototypeHardware accelerationDeep learningEmbedded systemArtificial neural networkSoftware deployment

Scalable FPGA-Based Convolutional Neural Network Accelerator for Embedded Systems

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory