Efficient frequency domain CNN algorithm
Mihir Mody, Chaitanya Ghone, Manu Mathew, Jason Jones
- 发表年份
- 2017
- 引用次数
- 2
摘要
Deep Learning techniques like Convolutional Neural Networks (CNN) are getting popular for image classification with broad usage spanning across automotive, industrial, medicine, robotics etc. Typical CNN network consists of multiple layers of 2D convolutions, non-linearity, spatial pooling and fully connected layer, with 2D convolutions constituting more than 90% of total computations. The Fast Fourier Transform (FFT) based approach for convolution is promising in theory, but not used in practice due to growth in memory sizing of coefficients storage. The paper proposes new frequency domain algorithm which avoids memory size growth compared to traditional FFT based approach for performing 2D convolution. The proposed algorithm performs Fourier Transform (FT) of coefficients On-The-Fly (OTF) instead of offline calculation on PC. The proposed algorithm consists of expands, OTF-FT and pruning blocks that do efficient 2D convolution in the frequency domain. The proposed algorithm is compared with the FFT-based algorithm for the coefficient transformation. As per simulations, assuming typical network configuration parameters, the proposed algorithm is 4-8X faster compared to FFT based approach for the co-efficient transform.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002