{"title":"Efficient frequency domain CNN algorithm","authors":"Mihir Mody, C. Ghone, Manu Mathew, Jason Jones","doi":"10.1109/ICCE-ASIA.2017.8307846","DOIUrl":null,"url":null,"abstract":"Deep Learning techniques like Convolutional Neural Networks (CNN) are getting popular for image classification with broad usage spanning across automotive, industrial, medicine, robotics etc. Typical CNN network consists of multiple layers of 2D convolutions, non-linearity, spatial pooling and fully connected layer, with 2D convolutions constituting more than 90% of total computations. The Fast Fourier Transform (FFT) based approach for convolution is promising in theory, but not used in practice due to growth in memory sizing of coefficients storage. The paper proposes new frequency domain algorithm which avoids memory size growth compared to traditional FFT based approach for performing 2D convolution. The proposed algorithm performs Fourier Transform (FT) of coefficients On-The-Fly (OTF) instead of offline calculation on PC. The proposed algorithm consists of expands, OTF-FT and pruning blocks that do efficient 2D convolution in the frequency domain. The proposed algorithm is compared with the FFT-based algorithm for the coefficient transformation. As per simulations, assuming typical network configuration parameters, the proposed algorithm is 4–8X faster compared to FFT based approach for the co-efficient transform.","PeriodicalId":202045,"journal":{"name":"2017 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-ASIA.2017.8307846","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Deep Learning techniques like Convolutional Neural Networks (CNN) are getting popular for image classification with broad usage spanning across automotive, industrial, medicine, robotics etc. Typical CNN network consists of multiple layers of 2D convolutions, non-linearity, spatial pooling and fully connected layer, with 2D convolutions constituting more than 90% of total computations. The Fast Fourier Transform (FFT) based approach for convolution is promising in theory, but not used in practice due to growth in memory sizing of coefficients storage. The paper proposes new frequency domain algorithm which avoids memory size growth compared to traditional FFT based approach for performing 2D convolution. The proposed algorithm performs Fourier Transform (FT) of coefficients On-The-Fly (OTF) instead of offline calculation on PC. The proposed algorithm consists of expands, OTF-FT and pruning blocks that do efficient 2D convolution in the frequency domain. The proposed algorithm is compared with the FFT-based algorithm for the coefficient transformation. As per simulations, assuming typical network configuration parameters, the proposed algorithm is 4–8X faster compared to FFT based approach for the co-efficient transform.