基于FPGA的节能卷积神经网络

N. L. Venkataraman, S. Sumithra, Dr. Suresh Kumar, R. Purushothaman, K. Kukulavani, V. Gowri
{"title":"基于FPGA的节能卷积神经网络","authors":"N. L. Venkataraman, S. Sumithra, Dr. Suresh Kumar, R. Purushothaman, K. Kukulavani, V. Gowri","doi":"10.1109/ICCES57224.2023.10192650","DOIUrl":null,"url":null,"abstract":"Research into GPU (Graphics Processing Unit) acceleration using programmable logic arrays has mostly focused on Convolutional Neural Networks. These studies demonstrate the effectiveness of CNNs in various technical vision tasks, such as feature extraction, image analysis, face identification, and rear cross-traffic alert, amongst many others. As a result, there are restrictions on the times in which the CNN model can be implemented on FPGA, such as the restrictions on the quantity of on-chip memory, the dimensions of the CNN, and the parameters of the model. This work suggests a television commercial and an advanced CNN prototype informed by the basic AlexNet prototype. The proposed architecture uses a Commercial engine, an improved version of the insight separates, and a distinct permutation unit. In addition, the designers provide a GPU integration model that supports the Mish and Rectified linear initiation characteristics. The suggested method has a comparatively high detection accuracy while consuming a relatively little amount of the computer system's resources in contrast to other methods considered to be state-of-the-art. This proposed system is design in RTL Verilog Hardware description language. The proposed system is implemented in Xilinx ISE Design Suite-13.1 for use on Spartan -6 (Target Device). The synthesis tool optimized speed, area, and power.","PeriodicalId":442189,"journal":{"name":"2023 8th International Conference on Communication and Electronics Systems (ICCES)","volume":"525 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FPGA based Power-Efficient Convolutional Neural Network\",\"authors\":\"N. L. Venkataraman, S. Sumithra, Dr. Suresh Kumar, R. Purushothaman, K. Kukulavani, V. Gowri\",\"doi\":\"10.1109/ICCES57224.2023.10192650\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Research into GPU (Graphics Processing Unit) acceleration using programmable logic arrays has mostly focused on Convolutional Neural Networks. These studies demonstrate the effectiveness of CNNs in various technical vision tasks, such as feature extraction, image analysis, face identification, and rear cross-traffic alert, amongst many others. As a result, there are restrictions on the times in which the CNN model can be implemented on FPGA, such as the restrictions on the quantity of on-chip memory, the dimensions of the CNN, and the parameters of the model. This work suggests a television commercial and an advanced CNN prototype informed by the basic AlexNet prototype. The proposed architecture uses a Commercial engine, an improved version of the insight separates, and a distinct permutation unit. In addition, the designers provide a GPU integration model that supports the Mish and Rectified linear initiation characteristics. The suggested method has a comparatively high detection accuracy while consuming a relatively little amount of the computer system's resources in contrast to other methods considered to be state-of-the-art. This proposed system is design in RTL Verilog Hardware description language. The proposed system is implemented in Xilinx ISE Design Suite-13.1 for use on Spartan -6 (Target Device). The synthesis tool optimized speed, area, and power.\",\"PeriodicalId\":442189,\"journal\":{\"name\":\"2023 8th International Conference on Communication and Electronics Systems (ICCES)\",\"volume\":\"525 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 8th International Conference on Communication and Electronics Systems (ICCES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCES57224.2023.10192650\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 8th International Conference on Communication and Electronics Systems (ICCES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCES57224.2023.10192650","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

使用可编程逻辑阵列加速GPU(图形处理单元)的研究主要集中在卷积神经网络上。这些研究证明了cnn在各种技术视觉任务中的有效性,例如特征提取、图像分析、人脸识别和后方交叉交通警报等。因此,在FPGA上实现CNN模型的次数是有限制的,如片上存储器的数量、CNN的尺寸、模型的参数等。这项工作提出了一个电视广告和一个先进的CNN原型,由基本的AlexNet原型提供信息。所建议的体系结构使用商业引擎、改进版本的洞察分离和不同的排列单元。此外,设计人员还提供了支持Mish和Rectified线性启动特性的GPU集成模型。与其他被认为是最先进的方法相比,所建议的方法具有相对较高的检测精度,同时消耗相对较少的计算机系统资源。本系统采用RTL Verilog硬件描述语言进行设计。提出的系统在Xilinx ISE Design Suite-13.1中实现,用于Spartan -6(目标设备)。合成工具优化了速度、面积和功率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
FPGA based Power-Efficient Convolutional Neural Network
Research into GPU (Graphics Processing Unit) acceleration using programmable logic arrays has mostly focused on Convolutional Neural Networks. These studies demonstrate the effectiveness of CNNs in various technical vision tasks, such as feature extraction, image analysis, face identification, and rear cross-traffic alert, amongst many others. As a result, there are restrictions on the times in which the CNN model can be implemented on FPGA, such as the restrictions on the quantity of on-chip memory, the dimensions of the CNN, and the parameters of the model. This work suggests a television commercial and an advanced CNN prototype informed by the basic AlexNet prototype. The proposed architecture uses a Commercial engine, an improved version of the insight separates, and a distinct permutation unit. In addition, the designers provide a GPU integration model that supports the Mish and Rectified linear initiation characteristics. The suggested method has a comparatively high detection accuracy while consuming a relatively little amount of the computer system's resources in contrast to other methods considered to be state-of-the-art. This proposed system is design in RTL Verilog Hardware description language. The proposed system is implemented in Xilinx ISE Design Suite-13.1 for use on Spartan -6 (Target Device). The synthesis tool optimized speed, area, and power.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信