N. L. Venkataraman, S. Sumithra, Dr. Suresh Kumar, R. Purushothaman, K. Kukulavani, V. Gowri
{"title":"基于FPGA的节能卷积神经网络","authors":"N. L. Venkataraman, S. Sumithra, Dr. Suresh Kumar, R. Purushothaman, K. Kukulavani, V. Gowri","doi":"10.1109/ICCES57224.2023.10192650","DOIUrl":null,"url":null,"abstract":"Research into GPU (Graphics Processing Unit) acceleration using programmable logic arrays has mostly focused on Convolutional Neural Networks. These studies demonstrate the effectiveness of CNNs in various technical vision tasks, such as feature extraction, image analysis, face identification, and rear cross-traffic alert, amongst many others. As a result, there are restrictions on the times in which the CNN model can be implemented on FPGA, such as the restrictions on the quantity of on-chip memory, the dimensions of the CNN, and the parameters of the model. This work suggests a television commercial and an advanced CNN prototype informed by the basic AlexNet prototype. The proposed architecture uses a Commercial engine, an improved version of the insight separates, and a distinct permutation unit. In addition, the designers provide a GPU integration model that supports the Mish and Rectified linear initiation characteristics. The suggested method has a comparatively high detection accuracy while consuming a relatively little amount of the computer system's resources in contrast to other methods considered to be state-of-the-art. This proposed system is design in RTL Verilog Hardware description language. The proposed system is implemented in Xilinx ISE Design Suite-13.1 for use on Spartan -6 (Target Device). The synthesis tool optimized speed, area, and power.","PeriodicalId":442189,"journal":{"name":"2023 8th International Conference on Communication and Electronics Systems (ICCES)","volume":"525 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FPGA based Power-Efficient Convolutional Neural Network\",\"authors\":\"N. L. Venkataraman, S. Sumithra, Dr. Suresh Kumar, R. Purushothaman, K. Kukulavani, V. Gowri\",\"doi\":\"10.1109/ICCES57224.2023.10192650\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Research into GPU (Graphics Processing Unit) acceleration using programmable logic arrays has mostly focused on Convolutional Neural Networks. These studies demonstrate the effectiveness of CNNs in various technical vision tasks, such as feature extraction, image analysis, face identification, and rear cross-traffic alert, amongst many others. As a result, there are restrictions on the times in which the CNN model can be implemented on FPGA, such as the restrictions on the quantity of on-chip memory, the dimensions of the CNN, and the parameters of the model. This work suggests a television commercial and an advanced CNN prototype informed by the basic AlexNet prototype. The proposed architecture uses a Commercial engine, an improved version of the insight separates, and a distinct permutation unit. In addition, the designers provide a GPU integration model that supports the Mish and Rectified linear initiation characteristics. The suggested method has a comparatively high detection accuracy while consuming a relatively little amount of the computer system's resources in contrast to other methods considered to be state-of-the-art. This proposed system is design in RTL Verilog Hardware description language. The proposed system is implemented in Xilinx ISE Design Suite-13.1 for use on Spartan -6 (Target Device). The synthesis tool optimized speed, area, and power.\",\"PeriodicalId\":442189,\"journal\":{\"name\":\"2023 8th International Conference on Communication and Electronics Systems (ICCES)\",\"volume\":\"525 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 8th International Conference on Communication and Electronics Systems (ICCES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCES57224.2023.10192650\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 8th International Conference on Communication and Electronics Systems (ICCES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCES57224.2023.10192650","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
使用可编程逻辑阵列加速GPU(图形处理单元)的研究主要集中在卷积神经网络上。这些研究证明了cnn在各种技术视觉任务中的有效性,例如特征提取、图像分析、人脸识别和后方交叉交通警报等。因此,在FPGA上实现CNN模型的次数是有限制的,如片上存储器的数量、CNN的尺寸、模型的参数等。这项工作提出了一个电视广告和一个先进的CNN原型,由基本的AlexNet原型提供信息。所建议的体系结构使用商业引擎、改进版本的洞察分离和不同的排列单元。此外,设计人员还提供了支持Mish和Rectified线性启动特性的GPU集成模型。与其他被认为是最先进的方法相比,所建议的方法具有相对较高的检测精度,同时消耗相对较少的计算机系统资源。本系统采用RTL Verilog硬件描述语言进行设计。提出的系统在Xilinx ISE Design Suite-13.1中实现,用于Spartan -6(目标设备)。合成工具优化了速度、面积和功率。
FPGA based Power-Efficient Convolutional Neural Network
Research into GPU (Graphics Processing Unit) acceleration using programmable logic arrays has mostly focused on Convolutional Neural Networks. These studies demonstrate the effectiveness of CNNs in various technical vision tasks, such as feature extraction, image analysis, face identification, and rear cross-traffic alert, amongst many others. As a result, there are restrictions on the times in which the CNN model can be implemented on FPGA, such as the restrictions on the quantity of on-chip memory, the dimensions of the CNN, and the parameters of the model. This work suggests a television commercial and an advanced CNN prototype informed by the basic AlexNet prototype. The proposed architecture uses a Commercial engine, an improved version of the insight separates, and a distinct permutation unit. In addition, the designers provide a GPU integration model that supports the Mish and Rectified linear initiation characteristics. The suggested method has a comparatively high detection accuracy while consuming a relatively little amount of the computer system's resources in contrast to other methods considered to be state-of-the-art. This proposed system is design in RTL Verilog Hardware description language. The proposed system is implemented in Xilinx ISE Design Suite-13.1 for use on Spartan -6 (Target Device). The synthesis tool optimized speed, area, and power.