Ahmed Nasser, Karim Ahmed Fadel, Karim Abbas, K. Ahmed, Mohamed Abdelsalam, Mahmoud Gaber
{"title":"基于CNN的人工智能加速器在硬件仿真和FPGA原型中的自动配置和生成流程","authors":"Ahmed Nasser, Karim Ahmed Fadel, Karim Abbas, K. Ahmed, Mohamed Abdelsalam, Mahmoud Gaber","doi":"10.1109/icecs53924.2021.9665606","DOIUrl":null,"url":null,"abstract":"Machine learning (ML) algorithms have proven to be a concrete component in various fields that aim to be fully automated. Therefore, many researchers have shed the light on the modifications of ML algorithms to be fully automated for more complicated tasks. However, the acceleration of such algorithms is extremely hard due to the high computations and memory required. This paper implements automated flow using Perl scripts and generated LeNet-5 (A Convolutional Neural Network Model). Our target is high throughput, configurable and scalable RTL design that is generated by Perl scripts. Our flow is designing and verifying using Veloce emulator.","PeriodicalId":448558,"journal":{"name":"2021 28th IEEE International Conference on Electronics, Circuits, and Systems (ICECS)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Automated Flow for Configuration and Generation of CNN based AI accelerators for HW Emulation & FPGA Prototyping\",\"authors\":\"Ahmed Nasser, Karim Ahmed Fadel, Karim Abbas, K. Ahmed, Mohamed Abdelsalam, Mahmoud Gaber\",\"doi\":\"10.1109/icecs53924.2021.9665606\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning (ML) algorithms have proven to be a concrete component in various fields that aim to be fully automated. Therefore, many researchers have shed the light on the modifications of ML algorithms to be fully automated for more complicated tasks. However, the acceleration of such algorithms is extremely hard due to the high computations and memory required. This paper implements automated flow using Perl scripts and generated LeNet-5 (A Convolutional Neural Network Model). Our target is high throughput, configurable and scalable RTL design that is generated by Perl scripts. Our flow is designing and verifying using Veloce emulator.\",\"PeriodicalId\":448558,\"journal\":{\"name\":\"2021 28th IEEE International Conference on Electronics, Circuits, and Systems (ICECS)\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 28th IEEE International Conference on Electronics, Circuits, and Systems (ICECS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icecs53924.2021.9665606\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 28th IEEE International Conference on Electronics, Circuits, and Systems (ICECS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icecs53924.2021.9665606","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Automated Flow for Configuration and Generation of CNN based AI accelerators for HW Emulation & FPGA Prototyping
Machine learning (ML) algorithms have proven to be a concrete component in various fields that aim to be fully automated. Therefore, many researchers have shed the light on the modifications of ML algorithms to be fully automated for more complicated tasks. However, the acceleration of such algorithms is extremely hard due to the high computations and memory required. This paper implements automated flow using Perl scripts and generated LeNet-5 (A Convolutional Neural Network Model). Our target is high throughput, configurable and scalable RTL design that is generated by Perl scripts. Our flow is designing and verifying using Veloce emulator.