{"title":"使用数字时间乘法器的时域乘加器用于CNN处理器在28纳米CMOS","authors":"Xutong Wu, T. Siriburanon, R. Staszewski","doi":"10.1109/ISSC49989.2020.9180202","DOIUrl":null,"url":null,"abstract":"This paper proposes a time-domain multiply-accumulator (MAC) circuit exploiting the use of a digital-to-time multiplier (DTM) unit for performing convolutional operations and designed in 28nm CMOS. As the foundational calculation of state-of-the-art convolutional neural network (CNN) models, convolutional operation could normally be executed millions of times in one CNN task. The proposed circuit is designed to support this large computational resource requirement during the CNN computations. It is running in time-domain with 6-bit resolution (1 sign bit) and performs calculations based on corresponding time delays. Compared with other analog-domain propositions, time-domain designs perform better, with higher operating frequencies up to 50 MHz. In schematic simulations, the proposed DTM unit, operating with 5 bits, achieves 0.1 GOPS ideal throughput and consumes 74.79 µW at 1.0V supply.","PeriodicalId":351013,"journal":{"name":"2020 31st Irish Signals and Systems Conference (ISSC)","volume":"256 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Time-Domain Multiply-Accumulator using Digital-to-Time Multiplier for CNN Processors in 28-nm CMOS\",\"authors\":\"Xutong Wu, T. Siriburanon, R. Staszewski\",\"doi\":\"10.1109/ISSC49989.2020.9180202\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a time-domain multiply-accumulator (MAC) circuit exploiting the use of a digital-to-time multiplier (DTM) unit for performing convolutional operations and designed in 28nm CMOS. As the foundational calculation of state-of-the-art convolutional neural network (CNN) models, convolutional operation could normally be executed millions of times in one CNN task. The proposed circuit is designed to support this large computational resource requirement during the CNN computations. It is running in time-domain with 6-bit resolution (1 sign bit) and performs calculations based on corresponding time delays. Compared with other analog-domain propositions, time-domain designs perform better, with higher operating frequencies up to 50 MHz. In schematic simulations, the proposed DTM unit, operating with 5 bits, achieves 0.1 GOPS ideal throughput and consumes 74.79 µW at 1.0V supply.\",\"PeriodicalId\":351013,\"journal\":{\"name\":\"2020 31st Irish Signals and Systems Conference (ISSC)\",\"volume\":\"256 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 31st Irish Signals and Systems Conference (ISSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSC49989.2020.9180202\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 31st Irish Signals and Systems Conference (ISSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSC49989.2020.9180202","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Time-Domain Multiply-Accumulator using Digital-to-Time Multiplier for CNN Processors in 28-nm CMOS
This paper proposes a time-domain multiply-accumulator (MAC) circuit exploiting the use of a digital-to-time multiplier (DTM) unit for performing convolutional operations and designed in 28nm CMOS. As the foundational calculation of state-of-the-art convolutional neural network (CNN) models, convolutional operation could normally be executed millions of times in one CNN task. The proposed circuit is designed to support this large computational resource requirement during the CNN computations. It is running in time-domain with 6-bit resolution (1 sign bit) and performs calculations based on corresponding time delays. Compared with other analog-domain propositions, time-domain designs perform better, with higher operating frequencies up to 50 MHz. In schematic simulations, the proposed DTM unit, operating with 5 bits, achieves 0.1 GOPS ideal throughput and consumes 74.79 µW at 1.0V supply.