{"title":"Assessing the Reliability of FPGA-Based Quantized Neural Networks Under Neutron Irradiation","authors":"Ioanna Souvatzoglou;Dimitris Agiakatsikas;Vasileios Vlagkoulis;Aitzan Sari;Mihalis Psarakis;Maria Kastriotou;Carlo Cazzaniga","doi":"10.1109/TNS.2024.3491503","DOIUrl":null,"url":null,"abstract":"SRAM field-programmable gate arrays (FPGAs) are popular computing platforms for implementing neural networks (NNs) due to their flexibility and low recurring engineering costs. Nevertheless, reliability concerns arise due to their susceptibility to radiation effects, especially considering high-altitude or -scalability applications. In this work, we explore the resilience of quantized, and especially binarized (e.g., use binary values to represent the weights) and nearly binarized, FPGA NNs to neutron-induced errors. Specifically, we study the impact of various NN design parameters, such as the degree of quantization and parallelization, the type of memory for storing the weights, and the diversity of the input images on the NN reliability. To achieve this, we exposed to accelerated atmospheric-like neutron radiation a large set of NN designs built through FINN, a state-of-the-art development framework targeting FPGA NN accelerators, using different quantization, folding, and weight storage schemes. We examine how these parameters affect the tradeoff between area, performance, and reliability, calculating various metrics such as the dynamic cross section (DCS), failures in time (FIT), and mean executions between failures (MEBF). Our findings show that less quantization (i.e., larger bit precision), as well as less folding (i.e., more parallelization), leads to improved reliability (i.e., lower FIT and higher MEBF). In contrast, these two factors push the performance and area of the NNs in opposite directions (i.e., less quantization and folding result in lower latency but more FPGA resources). Moreover, the results reveal that the choice of the memory type (i.e., block RAM (BRAM) or distributed RAM) for storing the NN weights impacts the reliability metrics, with the mixed memory configuration providing the highest reliability.","PeriodicalId":13406,"journal":{"name":"IEEE Transactions on Nuclear Science","volume":"71 12","pages":"2565-2577"},"PeriodicalIF":1.9000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Nuclear Science","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10757344/","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
SRAM field-programmable gate arrays (FPGAs) are popular computing platforms for implementing neural networks (NNs) due to their flexibility and low recurring engineering costs. Nevertheless, reliability concerns arise due to their susceptibility to radiation effects, especially considering high-altitude or -scalability applications. In this work, we explore the resilience of quantized, and especially binarized (e.g., use binary values to represent the weights) and nearly binarized, FPGA NNs to neutron-induced errors. Specifically, we study the impact of various NN design parameters, such as the degree of quantization and parallelization, the type of memory for storing the weights, and the diversity of the input images on the NN reliability. To achieve this, we exposed to accelerated atmospheric-like neutron radiation a large set of NN designs built through FINN, a state-of-the-art development framework targeting FPGA NN accelerators, using different quantization, folding, and weight storage schemes. We examine how these parameters affect the tradeoff between area, performance, and reliability, calculating various metrics such as the dynamic cross section (DCS), failures in time (FIT), and mean executions between failures (MEBF). Our findings show that less quantization (i.e., larger bit precision), as well as less folding (i.e., more parallelization), leads to improved reliability (i.e., lower FIT and higher MEBF). In contrast, these two factors push the performance and area of the NNs in opposite directions (i.e., less quantization and folding result in lower latency but more FPGA resources). Moreover, the results reveal that the choice of the memory type (i.e., block RAM (BRAM) or distributed RAM) for storing the NN weights impacts the reliability metrics, with the mixed memory configuration providing the highest reliability.
期刊介绍:
The IEEE Transactions on Nuclear Science is a publication of the IEEE Nuclear and Plasma Sciences Society. It is viewed as the primary source of technical information in many of the areas it covers. As judged by JCR impact factor, TNS consistently ranks in the top five journals in the category of Nuclear Science & Technology. It has one of the higher immediacy indices, indicating that the information it publishes is viewed as timely, and has a relatively long citation half-life, indicating that the published information also is viewed as valuable for a number of years.
The IEEE Transactions on Nuclear Science is published bimonthly. Its scope includes all aspects of the theory and application of nuclear science and engineering. It focuses on instrumentation for the detection and measurement of ionizing radiation; particle accelerators and their controls; nuclear medicine and its application; effects of radiation on materials, components, and systems; reactor instrumentation and controls; and measurement of radiation in space.