{"title":"一种可高度容忍硬件缺陷的深度神经网络推理加速器训练方法","authors":"Shuchao Gao, Takashi Ohsawa","doi":"10.35848/1347-4065/ad1895","DOIUrl":null,"url":null,"abstract":"We propose a novel training method named hardware-conscious software training (HCST) for deep neural network inference accelerators to recover the accuracy degradation due to their hardware imperfections. The proposed training method is totally conducted by software whose forward inference path and backpropagation reflect the hardware imperfections, overcoming the problems of the limited endurance, the nonlinearity and the asymmetry for the switching of the nonvolatile memories used in weights and biases. The HCST reformulates the mathematical expressions in the forward propagation and the gradient calculation with the backpropagation so that it replicates the hardware structure under the influence of variations in the chip fabrication process. The effectiveness of this approach is validated through the MNIST dataset experiments to manifest its capability to restore the accuracies. A circuit design is also disclosed for measuring the offset voltages and the open loop gains of the operational amplifiers used in the accelerator.","PeriodicalId":14741,"journal":{"name":"Japanese Journal of Applied Physics","volume":"12 6","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2023-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A training method for deep neural network inference accelerators with high tolerance for their hardware imperfection\",\"authors\":\"Shuchao Gao, Takashi Ohsawa\",\"doi\":\"10.35848/1347-4065/ad1895\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a novel training method named hardware-conscious software training (HCST) for deep neural network inference accelerators to recover the accuracy degradation due to their hardware imperfections. The proposed training method is totally conducted by software whose forward inference path and backpropagation reflect the hardware imperfections, overcoming the problems of the limited endurance, the nonlinearity and the asymmetry for the switching of the nonvolatile memories used in weights and biases. The HCST reformulates the mathematical expressions in the forward propagation and the gradient calculation with the backpropagation so that it replicates the hardware structure under the influence of variations in the chip fabrication process. The effectiveness of this approach is validated through the MNIST dataset experiments to manifest its capability to restore the accuracies. A circuit design is also disclosed for measuring the offset voltages and the open loop gains of the operational amplifiers used in the accelerator.\",\"PeriodicalId\":14741,\"journal\":{\"name\":\"Japanese Journal of Applied Physics\",\"volume\":\"12 6\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2023-12-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Japanese Journal of Applied Physics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.35848/1347-4065/ad1895\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PHYSICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Japanese Journal of Applied Physics","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.35848/1347-4065/ad1895","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHYSICS, APPLIED","Score":null,"Total":0}
A training method for deep neural network inference accelerators with high tolerance for their hardware imperfection
We propose a novel training method named hardware-conscious software training (HCST) for deep neural network inference accelerators to recover the accuracy degradation due to their hardware imperfections. The proposed training method is totally conducted by software whose forward inference path and backpropagation reflect the hardware imperfections, overcoming the problems of the limited endurance, the nonlinearity and the asymmetry for the switching of the nonvolatile memories used in weights and biases. The HCST reformulates the mathematical expressions in the forward propagation and the gradient calculation with the backpropagation so that it replicates the hardware structure under the influence of variations in the chip fabrication process. The effectiveness of this approach is validated through the MNIST dataset experiments to manifest its capability to restore the accuracies. A circuit design is also disclosed for measuring the offset voltages and the open loop gains of the operational amplifiers used in the accelerator.
期刊介绍:
The Japanese Journal of Applied Physics (JJAP) is an international journal for the advancement and dissemination of knowledge in all fields of applied physics. JJAP is a sister journal of the Applied Physics Express (APEX) and is published by IOP Publishing Ltd on behalf of the Japan Society of Applied Physics (JSAP).
JJAP publishes articles that significantly contribute to the advancements in the applications of physical principles as well as in the understanding of physics in view of particular applications in mind. Subjects covered by JJAP include the following fields:
• Semiconductors, dielectrics, and organic materials
• Photonics, quantum electronics, optics, and spectroscopy
• Spintronics, superconductivity, and strongly correlated materials
• Device physics including quantum information processing
• Physics-based circuits and systems
• Nanoscale science and technology
• Crystal growth, surfaces, interfaces, thin films, and bulk materials
• Plasmas, applied atomic and molecular physics, and applied nuclear physics
• Device processing, fabrication and measurement technologies, and instrumentation
• Cross-disciplinary areas such as bioelectronics/photonics, biosensing, environmental/energy technologies, and MEMS