J. Semião, David Saraiva, C. Leong, A. Romão, Marcelino B. Santos, I. Teixeira, João Paulo Teixeira
{"title":"Performance sensor for tolerance and predictive detection of delay-faults","authors":"J. Semião, David Saraiva, C. Leong, A. Romão, Marcelino B. Santos, I. Teixeira, João Paulo Teixeira","doi":"10.1109/DFT.2014.6962092","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962092","url":null,"abstract":"This paper presents the Scout Flip-Flop, a new performance Sensor for toleranCe and predictive detectiOn of delay-faUlTs in synchronous digital circuits. The sensor is based on a new master-slave Flip-Flop (FF), the Scout FF, with built-in functionality to locally (inside the FF) create two distinct guard-band windows: (1) a tolerance window, to increase tolerance to late transitions, making the Scout's master latch transparent during an additional predefined period after the clock trigger; and (2) a detection window, which starts before the clock edge trigger and persists during the tolerance window, to inform that performance and circuit functionality is at risk. When a PVTA (Process, power-supply Voltage, Temperature and Aging) variation occurs, circuit performance is affected and a delay-fault may occur. Hence, the existence of a tolerance window, introduces an extra time-slack by borrowing time from subsequent clock cycles. Moreover, as the predictive-error detection window starts prior to the clock edge trigger, it provides an additional safety margin and may be used to trigger corrective actions before real error occurrence, such as clock frequency reduction. Both tolerance and detection windows are defined by design and are sensitive to performance errors, increasing its size in worst PVTA conditions. Extensive SPICE simulations allowed characterizing the new flip-flop and simulation results are presented for 65nm CMOS technology, using Berkeley Predictive Technology Models (PTM), showing Scout's effectiveness on tolerance and predictive error detection.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125655225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
C. Constantinescu, Srini Krishnamoorthy, Tuyen Nguyen
{"title":"Estimating the effect of single-event upsets on microprocessors","authors":"C. Constantinescu, Srini Krishnamoorthy, Tuyen Nguyen","doi":"10.1109/DFT.2014.6962059","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962059","url":null,"abstract":"Evaluating the impact of single-event upsets (SEUs) on complex VLSI circuits in general, and microprocessors in particular, requires an interdisciplinary approach, that includes soft error modeling, accelerated measurements, derating of the raw error rates, and specialized design tools. This paper discusses modeling techniques employed to estimate the soft error rates (SER) of storage cells, provides results of accelerated measurements for three technology nodes, and presents a technique for derating the raw error rates by simulated error injection. We use the measurement results to validate and calibrate the models. Then present the tool employed for deriving the SER of the Advanced Micro Devices processor code-named “Bulldozer” and examples of estimated SER. Our approach enables the cost-effective mitigation of SEU by employing data integrity protection for the most sensitive logic.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"154 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133171669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Haghbayan, B. Alizadeh, A. Rahmani, P. Liljeberg, H. Tenhunen
{"title":"Automated formal approach for debugging dividers using dynamic specification","authors":"M. Haghbayan, B. Alizadeh, A. Rahmani, P. Liljeberg, H. Tenhunen","doi":"10.1109/DFT.2014.6962068","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962068","url":null,"abstract":"This paper presents a formal approach to verify and debug division circuits. The proposed technique is based on a reverse-engineering mechanism where a high-level model of the gate-level implementation is obtained and then an intermediate representation of the specification is introduced. This process makes equivalence checking between two models possible. The main advantage of this representation is the fact that the specification is dynamically changed according to the information obtained from the implementation. At the end, if two updated models are not equivalent, possible bugs can be localized and then corrected automatically by analyzing the difference. Experimental results show the robustness of the proposed approach in comparison with other contemporary methods in terms of the run time execution. The results also reveal that up to two orders of magnitude of average speedup can be obtained compared with the state-of-the-art.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"7 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114119955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A 12-bit 32MS/s SAR ADC using built-in self calibration technique to minimize capacitor mismatch","authors":"In-Seok Jung, Yong-Bin Kim","doi":"10.1109/DFT.2014.6962078","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962078","url":null,"abstract":"This paper presents a low-power 12-bit 32MS/s successive approximation register (SAR) analog-to-digital converter (ADC) using a single input with a novel Built-in Self Calibration (BiSC) feature to reduce the capacitor mismatch of the digital-to-analog converter (DAC) and to compensate the comparator input offset voltage. The proposed self-calibration scheme optimize the mismatch of the DAC by changing additional auxiliary capacitor array during calibration mode. In addition, in order to minimize the offset voltage of the comparator in the SAR ADC, a simplified voltage amplifier is proposed. The controller for the proposed algorithm operates as fore-ground operation so that low power consumption is achieved. Consequently, the mismatch error of the DAC can be minimized and the SAR based ADC operates without any extra power dissipation for the circuitry of self calibration during normal operation. The prototype was designed using 0.13μm single poly 6 metal standard CMOS technology. The ADC achieves a SNDR of 65.2 dB and consumes 3.57 mW with 1.2V supply voltage and sampling rate of 32 MS/s. Compared to the converters that use the conventional procedure, INL and DNL are reduced by about 45% and 51%, respectively.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"6 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128413123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design and implementation of a self-healing processor on SRAM-based FPGAs","authors":"M. Psarakis, A. Vavousis, C. Bolchini, A. Miele","doi":"10.1109/DFT.2014.6962076","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962076","url":null,"abstract":"This paper presents an approach to design and implement a soft-core processor on SRAM-based FPGAs able to autonomously deal with the occurrence of soft errors; state-of-the-art area-replication strategies are coupled with dynamic partial reconfiguration to detect faults and to consequently repair them. The reconfiguration process is performed by the processor itself using a minimum set of \"critical\" instructions and the logic responsible for their execution is hardened, to enable the self-healing property. The methodology is applied to the OpenRISC processor, evaluating costs and benefits.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133121786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Machine learning-based techniques for incremental functional diagnosis: A comparative analysis","authors":"C. Bolchini, Luca Cassano","doi":"10.1109/DFT.2014.6962064","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962064","url":null,"abstract":"Incremental functional diagnosis is the process of iteratively selecting a test, executing it and based on the collected outcome deciding either to execute one more test or to stop the process since a faulty candidate component can be identified. The aim is to minimise the cost and the duration of the diagnosis process. In this paper we compare six engines based on machine learning techniques for driving the diagnosis. The comparison has been carried out under a twofold point of view: on the one hand, we analysed the issues related to the use of the considered techniques for the design of incremental diagnosis engines; on the other hand, we carried out a set of experiments on three synthetic but realistic scenarios to assess accuracy and efficiency.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132862782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Daniel Oliveira, P. Rech, L. Pilla, P. Navaux, L. Carro
{"title":"GPGPUs ECC efficiency and efficacy","authors":"Daniel Oliveira, P. Rech, L. Pilla, P. Navaux, L. Carro","doi":"10.1109/DFT.2014.6962085","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962085","url":null,"abstract":"In this paper we assess and discuss the efficiency and overhead of the Error-Correcting Code (ECC) mechanism available on modern GPGPUs, which are increasingly used for both High Performance Computing and safety-critical applications. Both the resilience to radiation-induced silent data corruption and functional interruption are experimentally and analytically addressed. The provided experimental analysis demonstrates that the ECC significantly reduces the occurrence of silent data corruption but may not be sufficient to guarantee high reliability. Moreover, the ECC increases the GPGPU functional interruption rate. Finally, the ECC performances and reliability are compared to Algorithm-Based Fault Tolerance and Duplication With Comparison strategies.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133653424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tiago A. O. Alves, L. A. J. Marzulo, S. Kundu, F. França
{"title":"Domino effect protection on dataflow error detection and recovery","authors":"Tiago A. O. Alves, L. A. J. Marzulo, S. Kundu, F. França","doi":"10.1109/DFT.2014.6962087","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962087","url":null,"abstract":"Dataflow Error Detection and Recovery (DFER) was shown to be a good approach to address errors in the scope of parallel programming. Previous work showed that this technique presents good performance by imposing reduced overhead in error-free executions. However, in the presence of errors excessive rollbacks may occur, characterizing the Domino Effect. In this paper we propose a scheme that addresses this issue by protecting execution from the Domino Effect. Our experimental results show that without adding any significant overheads to the original DFER version we are able to reduce in up to 40% the total execution time in situations where errors are detected. Furthermore, since there are no significant overheads, the execution time in error-free situations remains the same as in the baseline.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133872258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Decreasing FIT with diverse triple modular redundancy in SRAM-based FPGAs","authors":"L. Tambara, F. Kastensmidt, P. Rech, C. Frost","doi":"10.1109/DFT.2014.6962070","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962070","url":null,"abstract":"This paper explores the concept of Design Diversity Redundancy applied to SRAM-based FPGAs as a proposal to decrease failure rate. A 32-bit RISC processor MIPS was protected by coarse grain Triple Modular Redundancy (TMR) and by Diverse TMR (DTMR). Experimental results under neutron flux radiation show that DTMR can reduce in 40% the Failure in Time (FIT) of a system when compared to the standard MIPS while the coarse gain TMR could reduce the FIT in only 10%.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117057905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
V. Tomashevich, Yaara Neumeier, Raghavan Kumar, O. Keren, I. Polian
{"title":"Protecting cryptographic hardware against malicious attacks by nonlinear robust codes","authors":"V. Tomashevich, Yaara Neumeier, Raghavan Kumar, O. Keren, I. Polian","doi":"10.1109/DFT.2014.6962084","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962084","url":null,"abstract":"Fault-based attacks against cryptographic circuits must be addressed by techniques that are different from approaches designed for random transient faults. We systematically investigate robust error-detecting codes that specifically target malicious attacks and guarantee minimal bounds on detection probability. Our study is based on FPGA-supported fault-injection campaigns on the circuit implementation of a recent lightweight block cipher and its sub-modules. We quantify the detection capabilities of different robust and non-robust codes with respect to both random faults and malicious attacks, as well as the required overheads. For the first time, we report performance of a novel punctured cubic code on actual cryptographic circuitry. Experimental results show that robust codes with a certain number of redundant bits have better detection properties in security context and higher predictability than their conventional linear counterparts.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132082867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}