Lei Xie, Hoang Anh Du Nguyen, Jintao Yu, A. Kaichouhi, M. Taouil, M. Alfailakawi, S. Hamdioui
{"title":"Scouting Logic: A Novel Memristor-Based Logic Design for Resistive Computing","authors":"Lei Xie, Hoang Anh Du Nguyen, Jintao Yu, A. Kaichouhi, M. Taouil, M. Alfailakawi, S. Hamdioui","doi":"10.1109/ISVLSI.2017.39","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.39","url":null,"abstract":"Memristor technology is a promising alternative to CMOS due to its high integration density, near-zero standby power, and ability to implement novel resistive computing. One of the major limitations of these architectures is the limited endurance of memristor devices, especially when a logic gate requires multiple steps/switching to execute the logic operations. To alleviate the endurance requirement and improve the performance, we present a novel logic design style, called scouting logic that executes any logic gate by only reading the memristor devices and without changing their states. Hence, no impact on the memristors' endurance. The proposed design is implemented using two styles (current and voltage based). To illustrate the performance of scouting logic based designs, the area, delay, and power consumption are analyzed and compared with state-ofthe- art. The results show that scouting logic improves the delay and power consumption by at least a factor of 2.3, while having similar or less area overhead. Finally, we discuss the potential applications and challenges of scouting logic.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129733606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unified Model for Contrast Enhancement and Denoising","authors":"A. P. James, O. Krestinskaya, J. Mathew","doi":"10.1109/ISVLSI.2017.73","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.73","url":null,"abstract":"In this paper, we attempt a challenging task to unify two important complementary operations, i.e. contrast enhancement and denoising, that is required in most image processing applications. The proposed method is implemented using practical analog circuit configurations that can lead to near real-time processing capabilities useful to be integrated with vision sensors. Metrics used for performance includes estimation of Residual Noise Level (RNL), Structural Similarity Index Measure (SSIM), Output-to-Input Contrast Ratio (CRo_i), and its combined score (SCD). The class of contrast stretching methods has resulted in higher noise levels (RNL ≥ 7) along with increased contrast measures (CRo-i ≥ eight times than that of the input image) and SSIM ≤ 0.52. Denoising methods generates images with lesser noise levels (RNL ≤ 0.2308), poor contrast enhancements (CRo-i ≤ 1.31) and with best structural similarity (SSIM ≥ 0.85). In contrast, the proposed model offers best contrast stretching (CRo-i = 5.83), least noise (RNL = 0.02), a descent structural similarity (SSIM = 0.6453) and the highest combined score (SCD = 169).","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130540696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jordan Morris, Pranay Prabhat, James Myers, A. Yakovlev
{"title":"Unconventional Layout Techniques for a High Performance, Low Variability Subthreshold Standard Cell Library","authors":"Jordan Morris, Pranay Prabhat, James Myers, A. Yakovlev","doi":"10.1109/ISVLSI.2017.14","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.14","url":null,"abstract":"A novel subthreshold sizing strategy utilizing the Inverse Narrow Width Effect is demonstrated that has the largest range of propagation delays within the same cell footprint and lowest variability of any subthreshold sizing strategy thus far proposed. Simulation results and ring oscillators implemented in a commercial 65nm low power process confirm a propagation delay improvement of up to 1.95X over the standard superthreshold sizing strategy at 300mV and below. 32 bit multipliers are then synthesized and static timing analysis performed at several subthreshold voltage corners to illustrate applicability to real designs in conventional EDA design flows.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129368640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Srinivasa, K. Mohan, Wei-Hao Chen, Kuo-Hsiang Hsu, Xueqing Li, Meng-Fan Chang, S. Gupta, J. Sampson, N. Vijaykrishnan
{"title":"Improving FPGA Design with Monolithic 3D Integration Using High Dense Inter-Stack Via","authors":"S. Srinivasa, K. Mohan, Wei-Hao Chen, Kuo-Hsiang Hsu, Xueqing Li, Meng-Fan Chang, S. Gupta, J. Sampson, N. Vijaykrishnan","doi":"10.1109/ISVLSI.2017.31","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.31","url":null,"abstract":"This paper proposes to use the high density of vias enabled by monolithic 3D integration to produce multi-stack FPGA designs with improved performance and functionality. The use of fine grain vertical interconnects enables reconfiguration of FPGA logic within a few clock cycles, as shown in our design that features dynamic reconfiguration capabilities through the use of a pair of configuration memories on the upper stack. Along with the reconfigurability feature, results show that our SLICE design offers an area reduction of 23% compared to a standard design without reconfiguration capability. Our analysis of FPGA switch box logic and physical design with M3D vias provides insights into the sources of benefits from vertical routing in a multi-stacked design. We also discuss the design overheads involved in incorporating multiple inter-stack vias for better and faster communication among logic routed in different design stacks.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130427920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Farhana Parveen, Zhezhi He, Shaahin Angizi, Deliang Fan
{"title":"Hybrid Polymorphic Logic Gate with 5-Terminal Magnetic Domain Wall Motion Device","authors":"Farhana Parveen, Zhezhi He, Shaahin Angizi, Deliang Fan","doi":"10.1109/ISVLSI.2017.35","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.35","url":null,"abstract":"In this paper, a key-controlled hybrid spin-CMOS polymorphic logic gate using a novel 5 terminal magnetic domain wall motion device is proposed. The proposed hybrid polymorphic gate is able to perform a full set of 2-input Boolean logic functions (i.e. AND/NAND, OR/NOR, NOT, XOR/XNOR) by configuring the applied keys. The SPICE device-circuit co-simulation indicates that a full adder design using our proposed polymorphic logic gate shows 74.23% power reduction and 7.14% transistor count reduction compared with traditional CMOS full adder design. Our proposed polymorphic gate could be a promising hardware security primitive to address IC counterfeiting or reverse engineering by logic locking and polymorphic transformation. To summarize, by providing zero leakage power, low dynamic power consumption, compactness and polymorphism to logic circuits, our proposed design can thrive a new paradigm for future power efficient and secured computing platform.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115194667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ruizhe Zhao, W. Luk, Xinyu Niu, Huifeng Shi, Haitao Wang
{"title":"Hardware Acceleration for Machine Learning","authors":"Ruizhe Zhao, W. Luk, Xinyu Niu, Huifeng Shi, Haitao Wang","doi":"10.1109/ISVLSI.2017.127","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.127","url":null,"abstract":"This paper presents an approach to enhance the performance of machine learning applications based on hardware acceleration. This approach is based on parameterised architectures designed for Convolutional Neural Network (CNN) and Support Vector Machine (SVM), and the associated design flow common to both. This approach is illustrated by two case studies including object detection and satellite data analysis. The potential of the proposed approach is presented.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127242062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis of RRAM Reliability Soft-Errors on the Performance of RRAM-Based Neuromorphic Systems","authors":"Amr M. S. Tosson, Shimeng Yu, M. Anis, Lan Wei","doi":"10.1109/ISVLSI.2017.20","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.20","url":null,"abstract":"Due to the limitation in speed and throughput of the traditional Von Neumann architecture, the interest in braininspired neuromorphic systems has been the focus of recent research activities. RRAM device has been extensively used as synapses in neuromorphic systems due to its many advantages including small size and compatibility with CMOS fabrication process. However, the RRAM device suffers from reliability soft-errors resulting from the stochastic nature of the oxygen vacancies of its conductive filaments. In this article, for the first time, using a combination of SPICE-based and BRIAN-based simulations, a novel framework is developed to model and assess the impact of RRAM reliability soft-errors on the performance of the neuromorphic systems. Simulation results show that the accuracy of a multi-perceptron RRAM-based neuromorphic system drops from 91.6% to 43% when the reliability softerrors are considered. To overcome this degradation in the system performance, a detailed analysis is conducted to modify the way the RRAM resistive state changes. In addition to this, a list of recommendations for the design of neuromorphic systems is also provided to overcome the RRAM reliability soft-errors.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133695304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shaahin Angizi, Zhezhi He, Farhana Parveen, Deliang Fan
{"title":"RIMPA: A New Reconfigurable Dual-Mode In-Memory Processing Architecture with Spin Hall Effect-Driven Domain Wall Motion Device","authors":"Shaahin Angizi, Zhezhi He, Farhana Parveen, Deliang Fan","doi":"10.1109/ISVLSI.2017.18","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.18","url":null,"abstract":"This paper presents a new Reconfigurable dualmode In-Memory Processing Architecture based on spin Hall effect-driven domain wall motion device called RIMPA. In this architecture, a portion of spintronic memory array can be reconfigured to either non-volatile memory or in-memory logic. Accordingly, computation can be performed within memory without long distance data transfer or large in-memory logic area overhead concerning conventional Von-Neumann or in-memory computing architecture, respectively. The device to architecture simulation results show that, with 17% area increase, RIMPA improves the operating energy by 72.2% as compared with the conventional non-volatile in-memory logic schemes. We show that the Advanced Encryption Standard (AES) algorithm which is widely used in secure big data storage, can be efficiently mapped to RIMPA with 68.8% and 20.8% energy saving in comparison to CMOS-ASIC and recent DW-AES implementations, respectively.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124446505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Tahoori, S. Nair, R. Bishnoi, S. Senni, Jad Mohdad, F. Mailly, L. Torres, P. Benoit, P. Nouet, R. Ma, M. Kreissig, F. Ellinger, K. Jabeur, P. Vanhauwaert, G. D. Pendina, G. Prenat
{"title":"GREAT: HeteroGeneous IntegRated Magnetic tEchnology Using Multifunctional Standardized sTack","authors":"M. Tahoori, S. Nair, R. Bishnoi, S. Senni, Jad Mohdad, F. Mailly, L. Torres, P. Benoit, P. Nouet, R. Ma, M. Kreissig, F. Ellinger, K. Jabeur, P. Vanhauwaert, G. D. Pendina, G. Prenat","doi":"10.1109/ISVLSI.2017.67","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.67","url":null,"abstract":"To tackle the key issues of monolithic heterogeneous integration, fast yet low power processing, high integration density, fast yet low power storage, the goal of the GREAT project is to co-integrate multiple functions like sensors (“Sensing”), RF receivers (“Communicating”) and logic/memory (“Processing/Storing”) together within CMOS by adapting the STT-MTJs (Magnetic devices) to a single baseline technology enabling logic, memory, and analog functions in the same System-on-Chip (SoC) as the enabling technology platform for Internet of Things (IoT). This will lead to a unique STT-MTJ cell technology called Multifunctional Standardized Stack (MSS). The major outputs of GREAT are the technology and the architecture platform for IoT SoCs providing better integration of embedded & mobile communication systems and a significant decrease of their power consumption. Based on the STT-MTJs (now viewed as the most suitable technology for digital applications and with a huge potential for analog subsystems) unique set of performances (non-volatility, high speed, infinite endurance and moderate read/write power), GREAT will achieve the same goal as heterogeneous integration of devices but in a much simpler way since the MSS will enable different functions using the same technology.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"444 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116513825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Shafique, R. Hafiz, M. Javed, Sarmad Abbas, L. Sekanina, Z. Vašíček, Vojtěch Mrázek
{"title":"Adaptive and Energy-Efficient Architectures for Machine Learning: Challenges, Opportunities, and Research Roadmap","authors":"M. Shafique, R. Hafiz, M. Javed, Sarmad Abbas, L. Sekanina, Z. Vašíček, Vojtěch Mrázek","doi":"10.1109/ISVLSI.2017.124","DOIUrl":"https://doi.org/10.1109/ISVLSI.2017.124","url":null,"abstract":"Gigantic rates of data production in the era of Big Data, Internet of Thing (IoT) / Internet of Everything (IoE), and Cyber Physical Systems (CSP) pose incessantly escalating demands for massive data processing, storage, and transmission while continuously interacting with the physical world under unpredictable, harsh, and energy-/power-constrained scenarios. Therefore, such systems need to support not only the high performance capabilities at tight power/energy envelop, but also need to be intelligent/cognitive, self-learning, and robust. As a result, a hype in the artificial intelligence research (e.g., deep learning and other machine learning techniques) has surfaced in numerous communities. This paper discusses the challenges and opportunities for building energy-efficient and adaptive architectures for machine learning. In particular, we focus on brain-inspired emerging computing paradigms, such as approximate computing; that can further reduce the energy requirements of the system. First, we guide through an approximate computing based methodology for development of energy-efficient accelerators, specifically for convolutional Deep Neural Networks (DNNs). We show that in-depth analysis of datapaths of a DNN allows better selection of Approximate Computing modules for energy-efficient accelerators. Further, we show that a multi-objective evolutionary algorithm can be used to develop an adaptive machine learning system in hardware. At the end, we summarize the challenges and the associated research roadmap that can aid in developing energy-efficient and adaptable hardware accelerators for machine learning.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126492969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}