Andreas Moshovos, Jorge Albericio, Patrick Judd, A. Delmas, Sayeh Sharify, M. Mahmoud, Tayler H. Hetherington, M. Nikolic, Dylan Malone Stuart, Kevin Siu, Zissis Poulos, Tor M. Aamodt, Natalie D. Enright Jerger
{"title":"Identifying and Exploiting Ineffectual Computations to Enable Hardware Acceleration of Deep Learning","authors":"Andreas Moshovos, Jorge Albericio, Patrick Judd, A. Delmas, Sayeh Sharify, M. Mahmoud, Tayler H. Hetherington, M. Nikolic, Dylan Malone Stuart, Kevin Siu, Zissis Poulos, Tor M. Aamodt, Natalie D. Enright Jerger","doi":"10.1109/NEWCAS.2018.8585656","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585656","url":null,"abstract":"This article summarizes somde of our work on hardware accelerators for inference with Deep Learning Neural Networks (DNNs). Early success in hardware acceleration for DNNs exploited the computation structure and the significant reuse in their access stream. Our approach to further boost benefits has been to first identify properties in the value stream of DNNs which we can exploit at the hardware level to improve execution time, reduce off- and on-chip communication and storage, resulting in higher energy efficiency and execution time reduction. We have been focusing on properties that are difficult or impossible to discern in advance. These properties include values that are zero or near zero and that prove ineffectual, values that have reduced precision needs, or even the bit-level content of values that lead to ineffectual computations. The presented designs cover a spectrum of choices in terms of area cost, energy efficiency, and relative execution time performance and target a variety of hardware devices from embedded systems to server class machines. A key characteristic of these designs is that they reward but do not requires advances in model design that increase the aforementioned properties (such as reduced precision or sparsity) and thus provide a safe path to innovation.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134456200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wide Input Range Single Feed RF Energy Harvester","authors":"S. Nagaveni, Bibhudatta Sahoo, A. Dutta","doi":"10.1109/NEWCAS.2018.8585489","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585489","url":null,"abstract":"This paper presents a hybrid single-feed CMOS rectifier for RF energy harvesting at 2.4 GHz. The proposed system achieves high power conversion efficiency across a wide input power range by reconfiguring the power paths. The architecture incorporates 3-stage reconfigurable hybrid rectifiers and each stage as regular Vth MOS rectifier for high input power and low Vth MOS rectifier for low input power. Both the rectifiers are connected in a cross-coupled manner which results in small ON-resistance and low reverse leakage current during forward and reverse biased conditions, respectively. The system designed in UMC $0.18 mu mathrm {m}$ CMOS technology for Wi-Fi frequency (2.4 GHz). The post layout simulation results shows an efficiency $ gt 30%$ for input power levels ranging from -24 dBm to -10 dBm while driving a load of $40 mathrm {K}Omega.$","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133026856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Madec, Alexi Bonament, Elise Rosati, L. Hébrard, C. Lallement
{"title":"Virtual prototyping of biosensors involving reaction- diffusion phenomena","authors":"M. Madec, Alexi Bonament, Elise Rosati, L. Hébrard, C. Lallement","doi":"10.1109/NEWCAS.2018.8585653","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585653","url":null,"abstract":"The topic of this paper is the coupling of different tools developed to promote the interface between electronics and biology and its potential for the design of biosensors. More precisely, we put together existing models of electronic devices, a tool (BB-SPICE) that converts descriptions of biological systems into SPICE models and a new tool developed for the compact modeling of biochemical diffusion. This complete virtual prototyping environment is illustrated on an example: a penicillin sensor. Simulations highlight the most critical features that impact the response of the system and can be used to drive designers’ choices.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123086309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"THz Ballistic Deflection Transistor Travelling Wave Amplifier Design with THz Ring Hybrid Coupler","authors":"Huan Wang, J. Millithaler, M. Margala, R. Knepper","doi":"10.1109/NEWCAS.2018.8585496","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585496","url":null,"abstract":"In this paper a Terahertz (THz) Ballistic Deflection Transistor (BDT) Travelling-Wave-Amplifier (TWA) with a THz Ring Hybrid Coupler is developed. The BDT is a novel device based on InGaAs/InAlAs/InP, which is capable of operating at THz frequencies. With the newly developed THz Ring Hybrid Coupler based on the Parallel Plate Dielectric Waveguide with Signal Line (PPDWS) implemented, we are able to increase the ADS simulated gain of a 24-stage BDT TWA to 17dB at 1.02– 1.14THz. This result compensates the low trans conductance of the BDT and shows the BDT TWA as a leading candidate for future THz Amplifier Designs.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123724574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A 0.35V 12.9pW 8.3ppm°C 0.012%/V Feedbackcontrolled Voltage Reference in 65 nm CMOS","authors":"A. Azam, Zhidong Bai, D. Korth, J. Walling","doi":"10.1109/NEWCAS.2018.8585688","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585688","url":null,"abstract":"This paper presents an ultra-low-power voltage reference which works on the principle of cancellation of temperature dependency on sub-threshold current. The feedback control which is based on bulk-voltage compensation, not only suppresses the second order temperature dependency but also enables it to achieve a much higher output voltage using supply voltages as low as 0.35 volt. The closed-loop control ensures reliable performance over all process corners in sub-threshold operation. The proposed design achieves a line sensitivity of 0.004 %/V at 20 °C and power supply rejection ratio (PSRR) of -45 dB with total power consumption of 12.9 pW. The temperature coefficient is 8.3 ppm/°C over a temperature range of -10-110°C and a supply range of 0.35-2.5V. It is the first complete feedbackcontrolled pico-watt voltage reference which has such stable temperature coefficient and line sensitivity while having an output voltage (293.5 mV) comparable to the supply voltage even if it is operated with a supply voltage as low as 350 mV.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125784802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Islam Ahmed, Ahmed Kamaleldin, H. Mostafa, A. Mohieldin
{"title":"Utilizing Dynamic Partial Reconfiguration to Reduce the Cost of FPGA Debugging","authors":"Islam Ahmed, Ahmed Kamaleldin, H. Mostafa, A. Mohieldin","doi":"10.1109/NEWCAS.2018.8585494","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585494","url":null,"abstract":"Debugging of Field-Programmable Gate Arrays (FPGAs) is a difficult task due to the limited access to the internal signals of the design. Embedded logic analyzers enhance the signal observability for FPGAs. These analyzers are implemented on the FPGA resources and they use the embedded memory blocks as trace buffers, so a limited number of signals can be observed using these analyzers due to resources constraints. Changing the traced set of signals requires re-synthesis, placement and routing of the whole design. In this paper, we propose a new methodology for FPGA debugging to change dynamically the set of signals to be observed at runtime, and consequently minimize the time required for debugging. The proposed methodology utilizes the Dynamic Partial Reconfiguration (DPR) technique to dynamically switch between different sets of signals. DPR creates a reconfigurable module (RM) to route each set of signals to an embedded logic analyzer. We demonstrate the proposed approach using Xilinx FPGA tools, finding that changing the set of signals to be observed requires only few milli-seconds to re-program the reconfigurable region (RR). The area overhead of the proposed methodology is lower than other traditional methods of using multiplexers as the DPR allows the routing module to only use buffers to connect a set of signals to the embedded logic analyzer.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124518988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A 500 pW 16 KHz Injection Locked Oscillator for Ultra-Low Power Time-Domain ADC Application","authors":"N. Goux, E. Dina, F. Badets","doi":"10.1109/NEWCAS.2018.8585520","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585520","url":null,"abstract":"This paper presents an Ultra-Low Power Injection Locked Oscillator (ILO) circuit dedicated to Time-Domain ADC architectures. More particularly, it is shown that the power consumption of the ILO depends on its locking range. It is also shown that ILO based phase shifter power consumption could be optimized by adapting its locking range to the full scale of its input. The circuit works on a 0.4V supply voltage, with a 500 pW power consumption and a 16 KHz locking range.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121919711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Wide-Load-Range Mixed-Mode LDO Regulator with Single-Transistor-Assisted Buffer","authors":"Ning Zhang, Chenchang Zhan, Han Li, Linjun He","doi":"10.1109/NEWCAS.2018.8585690","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585690","url":null,"abstract":"A wide-load-range low-dropout (LDO) regulator with single-transistor-assisted buffer and analog-digital mixed-mode control is presented in this paper. The single-transistor-assisted PMOS-input unity-gain-buffer can extend the gate voltage range of the analog power transistor and guarantee that the proposed LDO regulator can work well even at very light load current. To extend the loading capability for large load current, a mixed-mode control is used to detect heavy load and turn on or off the digital power transistor which provides large current to the load with better area efficiency. With the digital assistance, the proposed LDO regulator has better load regulation and loading capability at heavy load as well. A proof-of-concept design of the proposed LDO regulator has been implemented and fabricated in a standard $0.18-mu mathrm {m}$ CMOS process. It occupies an active area of only $0.01 mathrm {m}mathrm {m}^{2}$. With the load current step from 0 mA to 150 mA, the proposed LDO regulator with digital assistance achieves a droop voltage reduction of more than 50% compared to the design without the digital assistance. A load regulation of 0.21 mV/mA is achieved by the proposed design with the help of the digital assistance and single-transistor-assisted buffer.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"1093 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116042173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
G. Guitton, T. Taris, Marcelo de Souza, A. Mariano
{"title":"Design of CMOS LNA with the Inversion Coefficient","authors":"G. Guitton, T. Taris, Marcelo de Souza, A. Mariano","doi":"10.1109/NEWCAS.2018.8585633","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585633","url":null,"abstract":"The design space of two 130 nm CMOS Low Noise Amplifiers (LNA) is explored with the EKV model. More specifically the gain, the noise figure and the input impedance are described with the Inversion Coefficient. This approach allows to work out the relevant sizing and bias conditions for each LNA topology. The first LNA, dedicated to 2.4 GHz low power applications, consumes $80~mu W$. It achieves a voltage gain of 13.8 dB and a noise figure of 5.1 dB. The second LNA, dedicated to multi-standard applications, consumes $3.1~mW$ and covers a bandwidth of 2.1 GHz. The voltage gain is 19.2 dB and the minimum noise figure is $2.4~dB$.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114882716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shimwe Dominique Niyonambaza, Élodie Boisselier, M. Boukadoum, A. Miled
{"title":"Microfluidic H-filter Cell Modeling for Robust Purification of Gold Nanoparticles","authors":"Shimwe Dominique Niyonambaza, Élodie Boisselier, M. Boukadoum, A. Miled","doi":"10.1109/NEWCAS.2018.8585579","DOIUrl":"https://doi.org/10.1109/NEWCAS.2018.8585579","url":null,"abstract":"This paper describes a microfluidic H-filter cell model designed for the dialysis of gold nanoparticles without resort to a conventional membrane filter. It is part of a preliminary investigation of the implementation of a reusable automated dialysis system. The modeling was achieved using COMSOL Metaphysics Ⓒ software. It is based on the laminar flow and transport of diluted species interfaces to screen molecular separation trough diffusion. The targeted gold nanoparticles have a hydrodynamic diameter of $24 pm 2$ nm and a diffusion coefficient of $2times 10^{-11} mathrm {m}^{2}.mathrm {s}^{-1}$. The diffusion coefficient of impurities in the gold nanoparticles solution was estimated to $10^{-10}mathrm {m}^{2}.mathrm {s}^{-1}$. Obtained results of showed that the concentration of impurities in the solution of gold nanoparticles was reduced by 51% with one filtration while keeping 93% of gold nanoparticles.","PeriodicalId":112526,"journal":{"name":"2018 16th IEEE International New Circuits and Systems Conference (NEWCAS)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132288414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}