T. M. Khanshan, K. G. Kjeldgard, E. Ulvestad, D. Wisland, T. Lande
{"title":"CMOS photosensors for LIDAR","authors":"T. M. Khanshan, K. G. Kjeldgard, E. Ulvestad, D. Wisland, T. Lande","doi":"10.1109/NORCHIP.2018.8573471","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573471","url":null,"abstract":"In this paper photosensors for CMOS LIDAR intended for non-invasive biosensing are presented. Adequate depth resolution requires high-speed sampling and high-speed photosensors, hard to make in standard CMOS. In this paper CMOS photodiodes are evaluated with respect to speed and sensitivity designed for use in a single-chip LIDAR system. Diodes dimensions are quite small $(5 mu m times 5 mu m)$ suited for larger arrays. Simple assessment circuits are used in combination with high-speed optical instrumentation for characterization of different photosensors structures fabricated in standard $90 nm$ CMOS.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114667951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design and Implementation of Multi-Purpose DCT/DST-Specific Accelerator on Heterogeneous Multicore Architecture","authors":"Sajjad Nouri, R. G. Youvalari, J. Nurmi","doi":"10.1109/NORCHIP.2018.8573457","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573457","url":null,"abstract":"This paper presents the implementation of various sizes of Discrete Cosine transform (DCT) and Discrete Sine Transform (DST) dedicated for High Efficiency Video Coding (HEVC) standard by using template-based Coarse-Grained Reconfigurable Arrays (CGRAs) as accelerators on Heterogeneous Accelerator-Rich Platform (HARP). The proposal makes multipurpose DCT/DST specific accelerators in such a way that final architecture consists of 4/8/16/32–point DCT and 4-point DST. The accelerators are primarily designed by crafting template-based CGRA devices at different dimensions and then arranging them on a Network-on-Chip platform along with a few RISC cores. In this research work, the performance of each DCT/DST-specific accelerator, the collective performance of the whole platform and the NoC traffic are recorded in terms of the number of clock cycles and several high-level performance metrics. Conducted experiments show that 4-point DCT and 4-point DST can be implemented completely in 54 and 56 clock cycles, respectively, while for 8/16/32–point DCT, 67, 179 and 354 clock cycles are required, respectively. The achieved total power dissipation and energy consumption based on post placement and routing information are equal to 4.1 W and $10.87~mu {mathrm {J}}$, respectively with 256 instantiated Processing Elements (PEs) at 200.0 MHz operating frequency. It resulted to a performance of 51.2 Giga Operations Per Second (GOPS) and 12 MOPS/mW as an architectural constant for the HARP template on 28 nm Altera Stratix-V chip. The proposed architecture is able to sustain Full HD 1080p format at 30 fps on FPGA.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125781923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-Swarm based NoC Configuration and Synthesis","authors":"Muhammad Obaidullah, G. Khan, Fei Yuan","doi":"10.1109/NORCHIP.2018.8573462","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573462","url":null,"abstract":"Network-on-Chip (NoC) is a popular interconnection structure suited to many-core System-on-Chip (SoC). Assuming a mesh-based NoC, we explore the assignment of cores to NoC nodes and produce a best NoC configuration having minimal communication traffic, power consumption, and chip area. We employ pre-synthesized NoC components data to estimate power and area consumption of the interconnection network. NoC configuration and mapping problem is NP-hard, and we propose a hybrid scheme of swarm optimization that combines Tabu-list, sub-swarms, and Discrete Particle Swarm Optimization (DPSO). The main goal is to configure and synthesize NoC such that the total NoC latency, power consumption, and chip area are minimal. DPSO is used as the main optimization scheme and modified it such that each swarm particle move is influenced by NoC traffic. The methodology is tested for some multimedia application core graphs. It is determined that on average our tool reduced NoC area by 30% on average and reduced total NoC power (static + worst case dynamic) by 27.5% as compared to unoptimized NoCs.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133439357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Row-Column Accessed Dynamic Element Matching DAC Architecture for SAR ADCs","authors":"Mustafa Kilic, Selman Ergünay, Y. Leblebici","doi":"10.1109/NORCHIP.2018.8573513","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573513","url":null,"abstract":"Capacitor mismatch in high resolution Successive Approximation Register (SAR) Analog to Digital Converters (ADCs) causes non-linearity effects that reduce the Spurious-Free Dynamic Range (SFDR). This paper presents a novel dynamic element matching (DEM) technique aiming at enhancing the SFDR of SAR ADCs. The high resolution capacitive Digital to Analog Converter (DAC) array is considered as a square-matrix where cells are switched on and off according to row and column bit lines. The number of signals to control and scramble is then highly reduced, enabling high speed operation as well. The proposed architecture has been implemented on 10 and 12-bit SAR ADC models in Matlab in order to highlight its performances. Considering a standard deviation of $sigma$ = 1.5% for the unit capacitance, this architecture enables to correct the SFDR by more than 10dB in high distortion cases, while allowing high speed operation.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133145444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Montaseri, R. Vuohtoniemi, J. Aikio, T. Rahkonen, A. Pärssinen
{"title":"Design of Multi-Stacked CMOS mm-Wave Power Amplifiers for Phased Array Applications Using Triple-Well Process","authors":"M. Montaseri, R. Vuohtoniemi, J. Aikio, T. Rahkonen, A. Pärssinen","doi":"10.1109/NORCHIP.2018.8573452","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573452","url":null,"abstract":"This paper concerns with the design of multi-stacked CMOS millimeter-wave power amplifiers suitable for phased array front-end applications using triple-well process. The parasitics posed by the triple-well technique are studied and compensated using negative capacitance technique for proper operation. The design technique is evaluated using TSMC 28nm CMOS process at 28GHz operating frequency as a candidate operating band for 5G systems. The results illustrate a power gain of 25dB, 22dBm saturated power, and a maximum 38% PAE along with superior phase alignment between stacks.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"619 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123069179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Olaitan Olabode, Vishnu Unnikrishnan, Ilia Kempi, A. Hammer, M. Kosunen, J. Ryynänen
{"title":"A Configurable Hysteresis Comparator for Asynchronous Sigma-Delta Modulators","authors":"Olaitan Olabode, Vishnu Unnikrishnan, Ilia Kempi, A. Hammer, M. Kosunen, J. Ryynänen","doi":"10.1109/NORCHIP.2018.8573454","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573454","url":null,"abstract":"This paper describes a configurable hysteresis comparator for asynchronous sigma-delta modulators (ASDM). The proposed comparator provides coarse and fine tuning options for configuring the loop delay and hence the frequency of an ASDM. The post-layout simulation of the comparator implemented in a 28 nm FDSOI process shows that the comparator provides hysteresis voltage range of ±(1 to 15.3) mV while consuming 36.8 nW to 4.4 μW from 0.7 V supply, which enables configurable ASDM center-frequency in the range of 100 kHz to 6 MHz.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128704772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Building Lumped Models for Measured Passive mm-wave Components","authors":"E. Sankila, Veeti Kiuru, J. Aikio, T. Rahkonen","doi":"10.1109/NORCHIP.2018.8573484","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573484","url":null,"abstract":"This paper presents a synthesis flow for building lumped circuit models of arbitrary complexity for mm-wave IC passive components, based on S-parameters obtained by measurements or electromagnetic (EM) field simulations. Lumped circuit models are needed in time-domain simulations, or to speed up the fine-tuning of passive circuit blocks, as iterating is much faster in circuit simulators than in EM simulator. Modeling algorithm is implemented in MATLAB, and the design flow has a few new features. The device model is given by Spice netlist, and its structure or complexity is not limited. Differential and common mode forms of admittance parameters are used to simplify solving the initial model component values that are then refined manually or by numerical optimization. The flow is illustrated by modeling a parallel LC resonator, whose response has been measured from 1 to 40 GHz.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"41 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120908290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design Considerations and Evaluation of a High-Speed SAR ADC","authors":"Victor Åberg, C. Fager, L. Svensson","doi":"10.1109/NORCHIP.2018.8573500","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573500","url":null,"abstract":"We present design and evaluation of an asynchronous, alternating-comparator, 800MS/s SAR ADC. The comparators use continuous calibration to compensate for static input offset voltages. We use a combination of measurements and behavioural modelling to identify two possible causes of limited performance; leakage from the output drives onto the input signal, and dynamic offset voltage in the comparators.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126820506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nicolai Behmann, M. Mehltretter, S. Kleinschmidt, Bernardo Wagner, C. Heipke, H. Blume
{"title":"GPU-enhanced Multimodal Dense Matching","authors":"Nicolai Behmann, M. Mehltretter, S. Kleinschmidt, Bernardo Wagner, C. Heipke, H. Blume","doi":"10.1109/NORCHIP.2018.8573526","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573526","url":null,"abstract":"Multiple modalities for stereo matching are beneficial for robust path estimation and actioning of autonomous robots in harsh environments, e.g. in the presence of smoke and dust. In order to combine the information resulting from the different modalities, a dense stereo matching approach based on semi-global matching and a combined cost function using cross-based support regions and phase congruency shows a good performance. However, these computationally complex algorithmic steps set high requirements for the mobile processing platform and prohibit a real-time execution at limited power budget on mobile platforms. Therefore, this paper explores the usage of graphic processors for the parallelization and acceleration of the aforementioned algorithm. The resulting implementation performs the computation of phase congruency and cross-based support regions at 68 and 5 frames per second for $[960mathrm{x}560]$ pixel images on a Nvidia Quadro P5000 and Tegra X2 GPU respectively.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124318304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Reem W. Etman, Salma Hesham, Klaus Hoffman, M. A. E. Ghany, D. Göhringer
{"title":"Analysis of Synchronous-Asynchronous NoC for the Dark Silicon Era","authors":"Reem W. Etman, Salma Hesham, Klaus Hoffman, M. A. E. Ghany, D. Göhringer","doi":"10.1109/NORCHIP.2018.8573470","DOIUrl":"https://doi.org/10.1109/NORCHIP.2018.8573470","url":null,"abstract":"Future chips are anticipated to feature hundreds of on-die resources, but a significant portion of silicon area in these chips will be powered-off or “dark”. As a result, only a limited number of cores of future processors can be powered on simultaneously. In this paper, a novel NoC architecture is proposed, called Dark Silicon Synchronous-Asynchronous (DSSA) NoC, that offer a promising solution from the latency perspective in consideration to the number of hops from source to destination. In this design two layers of architecturally analogous synchronous and asynchronous TDM routers are integrated, leveraging the extra transistors available due to dark silicon for a hard real-time multiprocessor platform. At a given time, at most, only one of the network layers is illuminated while the other is dark. Layer selection performed at the granularity of router port with applied power gating technique uniquely in a source routed TDM router. To verify the design, a$4times4$ bitorus synchronous, asynchronous and DSSA NoCs are implemented in 90-nm CMOS technology and the results are compared in the fields of area, speed, latency and power consumption, which shows improved message latency by DSSA NoC. The results further show the effects of the power gating technique to the TDM NoC, which offers up to 50% power savings in comparison to the conventional synchronous and asynchronous TDM NoCs. This illustrates DSSA as an improved latency NoC solution in future dark silicon chips.","PeriodicalId":152077,"journal":{"name":"2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124354779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}