2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)最新文献

筛选
英文 中文
Accurate Synthesis of Arithmetic Operations with Stochastic Logic 算术运算与随机逻辑的精确综合
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.74
E. Vahapoglu, M. Altun
{"title":"Accurate Synthesis of Arithmetic Operations with Stochastic Logic","authors":"E. Vahapoglu, M. Altun","doi":"10.1109/ISVLSI.2016.74","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.74","url":null,"abstract":"In this study, we propose a method to overcome the main drawback in stochastic computing, low accuracy or related long computing times. Our method exploits dependency in stochastic bit streams with the aid of feedback mechanisms. Accurate (error-free) arithmetic multiplier and adder circuits are implemented. Operations are performed using both stochastic and binary inputs/outputs, binary-stochastic number conversion circuits are implemented for this purpose. We test our circuits by considering performance parameters area, delay, and accuracy. The simulation results are evaluated in a comparison with the results of other stochastic and deterministic (conventional) computing techniques in the literature. Additionally, we discuss the applicability of our method in emerging technologies including printed/flexible electronics for which low transistor counts is desired.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116239930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
A Low-Cost Mixed Clock Generator for High Speed Adiabatic Logic 用于高速绝热逻辑的低成本混合时钟发生器
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.44
Zhou Zhao, A. Srivastava, Lu Peng, S. Mohanty
{"title":"A Low-Cost Mixed Clock Generator for High Speed Adiabatic Logic","authors":"Zhou Zhao, A. Srivastava, Lu Peng, S. Mohanty","doi":"10.1109/ISVLSI.2016.44","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.44","url":null,"abstract":"Low power and robust circuitry are permanent hotspots in VLSI design. Adiabatic logic is one of potential breakthroughs for these goals. Especially, designing reliable clock tree is very significant for adiabatic logic due to four-phase clocked power required for pipelined data transmission in adiabatic system. In this paper, we present analysis of charging speed and clock types that influence power dissipation in adiabatic logic and comparison of current mainstream clock generators suitable for adiabatic system. Based on the characteristics of current designs, using TSMC 180nm fabrication process, we propose a novel mixed clock generator, including four-phase source, switch controller and clock MUX to build a robust clock using only one reference clock. The test shows that below 600MHz, the proposed design has negligible signal attenuation with low power dissipation. We have also compared our work and current designs in device cost, and suitable work frequency based on circuit structure.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126727973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A System-Level Exploration of Power Delivery Architectures for Near-Threshold Manycores Considering Performance Constraints 考虑性能约束的近阈值多核供电架构的系统级探索
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.65
Ioannis S. Stamelakos, A. Djahromi, A. Eltawil, G. Palermo, C. Silvano, F. Kurdahi
{"title":"A System-Level Exploration of Power Delivery Architectures for Near-Threshold Manycores Considering Performance Constraints","authors":"Ioannis S. Stamelakos, A. Djahromi, A. Eltawil, G. Palermo, C. Silvano, F. Kurdahi","doi":"10.1109/ISVLSI.2016.65","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.65","url":null,"abstract":"Continuous technology scaling and increased demand for computational power have introduced a paradigm shift in manycore design requirements. On the other hand, tight power budgets and limitations of voltage scaling are throttling the ability to optimally exploit the potential of these systems, leading researchers to adopt aggressive voltage scaling techniques such as Near-Threshold Computing (NTC). In this paper we evaluate and compare the efficiency of different power delivery schemes for NT manycore architectures under process variation while meeting performance constraints. For platforms operating in a specific voltage range, simple and cost effective Power Delivery (PD) architectures can deliver average power savings ranging from 24% up to 50%, when taking into account the workload characteristics of the target applications at design time1.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128160410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Low-Power and High Performance Sinusoidal Clocked Dynamic Circuit Design 低功耗高性能正弦时钟动态电路设计
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.34
Raghava Katreepalli, Hemanth Chemanchula, T. Haniotakis, Y. Tsiatouhas
{"title":"Low-Power and High Performance Sinusoidal Clocked Dynamic Circuit Design","authors":"Raghava Katreepalli, Hemanth Chemanchula, T. Haniotakis, Y. Tsiatouhas","doi":"10.1109/ISVLSI.2016.34","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.34","url":null,"abstract":"Important characteristic of any VLSI circuit is its power consumption, reliability, operating speed and silicon area. Dynamic CMOS designs provide high operating speeds compared to static CMOS designs combined with low silicon area requirements. Pipelines can be used for achieving high circuit operating speeds. However, as the operating frequency increases, the number of pipeline stages should also increase and so the memory elements. These memory elements increase the area overhead and restrict the maximum achievable frequency due to their delays. Memory-less pipelines based on dynamic design address these issues but, still require high power consumption for the clock signal. In this paper we propose a sinusoidal three-phase clocking scheme that reduces the power required by the clock and offers high circuit operating frequencies. Thus the proposed technique provides advantages over preexisting techniques in terms of power requirement, area of implementation and operating speed.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134314693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Using Statistical Models to Improve the Reliability of Delay-Based PUFs 利用统计模型提高时延puf的可靠性
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.125
Xiaolin Xu, W. Burleson, Daniel E. Holcomb
{"title":"Using Statistical Models to Improve the Reliability of Delay-Based PUFs","authors":"Xiaolin Xu, W. Burleson, Daniel E. Holcomb","doi":"10.1109/ISVLSI.2016.125","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.125","url":null,"abstract":"Physical Unclonable Functions (PUFs) use random physical variations to map input challenges to output responses in a way that is unique to each chip. PUFs are promising low cost security primitives but unreliability of outputs limits the practical applications of PUFs. This work addresses two causes of unreliability: environmental noise and device aging. To improve reliability, we constructively apply Machine Learning modeling, and use the models to predict and then discard challenge-response pairs (CRPs) that will be unreliable with respect to noise and aging on a given PUF instance. The proposed method provides flexibility to control error rate by deciding what percentage of challenges to discard. Our experiments find that a PUF with nominal reliability of 91% can be made fully reliable by discarding only 20% of challenges.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132588015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
A Pruning Technique for B&B Based Design Exploration of Approximate Computing Variants 基于B&B的近似计算变量设计探索的剪枝技术
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.110
M. Barbareschi, F. Iannucci, A. Mazzeo
{"title":"A Pruning Technique for B&B Based Design Exploration of Approximate Computing Variants","authors":"M. Barbareschi, F. Iannucci, A. Mazzeo","doi":"10.1109/ISVLSI.2016.110","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.110","url":null,"abstract":"Approximate Computing is revealing a new design paradigm which trades algorithms precision off for enhancing performance parameters, commonly energy consumption and computation time. Applications which are characterized by the inherent resiliency property tolerate some quality loss, w.r.t. the optimal result. The approximation is accomplished by combining substitutions of fully-precise block operations with inaccurate ones. However, exploring every possible approximate variant of an algorithm would be extremely costly due to countless configurations. IDEA, a design exploration tool for approximate computing algorithms, introduced a branch and bound exploration approach to make it affordable. In this paper, we enhance the IDEA B&B exploration approach by introducing a pruning technique, which significantly reduces the design solution space to explore. We demonstrate the effectiveness of approach by comparing the execution of approximating campaigns over some algorithms employing proposed pruning rules.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133334672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Power-Delay-Area Performance Modeling and Analysis for Nano-Crossbar Arrays 纳米交叉棒阵列的功率延迟区性能建模与分析
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.100
Muhammed Ceylan Morgül, Furkan Peker, M. Altun
{"title":"Power-Delay-Area Performance Modeling and Analysis for Nano-Crossbar Arrays","authors":"Muhammed Ceylan Morgül, Furkan Peker, M. Altun","doi":"10.1109/ISVLSI.2016.100","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.100","url":null,"abstract":"In this study, we introduce an accurate capacitor-resistor model for nano-crossbar arrays that is to be used for power/delay/area performance analysis and optimization. Although the proposed model is technology independent, we explicitly show its applicability for three different nanoarray technologies where each crosspoint behaves as a diode, a FET, and a four-terminal switch. In order to find related capacitor and resistor values, we investigate upper/lower value limits for technology dependent parameters including doping concentration, nanowire dimension, pitch size, and layer thickness. We also use different fan-out capacitors to test the integration capability of these technologies. Comparison between the proposed model and a conventional simple one, which generally uses one/two capacitors for each crosspoint, demonstrates the necessity of using our model in order to accurately calculate power and delay values. The only exception where both models give approximately same results is the presence of considerably low valued resistive connections between switches. However, we show that this is a rare case for nano-crossbar technologies.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124090614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Low Cost VLSI Architecture for Sample Adaptive Offset Encoder in HEVC HEVC中样本自适应偏移编码器的低成本VLSI架构
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.78
Sayed El Gendy, A. Shalaby, M. Sayed
{"title":"Low Cost VLSI Architecture for Sample Adaptive Offset Encoder in HEVC","authors":"Sayed El Gendy, A. Shalaby, M. Sayed","doi":"10.1109/ISVLSI.2016.78","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.78","url":null,"abstract":"Sample Adaptive Offset (SAO) has been adopted as a new in-loop filtering block in High Efficiency Video Coding (HEVC). It can significantly increase compression efficiency especially for sequences that contain computer graphics content up to 23%. To get the optimum SAO parameters, exhaustive operations are required because of the huge amount of samples which the encoder has to study. In this work, a low cost high throughput VLSI implementation for the parameter estimation (encoding) phase is proposed. The proposed novel architecture reduces the cost in terms of gates count by 47% in comparison with prior work. The proposed design is prototyped using 65 nm CMOS technology. It has 89.3 Kgates, 8832 bits SRAM, and a maximum clock frequency of 426 MHz. It can support real time 8K×4K@120fps videos at 378 MHz.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"971 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123079693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Thermal-Aware Preemptive Test Scheduling for Network-on-Chip Based 3D ICs 基于片上网络的3D集成电路热感知抢占式测试调度
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.75
K. Manna, C. S. Sagar, S. Chattopadhyay, I. Sengupta
{"title":"Thermal-Aware Preemptive Test Scheduling for Network-on-Chip Based 3D ICs","authors":"K. Manna, C. S. Sagar, S. Chattopadhyay, I. Sengupta","doi":"10.1109/ISVLSI.2016.75","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.75","url":null,"abstract":"The recently proposed three-dimensional (3D) integration promises to enhance the system performance. However, it poses several test challenges. Thermal safety of the 3D system is the foremost concern. Testing of the system plays an important role to improve the yield. This work presents a thermal-aware core test scheduling technique in 3D stacked multicore system using Particle Swarm Optimization (PSO) strategy. To improve the solution quality, the basic PSO has been augmented with multiple PSO operations. The proposed strategy has been compared with other the techniques available in the literature. Thermal-safety has been achieved by providing nominal sacrifice in the test time. The experimental results project the improvement over other state-of-the-art strategies.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116145675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Architecture Exploration for Energy-Efficient Embedded Vision Applications: From General Purpose Processor to Domain Specific Accelerator 节能嵌入式视觉应用的架构探索:从通用处理器到特定领域加速器
2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2016-07-01 DOI: 10.1109/ISVLSI.2016.112
Maria Malik, Farnoud Farahmand, P. Otto, N. Akhlaghi, T. Mohsenin, S. Sikdar, H. Homayoun
{"title":"Architecture Exploration for Energy-Efficient Embedded Vision Applications: From General Purpose Processor to Domain Specific Accelerator","authors":"Maria Malik, Farnoud Farahmand, P. Otto, N. Akhlaghi, T. Mohsenin, S. Sikdar, H. Homayoun","doi":"10.1109/ISVLSI.2016.112","DOIUrl":"https://doi.org/10.1109/ISVLSI.2016.112","url":null,"abstract":"OpenCV applications are computationally intensive tasks among computer vision algorithms. The demand for low power yet high performance real-time processing of OpenCV embedded vision applications have led to developing their customized implementations on state-of-the-art embedded processing platforms. Given the industry move to heterogeneous platforms which integrates single core or multicore CPU with on-chip FPGA accelerators and GPU accelerators, the question of what platform and what implementation, whether hardware or software, is best suited for energy-efficient processing of this class of applications is becoming important. In this paper, we seek to answer this question through a detailed hardware and software implementation of OpenCV applications and methodically measurement and comprehensive analysis of their power and performance on state-of-the-art heterogeneous embedded processing platforms. The results show that in addition to application behavior, the size of image is an important factor in deciding the efficient platform in terms of highest energy-efficiency (EDP) among hardware accelerators on FPGA and software accelerators on GPU and multicore CPUs. While hardware implementation on ZYNQ shown to be the most performance and energy-efficient for image size of 500x500 or less, software GPU implementation found to be the most efficient and achieves highest speedup for larger image sizes. In addition, while for compute intensive vision applications the gap between FPGA, CPU and GPU reduces as the size of image increases, for non-intensive applications, a large performance and EDP gap is observed between the studied platforms, as the size of the image increases.","PeriodicalId":140647,"journal":{"name":"2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123090613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书