Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)最新文献

筛选
英文 中文
Low power chips: a fabless asic perspective 低功耗芯片:无晶圆厂基本视角
S. Bhonge, V. Boppana
{"title":"Low power chips: a fabless asic perspective","authors":"S. Bhonge, V. Boppana","doi":"10.1145/1393921.1394013","DOIUrl":"https://doi.org/10.1145/1393921.1394013","url":null,"abstract":"Summary form only given. The fabless ASIC model has changed the landscape of ASIC design by offering a high-quality, cost-effective and open alternative to realizing ASICs. The very nature of this model (because of its reliance on the third-party foundry, IP ecosystem) offers unique challenges and opportunities for implementing low power chips. This tutorial presents an overview of the exciting low power challenges, opportunities and solutions available in a fabless ASIC model. We review state-of-the-art low power IC solutions and case studies from varied markets, including processor-based, wired, wireless, consumer and multi-core chips. We start with a discussion on technology trends and low power challenges. We next review the spectrum of low power solutions and identify the appropriate opportunities that are applicable to the fabless ASIC model. We also discuss unique technology solutions that employ the use of transistor-level transformations that extend the solutions typically available in the ASIC model. Next, we discuss how these solutions are deployed in the model. We finally present detailed case studies of ICs. The low power techniques employed in the ICs include selection of technology node/process, selection of macros, multi-voltage design, power gating, custom transistor-level circuits, clocking, selection and optimization of standard cell libraries, design/architecture and power planning, advanced timing and power optimization, low power design closure, innovative packaging and power impact on variability-tolerance. The tutorial arms the audience with the best techniques, tools and methodologies to achieve the lowest power Silicon for state-of-the-art ASICs.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116979475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Word-interleaved cache: an energy efficient data cache architecture 字交错缓存:一种节能的数据缓存架构
T. V. Kalyan, M. Mutyam
{"title":"Word-interleaved cache: an energy efficient data cache architecture","authors":"T. V. Kalyan, M. Mutyam","doi":"10.1145/1393921.1393991","DOIUrl":"https://doi.org/10.1145/1393921.1393991","url":null,"abstract":"We propose a novel energy-efficient data cache architecture, namely, word-interleaved (WI) cache. In theWI cache, a cache block is distributed uniformly among the different cache ways and each line of a cache way holds some words of the block. This distribution provides an opportunity to activate/deactivate the cache ways based on the requested address's offset, thus minimizing the overall cache access energy. For a 4-way set associative cache of size 16KB and blocksize 32B, the proposed technique accomplishes dynamic energy savings of 54.2% without considering fast hits and 62.3% when fast hits are considered, with small performance degradation and negligible area overhead.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114722431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Penalty for power reduction -: performance or schedule or yield? 减少功率的惩罚:性能、进度还是产量?
B. Sarker, J. Ahuja, Arijit Dutta, D. Srinath, K. Sridhar, Radhakrishnan Nair, J. Lahiri
{"title":"Penalty for power reduction -: performance or schedule or yield?","authors":"B. Sarker, J. Ahuja, Arijit Dutta, D. Srinath, K. Sridhar, Radhakrishnan Nair, J. Lahiri","doi":"10.1145/1393921.1393999","DOIUrl":"https://doi.org/10.1145/1393921.1393999","url":null,"abstract":"It is often said \"It is always give and take\" and that \"there is no such thing as a free lunch\". The same would hold true for Low Power designs. The questions oft asked is What are the trade-offs for reduction in power? What would be the limits of power reduction, before it starts impacting other parameters? Designs are generally characterized by four predominant parameters - performance, timing, area and power. As design and manufacturing became different disciplines supported by independent teams, two additional parameters were added to the design characterization, schedule and yield. Schedule implies the time taken to get the design to the desired performance and yield indicates the percentage of designs that meet the performance criterion, after manufacturing. Performance, schedule and yield have become a proxy for the expertise built in the design team and the capability of the tools to handle the complex designs. Teams with expertise and access to appropriate tools, can build high performance designs faster and at the desired yields. Traditionally performance has been correlated with timing or the maximum operating frequency of the design. More recently power is becoming an important area of concern, and is forcing designers to design within the power specifications of the design. Power has been seen as limiting the timing performance for many designs. In this key panel, we will discuss: What could be some of the best practices to reduce power while maintaining timing performanceHow could one analyze the performance, schedule, yield trade-off with an exampleDiscuss industry-wide effort to reduce the penalty for further power reduction","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126763918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Innovations to extend CMOS nano-transistors to the limit 将CMOS纳米晶体管扩展到极限的创新
T. Ghani
{"title":"Innovations to extend CMOS nano-transistors to the limit","authors":"T. Ghani","doi":"10.1145/1393921.1393924","DOIUrl":"https://doi.org/10.1145/1393921.1393924","url":null,"abstract":"Summary form only given. The scaling of CMOS technology has led to phenomenal growth in transistor density and performance during the last three decades. However, starting at 90nm CMOS node, the industry started to experience significant barriers in achieving historical transistor performance gains through traditional dimensional scaling. Fortunately, the industry has responded positively to this challenge by implementing many innovations in device structure and materials to overcome traditional scaling barriers. Intel has been at the forefront in addressing these challenges by successfully driving transistor innovations from research phase to mainstream CMOS manufacturing. Implementation of uniaxial strained-silicon transistors at the 90nm node and the recently announced \"HiK+Metal Gate\" transistors for the 45nm node are two excellent examples of major innovations which have demonstrated dramatic performance enhancement. After a brief review highlighting the dramatic performance benefits demonstrated with uniaxial strained silicon technology, I will describe process details and present significant performance gains achieved with \"HiK+Metal Gate\" transistor technology for 45nm CMOS node. Finally, I will discuss the role of increasing power density and transistor parasitics in limiting future CMOS transistor scaling and describe potential new transistor structure and material innovations required to meet performance/power/density improvements beyond 45nm CMOS node. The convergence of new transistor structure and materials will be critical for successfully scaling CMOS transistors through next decade.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124167762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Instruction-driven clock scheduling with glitch mitigation 带有故障缓解的指令驱动时钟调度
Gu-Yeon Wei, D. Brooks, A. Khan, Xiaoyao Liang
{"title":"Instruction-driven clock scheduling with glitch mitigation","authors":"Gu-Yeon Wei, D. Brooks, A. Khan, Xiaoyao Liang","doi":"10.1145/1393921.1394017","DOIUrl":"https://doi.org/10.1145/1393921.1394017","url":null,"abstract":"Instruction-driven clock scheduling is a mechanism that minimizes clock power in deeply-pipelined datapaths. Analysis of realistic processor workloads shows a preponderance of bubbles persist through pipelines like the floating point unit. Clock scheduling ostensibly adapts pipeline depth with respect to bubbles in the instruction stream without performance loss. Unfortunately, shallower pipelines (i.e. longer pipe stages) are prone to larger amounts of glitches propagating through logic, increasing dynamic power. Experimentally measured results from a 130 nm FPU test chip with flexible clocking capabilities show a super-linear increase in glitch-induced dynamic power for shallower pipelines. While higher glitch power can severely diminish the power savings offered by clock scheduling, judicious clocking of intermediate stages offers glitch mitigation to recover power savings for worst-case scenarios. Detailed analysis of clock scheduling applied to a FPU in a POWER4-like processor running realistic workloads shows an average net power savings of 15% compared to an aggressively clock-gated design.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122510224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A low power layered decoding architecture for LDPC decoder implementation for IEEE 802.11n LDPC codes 针对IEEE 802.11n LDPC码的低功耗分层解码架构
Jie Jin, C. Tsui
{"title":"A low power layered decoding architecture for LDPC decoder implementation for IEEE 802.11n LDPC codes","authors":"Jie Jin, C. Tsui","doi":"10.1145/1393921.1393989","DOIUrl":"https://doi.org/10.1145/1393921.1393989","url":null,"abstract":"This paper presents a low power LDPC decoder design based on reducing the amount of memory access. By utilizing the column overlapping of the LDPC parity check matrix, the amount of access for the memory storing the posterior values is minimized. In addition, a thresholding decoding scheme is proposed which reduces the memory access by trading off the error correcting performance. The decoder was implemented in TSMC 0.18μm CMOS process. Experimental results show that for a LDPC decoder targeting for IEEE 802.11n, the power consumption of the memory and the decoder can be reduced by 72% and 24%, respectively.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123556743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Towards a green electronic world: a collaborative approach 迈向绿色电子世界:合作方法
J. Ahuja
{"title":"Towards a green electronic world: a collaborative approach","authors":"J. Ahuja","doi":"10.1145/1393921.1393922","DOIUrl":"https://doi.org/10.1145/1393921.1393922","url":null,"abstract":"Summary form only given. Increasing power density of complex SoC's have made low-power a topic of interest in the industry. Power considerations in portable and wireless consumer devices have become a key part of many product specifications. Even for wired devices and other industry segments in which battery power has not traditionally been an issue, considerations of packaging, reliability, and cooling costs brings power firmly to the forefront at smaller geometries. In particular, as designs migrate to sub-90 nm process nodes, power management becomes a serious concern across the entire design and manufacturing chain. To help design teams, there is a need to adopt advanced power reduction techniques, where a complete low power solution is needed for the design, verification, and implementation of low-power chips. Since the challenge spans across the design chain, industry collaboration is an imperative.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"605 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131788755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Thermal analysis of 8-T SRAM for nano-scaled technologies 用于纳米技术的8-T SRAM热分析
M. Meterelliyoz, J. Kulkarni, K. Roy
{"title":"Thermal analysis of 8-T SRAM for nano-scaled technologies","authors":"M. Meterelliyoz, J. Kulkarni, K. Roy","doi":"10.1145/1393921.1393953","DOIUrl":"https://doi.org/10.1145/1393921.1393953","url":null,"abstract":"Different sections of a cache memory may experience different temperature profiles depending on their proximity to other active logic units such as the execution unit. In this paper, we perform thermal analysis of cache memories under the influence of hot-spots. In particular, 8-T SRAM bit cell is chosen because of its robust functionality at nano-scaled technologies. Thermal map of entire 8-T SRAM cache is generated using hierarchical compact thermal models while solving the leakage and temperature self consistently. The impact of spatial temperature variations on 8T-SRAM parameters such as local bitline (LBL) sensing delay, noise robustness and bitcell stability are evaluated for 45nm/32nm/22nm bulk CMOS technology nodes. The effectiveness of variable keeper sizing on LBL sensing delay is analyzed. It is predicted that at 22 nm node, the leakage induced temperature rise has severe effects on the 8-T SRAM characteristics.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130410733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Low-power high-accuracy timing systems for efficient duty cycling 低功耗高精度定时系统,高效占空比
T. Schmid, J. Friedman, Zainul Charbiwala, Young H. Cho, M. Srivastava
{"title":"Low-power high-accuracy timing systems for efficient duty cycling","authors":"T. Schmid, J. Friedman, Zainul Charbiwala, Young H. Cho, M. Srivastava","doi":"10.1145/1393921.1393943","DOIUrl":"https://doi.org/10.1145/1393921.1393943","url":null,"abstract":"Time keeping and synchronization are important services for networked and embedded systems. High quality timing information allows embedded network nodes to provide accurate time-stamping, fast localization, efficient duty cycling schedules, and other basic but essential functions - all of which are required for low power operation. In this paper we present a new type of local clock source called Crystal Compensated Crystal based Timer (XCXT) and a number of novel algorithms that effectively utilize it to achieve low power consumption in wireless sensor networks. The XCXT has timing accuracies similar to timers based on temperature compensated crystal oscillators (TCXO) but has a lower implementation cost and requires less power. Our initial 8 MHz prototype unit, using the simplest algorithm, achieves an effective frequency stability of ±1.2 ppm and consumes only 1.27 mW. On the other hand, commercially available TCXOs with similar stability can cost over 10 times as much and consume over 20 mW. In addition to the prototype, we will present algorithms that will improve the XCXT's power consumption by at least 48%, depending on application and environmental conditions. We will also show how XCXT's power efficiency can be improved even further by employing clocks at different frequency when different time granularities are required by an application.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114547151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Lazy instruction scheduling: keeping performance, reducing power 懒惰指令调度:保持性能,降低功耗
A. Mahjur, M. Taghizadeh, A. Jahangir
{"title":"Lazy instruction scheduling: keeping performance, reducing power","authors":"A. Mahjur, M. Taghizadeh, A. Jahangir","doi":"10.1145/1393921.1394020","DOIUrl":"https://doi.org/10.1145/1393921.1394020","url":null,"abstract":"An important approach to reduce power dissipation is reducing the number of instructions executed by the processor. To achieve this goal, this paper introduces a novel instruction scheduling algorithm that executes an instruction only when its result is required by another instruction. In this manner, it not only does not execute useless instructions, but also reduces the number of instructions executed after a mispredicted branch. The cost of the extra hardware is 161 bytes for 128 instruction window size. Measurements done using SPEC CPU 2000 benchmarks show that the average number of executed instructions is reduced by 13.5% while the average IPC is not affected.","PeriodicalId":166672,"journal":{"name":"Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114667265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信