Proceedings International Conference on Computer Design VLSI in Computers and Processors最新文献

筛选
英文 中文
Elastic history buffer: a low-cost method to improve branch prediction accuracy 弹性历史缓冲:一种低成本提高分支预测精度的方法
Maria-Dana Tarlescu, K. B. Theobald, G. Gao
{"title":"Elastic history buffer: a low-cost method to improve branch prediction accuracy","authors":"Maria-Dana Tarlescu, K. B. Theobald, G. Gao","doi":"10.1109/ICCD.1997.628853","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628853","url":null,"abstract":"Two-level dynamic branch predictors try to predict the outcomes of conditional branches using both a table of state counters associated with specific branch instructions and a buffer of recent branch outcomes to correlate the counters with specific branch histories. However there is always a question of how much correlation to use, and some programs benefit from higher levels of correlation than others. This paper presents the Elastic History Buffer (EHB), a low-cost yet effective scheme that can exploit the property that each branch instruction may have a different degree of correlation with other branches, while keeping the simple structure of a single global branch history. We have simulated the EHB on SPECint92 for two architectures. On average, the EHB has 25% fewer mispredictions than fixed-correlation schemes and 10% fewer than frequency-based branch classification schemes. With limited hardware (1KB), the EHB is close to the optimum measured by repeating the experiments on an \"oracle\" two-level predictor.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131954401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Dynamic bounding of successor force computations in the force directed list scheduling algorithm 力有向表调度算法中后继力计算的动态边界
S. Govindarajan, R. Vemuri
{"title":"Dynamic bounding of successor force computations in the force directed list scheduling algorithm","authors":"S. Govindarajan, R. Vemuri","doi":"10.1109/ICCD.1997.628949","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628949","url":null,"abstract":"The well known Force Directed List Scheduling (FDLS) Algorithm uses a rigorous priority function called the Force of an operation. The force of an operation is governed by two components, namely the self-force of an operation and its successors' forces. The successor force in turn is governed by the self-force of all the descendants of the operation. FDLS is computationally intensive in its force calculations. For data flow dominated designs, a major portion of the FDLS execution time is spent in the computation of successor forces. However in this paper we observe that it is not always necessary to compute successor forces till the last successor level. We have shown in this paper that there usually exists a stabilization point after which successor force computations would not affect the quality of the schedule produced. This paper presents a concept of stability to show that it is possible to dynamically bound the successor force calculations in FDLS, up to a certain level of descendants. We have measured the performance of FDLS for a suite of high level synthesis benchmarks. Results presented in the paper show considerable reduction in execution time for the same schedule quality. This would allow a high-level synthesis tool to perform better design space exploration.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"13 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120914314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Synthesizing iterative functions into delay-insensitive tree circuits 将迭代函数合成为延迟不敏感的树形电路
Fu-Chiung Cheng
{"title":"Synthesizing iterative functions into delay-insensitive tree circuits","authors":"Fu-Chiung Cheng","doi":"10.1109/ICCD.1997.628883","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628883","url":null,"abstract":"Speed, cost and correctness may be the most important factors in designing a digital system. This paper proposes a novel and general methodology to synthesize iterative functions into potentially high speed low cost and very robust circuits, called delay-insensitive combinational tree iterative circuits. In particular our methodology can be applied to synthesize binary addition and comparison into delay insensitive adders and comparators.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130658989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
If software is king for systems-on-silicon, what's new in compilers? 如果软件是硅上系统之王,那么编译器有什么新变化呢?
N. Dutt, S. Malik, L. Augusteijn, B. Fu, A. Nicolau, C. Polychronopoulos
{"title":"If software is king for systems-on-silicon, what's new in compilers?","authors":"N. Dutt, S. Malik, L. Augusteijn, B. Fu, A. Nicolau, C. Polychronopoulos","doi":"10.1109/ICCD.1997.628889","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628889","url":null,"abstract":"While software is already a significant component in today's system-on-silicon, it is expected to greatly dominate future generations of systems-on-silicon that will contain several (possibly heterogeneous) programmable processors, as well as reconfigurable hardware blocks and large amounts of on-chip memory. In this context, one could argue that the problems for compiler designers haven't changed at all, since the basic issues all boil down to the traditional challenges of: 1) coarse-grain parallelism extraction for the multiple processors on chip, 2) instruction level parallelism exploitation for individual processors, and 3) retargetable code generation for versions of the on-chip processors. This paper summarizes the positions of the panelists who presented their views on the new challenges in compilers for future systems-on-silicon, and who debated these issues at ICCD '97.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133367056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CMOS gate delay models for general RLC loading 通用RLC加载的CMOS门延迟模型
Ravishankar Arunachalam, F. Dartu, L. Pileggi
{"title":"CMOS gate delay models for general RLC loading","authors":"Ravishankar Arunachalam, F. Dartu, L. Pileggi","doi":"10.1109/ICCD.1997.628872","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628872","url":null,"abstract":"Gate and cell level timing analysis remains popular yet inherently incompatible with RC and RCL interconnect loads. The Ceff concept was proposed in Qian et. al. (1994) to model the interaction of empirical gate/cell delay models and RC loads. The most efficient Ceff model works in terms of precharacterizing the parameters of a time varying Thevenin voltage source model (in series with a fixed resistor) over a wide range of effective capacitance load values. In this paper we generalize this Thevenin equivalent Ceff model to enable future technologies which may include reduced supply voltages and RCL loads, without further complicating the Ceff algorithm or iterations.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132813831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 64
An efficient multi-way algorithm for balanced partitioning of VLSI circuits VLSI电路平衡划分的一种高效多路算法
X. Tan, J. Tong, P. Tan, N. Park, F. Lombardi
{"title":"An efficient multi-way algorithm for balanced partitioning of VLSI circuits","authors":"X. Tan, J. Tong, P. Tan, N. Park, F. Lombardi","doi":"10.1109/ICCD.1997.628928","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628928","url":null,"abstract":"This paper presents an efficient algorithm for multi-way balanced partitioning of VLSI circuits. The proposed algorithm is still based on the widely used net-cut model, but its novelty is the potential gain function into the net-cut cost function to relax the single-cell-move constraint (as commonly encountered in the Kernighan-Lin algorithm) for balanced partitioning. This feature permits to move a group of cells (referred to as a Multi-Cell-Move strategy) for partitioning a circuit, while reducing its sensitivity to size constraint. The new multi-way partitioning algorithm is fully analyzed; expressions for the potential gain function (with respect to the multi-move operation) and the cost (for the min-cut objective function) are presented. The time complexity of the proposed partitioning algorithm is O(P/spl times/k/sup 2/ log/sub 2/ (k)), where k is the number of blocks and P is the number of pins. Simulation results are presented and a remarkable improvement is achieved compared with existing algorithms, such as the k-Dual Part algorithm.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"1978 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130194259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Novel simulation of deep-submicron MOSFET circuits 深亚微米MOSFET电路的新型仿真
S. Bruma, R. Otten
{"title":"Novel simulation of deep-submicron MOSFET circuits","authors":"S. Bruma, R. Otten","doi":"10.1109/ICCD.1997.628850","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628850","url":null,"abstract":"The effects of scaling down the MOSFET dimensions to the deep-submicron range lead to operating regions that may be modeled by locally linear equations. The piecewise linear simulator seems to be the obvious choice. Certain requirements must be satisfied by a PL simulator for the successful manipulation of the proposed MOSFET model. Simulation results demonstrate the efficiency in simulating large deep-submicron MOSFET circuits. Several levels of hierarchy can be simulated due to the uniform PL modeling technique.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121955123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Time-stamped transition density for the estimation of delay dependent switching activities 时延相关切换活动估计的时间戳跃迁密度
Hoon Choi, S. Hwang
{"title":"Time-stamped transition density for the estimation of delay dependent switching activities","authors":"Hoon Choi, S. Hwang","doi":"10.1109/ICCD.1997.628851","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628851","url":null,"abstract":"We propose a new method to improve the accuracy of the transition density for the estimation of delay dependent switching activities in combinational circuits. In the previous method, glitching sensitivity concept was defined and used to modify the transition density so as to generate and propagate the glitch. To account for the inertial delay effect, the same technique that is commonly used in logic simulators to filter out unacceptably short pulses was used. However, since it does not keep transition density at each occurrence time separately, it is not adequate for the accurate estimation of glitch and inertial delay effect. In addition, it underestimates the difference between probabilistic property of the transition density and the deterministic property of signals in logic simulations in computing inertial delay effect. We describe the problems of the previous methods and propose a new method called time-stamped transition density to solve the problems. We also show the extensions of the method to consider possible delay variation due to imperfections in the manufacturing process. The experimental results demonstrate the validity of our proposed method.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"404 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130103073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A survey of techniques for formal verification of combinational circuits 组合电路形式验证技术综述
J. Jain, A. Narayan, M. Fujita, A. Sangiovanni-Vincentelli
{"title":"A survey of techniques for formal verification of combinational circuits","authors":"J. Jain, A. Narayan, M. Fujita, A. Sangiovanni-Vincentelli","doi":"10.1109/ICCD.1997.628907","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628907","url":null,"abstract":"With the increase in the complexity of present day systems, proving the correctness of a design has become a major concern. Simulation based methodologies are generally inadequate to validate the correctness of a design with a reasonable confidence. More and more designers are moving towards formal methods to guarantee the correctness of their designs. The authors survey some state-of-the-art techniques used to perform automatic verification of combinational circuits. They classify the current approaches for combinational verification into two categories: functional and structural. The functional methods consist of representing a circuit as a canonical decision diagram. Two circuits are equivalent if and only if their decision diagrams are equal. The structural methods consist of identifying related nodes in the circuit and using them to simplify the problem of verification. They briefly describe some of the methods in both the categories and discuss their merits and drawbacks.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121213096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Design optimization for high-speed per-address two-level branch predictors 高速每地址两级分支预测器的设计优化
I-Cheng K. Chen, Chih-Chieh Lee, M. Postiff, T. Mudge
{"title":"Design optimization for high-speed per-address two-level branch predictors","authors":"I-Cheng K. Chen, Chih-Chieh Lee, M. Postiff, T. Mudge","doi":"10.1109/ICCD.1997.628854","DOIUrl":"https://doi.org/10.1109/ICCD.1997.628854","url":null,"abstract":"Per-address two-level branch predictors have been shown to be among the best predictors and have been implemented in current microprocessors. However, as the cycle time of modern microprocessors continues to decrease, the implementation of set-associative per-address two-level branch predictors will become more difficult. Instead, direct-mapped designs may be more attractive. In this paper, we investigate an alternative implementation of the per-address two-level predictor referred to as the tagless, direct-mapped predictor which is simpler and has faster access time. The tagless predictor can offer comparable performance to current set-associative designs since removal of tags allows more resources to be allocated for the predictor and branch target buffer (BTB). Removal of tags also decouples the per-address predictors from the BTB, thus allowing the two components to be optimized individually. Furthermore, our results show that this tagless implementation is more accurate because it handles conflict misses in the branch history table better. Finally, we examine the system cost-benefit for tagless per-address predictors across a wide design space using equal-cost contours. We study the sensitivity of performance to the workloads by comparing results from the Instruction Benchmark Suite (IBS) and SPEC CINT95. Our work provides principles and quantitative parameters for optimal configurations of such predictors.","PeriodicalId":154864,"journal":{"name":"Proceedings International Conference on Computer Design VLSI in Computers and Processors","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124519365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信