2013 IEEE 31st International Conference on Computer Design (ICCD)最新文献

筛选
英文 中文
A TCAM generator for packet classification 一个用于分组分类的TCAM生成器
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657060
Infall Syafalni, Tsutomu Sasao
{"title":"A TCAM generator for packet classification","authors":"Infall Syafalni, Tsutomu Sasao","doi":"10.1109/ICCD.2013.6657060","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657060","url":null,"abstract":"In the internet, packets are classified by source and destination addresses and ports, as well as protocol type. Ternary content addressable memories (TCAMs) are often used to perform this operation. This paper shows a method to reduce the number of words in TCAM for multi-field classification functions. We use head-tail expressions to represent a multi-field classification rule. Furthermore, we present an O(r2)-algorithm, called MFHT, to generate simplified TCAMs for two-field classification functions, where r is the number of rules. Experimental results show that MFHT achieves a 58% reduction of words for random rules and a 52% reduction of words for ACL and FW rules. Moreover, MFHT is fast and useful for simplifying TCAM for packet classification.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114461591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Exploiting dynamic phase distance mapping for phase-based tuning of embedded systems 利用动态相位距离映射实现嵌入式系统的相位调优
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657066
Tosiron Adegbija, A. Gordon-Ross
{"title":"Exploiting dynamic phase distance mapping for phase-based tuning of embedded systems","authors":"Tosiron Adegbija, A. Gordon-Ross","doi":"10.1109/ICCD.2013.6657066","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657066","url":null,"abstract":"Phase-based tuning increases optimization potential by configuring system parameters for application execution phases. Previous work proposed phase distance mapping (PDM), which relied on extensive a priori analysis of executing applications to dynamically estimate the best configuration using the correlation between phases. We propose DynaPDM, a new dynamic phase distance mapping methodology that eliminates a priori designer effort, dynamically analyzes phases, and determines the best configurations, yielding average energy delay product savings of 28%-an 8% improvement on PDM-and configurations within 1% of the optimal.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134635929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Towards efficient dynamic data placement in NoC-based multicores 在基于noc的多核中实现高效的动态数据放置
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657067
Qingchuan Shi, Farrukh Hijaz, O. Khan
{"title":"Towards efficient dynamic data placement in NoC-based multicores","authors":"Qingchuan Shi, Farrukh Hijaz, O. Khan","doi":"10.1109/ICCD.2013.6657067","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657067","url":null,"abstract":"Next generation multicores will process massive data with significant sharing. Since future processors will also be inherently limited by the off-chip bandwidth, the on-chip data management is emerging as a first-order design constraint. On-chip memory latency increases as more cores are added since the diameter of most on-chip networks increases with the number of cores. We observe that a large fraction of on-chip traffic originates from communication between the cores to maintain cache coherence. Motivated by these observations, we propose a novel on-chip data placement mechanism that optimizes shared data placement by minimizing the distance of data from the requesting cores (improve locality) while paying attention to load balancing network contention and the utilization of percore cache capacity. Using simulations of a 64-core multicore, we show that our proposal outperforms state-of-the-art static and dynamic data placement mechanisms by an average of 5.5% and 8.5% respectively.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130578939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Resource allocation algorithms for guaranteed service in application-specific NoCs 特定应用noc中保证服务的资源分配算法
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657088
Gongmin Yang, Hao He, Jiang Hu
{"title":"Resource allocation algorithms for guaranteed service in application-specific NoCs","authors":"Gongmin Yang, Hao He, Jiang Hu","doi":"10.1109/ICCD.2013.6657088","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657088","url":null,"abstract":"Networks-on-chip (NoC) has been recognized as a scalable approach to cope with the increasingly large demand for on-chip communication. This work focuses on how to achieve guaranteed service for application-specific NoCs through resource reservation. A graph model is adopted to describe physical and temporal sources of an NoC in a unified manner. Based on the graph model, two resource allocation heuristics are proposed and investigated. One heuristic leverages the idea of chip layout routing and the other utilizes Boolean satisfiability. Results from simulation from various testcases indicate that the proposed methods significantly outperform a state-of-the-art previous work.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128889995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Lazy Precharge: An overhead-free method to reduce precharge overhead for memory parallelism improvement of DRAM system 惰性预充:一种减少预充开销以提高DRAM系统内存并行性的无开销方法
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657036
Zhang Tao, Cong Xu, Yuan Xie, Guangyu Sun
{"title":"Lazy Precharge: An overhead-free method to reduce precharge overhead for memory parallelism improvement of DRAM system","authors":"Zhang Tao, Cong Xu, Yuan Xie, Guangyu Sun","doi":"10.1109/ICCD.2013.6657036","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657036","url":null,"abstract":"As we enter the multi-core era, the main memory becomes the bottleneck due to the exploded memory requests. In this work, we propose a novel memory architecture-Lazy Precharge (LaPRE) that enables aggressive activation schemes so that multiple rows in a bank can be activated successively without the interrupt from precharges. Therefore, LaPRE effectively reduces the precharge overhead and thus improves memory parallelism. In addition, three memory scheduling schemes are proposed correspondingly to fully make use of the improved memory parallelism. The experimental results show that LaPRE can achieve 14% performance improvement on average without hardware overhead.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117043371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Equivalence checking for compiler transformations in behavioral synthesis 行为综合中编译器转换的等价性检查
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657090
Zhenkun Yang, K. Hao, Kai Cong, S. Ray, Fei Xie
{"title":"Equivalence checking for compiler transformations in behavioral synthesis","authors":"Zhenkun Yang, K. Hao, Kai Cong, S. Ray, Fei Xie","doi":"10.1109/ICCD.2013.6657090","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657090","url":null,"abstract":"Behavioral synthesis entails application of a sequence of transformations to compile a high-level description of a hardware design (e.g., in C/C++/SystemC) into a Register-Transfer Level (RTL) implementation. We present a scalable equivalence checking framework to validate the correctness of compiler transformations employed by behavioral synthesis. Our approach is based on dual-rail symbolic simulation of the input and output design representations of a transformation. We have evaluated our framework on transformations applied to several designs by an open source behavioral synthesis tool, and we present initial results demonstrating the approach.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124355178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Optimizing post-silicon conformance checking 优化后硅一致性检查
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657092
Li Lei, Kai Cong, Fei Xie
{"title":"Optimizing post-silicon conformance checking","authors":"Li Lei, Kai Cong, Fei Xie","doi":"10.1109/ICCD.2013.6657092","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657092","url":null,"abstract":"Virtual prototypes of hardware devices, a.k.a, virtual devices, are increasingly used to enable early software development before silicon prototypes/devices are available. In previous work, we presented a post-silicon conformance checking approach to detecting interface state inconsistencies between a silicon device and its virtual device. In this paper, we present an optimization, adaptive concretization, to reduce the overhead incurred by symbolic execution, a key technique used in our conformance checking approach. We have evaluated our optimized approach on three Ethernet adapters and their virtual devices. The results demonstrate that it is effective and efficient: 21 inconsistencies are discovered and time usages are reduced by an order of magnitude, comparing to the previous approach.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132704777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
CG-Resync: Conversion-guided resynchronization for a SSD-based RAID array CG-Resync: ssd RAID的转换引导重同步
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657081
Letian Yi, J. Shu, Jiaxin Ou, Weimin Zheng
{"title":"CG-Resync: Conversion-guided resynchronization for a SSD-based RAID array","authors":"Letian Yi, J. Shu, Jiaxin Ou, Weimin Zheng","doi":"10.1109/ICCD.2013.6657081","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657081","url":null,"abstract":"SSD-based RAID arrays have been widely adopted in large-scale systems. One requirement on a RAID is to provide data consistency, which can be an issue during serving write requests. While using NVRAM or on-storage logging can ensure the consistency, the approaches can either be very expensive or substantially compromise performance. For SSD-based RAID, scanning the entire storage space during rebooting after a crash can recover the consistency. However, it takes a long resychronization time. To address the issue efficiently and cost-effectively, we propose CG-Resync, a scheme providing consistency assurance for SSD-based RAIDs by leveraging logging mechanism readily available in almost all SSDs for accommodating flash's out-of-place-write requirement. To identify uncompleted writes resulting in inconsistent stripes, we use guided conversion in managing SSD's internal logs. In particular, only when a stripe becomes consistent does CG-Resync allow the updated data on the stripe to be removed from the log. We evaluate CG-Resync and experiments show that it provides improved RAID reliability and availability upon a crash with little performance loss during regular I/O operations.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129280478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
FastLanes: An FPGA accelerated GPU microarchitecture simulator FastLanes: FPGA加速GPU微架构模拟器
2013 IEEE 31st International Conference on Computer Design (ICCD) Pub Date : 2013-10-01 DOI: 10.1109/ICCD.2013.6657049
Kuan Fang, Yufei Ni, Jiayuan He, Zonghui Li, Shuai Mu, Yangdong Deng
{"title":"FastLanes: An FPGA accelerated GPU microarchitecture simulator","authors":"Kuan Fang, Yufei Ni, Jiayuan He, Zonghui Li, Shuai Mu, Yangdong Deng","doi":"10.1109/ICCD.2013.6657049","DOIUrl":"https://doi.org/10.1109/ICCD.2013.6657049","url":null,"abstract":"Graphic Processing Units (GPUs) have emerged as a new general purpose computing platform that attracts significant research efforts. Currently, GPU architecture research resorts to time-consuming software simulations to evaluate microarchitecture innovations. In this paper, we propose FastLanes, an FPGA based simulator for a generic GPU microarchitecture, to enable hardware-accelerated simulation. FastLanes consists of a function model and a timing model, both implemented on FPGA. The functional model implements the full functionality of a multiprocessor of GPU and emulates multiple multiprocessors via time-division multiplexing. We develop a hybrid implementation strategy in which certain GPU logic is directly mapped to FPGA while the other logic is simulated by reusing the same FPGA logic. A corresponding context shifting mechanism is proposed to store execution states of threads from FPGA to external on-board memory, and vice versa. Such a mechanism makes it possible to simulate hundreds of GPU cores on a single FPGA evaluation board. Driven by the functional simulation results, the timing model considers the detailed configuration of GPU microarchitecture to derive the performance evaluation. A compiler tool-chain is also developed to allow the execution of NVIDIA GPU binary on FastLanes. Experimental results prove that FastLanes outperforms its software equivalent by up to 2 orders of magnitude.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114665791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信