14th IEEE Symposium on High-Performance Interconnects (HOTI'06)最新文献

筛选
英文 中文
Designing Full-Connectivity WDM Optical Interconnects with Reduced Switching and Conversion Complexity 降低交换和转换复杂度的全连接WDM光互连设计
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.11
H. Hamza, J. Deogun
{"title":"Designing Full-Connectivity WDM Optical Interconnects with Reduced Switching and Conversion Complexity","authors":"H. Hamza, J. Deogun","doi":"10.1109/HOTI.2006.11","DOIUrl":"https://doi.org/10.1109/HOTI.2006.11","url":null,"abstract":"Most existing wavelength division multiplexing (WDM) optical interconnects make use of a large number of switching elements and require wide-range tunable wavelength converters to support full-connectivity among inputs and outputs. This results in complex and expensive designs. In this paper, we propose new full-connectivity single-stage and multi-stage WDM optical interconnects with and reduced hardware complexity. The proposed designs require a smaller number of switching elements and use only fixed-range wavelength conversion. Analysis of hardware complexity shows that, the proposed designs have a smaller number of switching and conversion costs compared to most existing interconnects","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126789448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Fast Buffer Memory with Deterministic Packet Departures 具有确定性包偏离的快速缓冲存储器
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.13
Mayank Kabra, Siddharth Saha, Bill Lin
{"title":"Fast Buffer Memory with Deterministic Packet Departures","authors":"Mayank Kabra, Siddharth Saha, Bill Lin","doi":"10.1109/HOTI.2006.13","DOIUrl":"https://doi.org/10.1109/HOTI.2006.13","url":null,"abstract":"High-performance routers need to store temporarily a large number of packets in response to congestion. DRAM is typically used to implement the needed packet buffers, but DRAM devices are too slow to match the bandwidth requirements. To bridge the bandwidth gap, a number of hybrid SRAM/DRAM packet buffer architectures have been proposed (S. Iyer and N. Mckeown, 2002) (S. Kumar et al., 2005). These packet buffer architectures assume a very general model where the buffer consists of many logically separated FIFO queues that may be accessed in random order. For example, virtual output queues (VOQs) are used in crossbar routers, where each VOQ corresponds to a logical queue corresponding to a particular output. Depending on the scheduling algorithm used, the access pattern to these logical queues may indeed be at random. However, for a number of router architectures, this worst-case random access assumption is unnecessary since packet departure times are deterministic. One architecture is the switch-memory-switch router architecture (A. Prakash et al., 2002) (S. Iyer et al., 2002) that efficiently mimics an output queueing switch. Another architecture is the load-balanced router architecture (C.S. Chang et al., 2002) (I. Keslassy et al., 2003) that has interesting scalability properties. In these architectures, for best-effort routing, the departure times of packets can be deterministically calculated before inserting packets into packet buffers. In this paper, we describe a novel packet buffer architecture based on interleaved memories that takes advantage of the known packet departure times to achieve simplicity and determinism. The number of interleaved DRAM banks required to implement the proposed packet buffer architecture is independent of the number of logical queues, yet the proposed architecture can achieve the performance of an SRAM implementation","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"88 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114086291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
ExpressEther - Ethernet-Based Virtualization Technology for Reconfigurable Hardware Platform 基于ExpressEther - ethernet的可重构硬件平台虚拟化技术
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.12
Jun Suzuki, Y. Hidaka, J. Higuchi, T. Yoshikawa, A. Iwata
{"title":"ExpressEther - Ethernet-Based Virtualization Technology for Reconfigurable Hardware Platform","authors":"Jun Suzuki, Y. Hidaka, J. Higuchi, T. Yoshikawa, A. Iwata","doi":"10.1109/HOTI.2006.12","DOIUrl":"https://doi.org/10.1109/HOTI.2006.12","url":null,"abstract":"We propose ExpressEther as Ethernet-based virtualization technology for a reconfigurable hardware platform. It groups modularized hardware resources interconnected by an Ethernet, and transports a PCI express (PCIe) packet between the grouped modules by encapsulating it into an Ethernet frame. The configuration of the group is dynamically reconfigured by an Ethernet connection, followed by standardized PCIe hot-plug event. Such reconfigurability enables sharing of a physical resource among computer entities. We demonstrate that an I/O device is shared by servers, using our developed prototype consisted of an interface card and I/O concentrator. A commercially available server and serial ATA card are used for the demonstration, without any change for an operating system, device driver, PCIe interface, and Ethernet switch. The benchmark of I/O performance shows at most 16% degradation, which is caused by the implementation matters of our prototype. No degradation is measured when data flow from an I/O to a server","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125386886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Loosely Coupled TCP Acceleration Architecture 松散耦合的TCP加速架构
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.18
Lea Shalev, Vadim Makhervaks, Zorik Machulsky, G. Biran, J. Satran, Muli Ben-Yehuda, Ilan Shimony
{"title":"Loosely Coupled TCP Acceleration Architecture","authors":"Lea Shalev, Vadim Makhervaks, Zorik Machulsky, G. Biran, J. Satran, Muli Ben-Yehuda, Ilan Shimony","doi":"10.1109/HOTI.2006.18","DOIUrl":"https://doi.org/10.1109/HOTI.2006.18","url":null,"abstract":"We present a novel approach for scalable network acceleration. The architecture uses limited hardware support and preserves protocol processing flexibility, combining the benefits of TCP offload and onload. The architecture is based on decoupling the data movement functions, accelerated by a hardware engine, from complex protocol processing, controlled by an isolated software entity running on a central CPU. These operate in parallel and interact asynchronously. We describe a prototype implementation which achieves multi-gigabit throughput with extremely low CPU utilization","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127941236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A New Dynamic Bandwidth Re-Allocation Technique in Optically Interconnected High-Performance Computing Systems 一种新的光互联高性能计算系统动态带宽重分配技术
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.6
Avinash Karanth Kodi, A. Louri
{"title":"A New Dynamic Bandwidth Re-Allocation Technique in Optically Interconnected High-Performance Computing Systems","authors":"Avinash Karanth Kodi, A. Louri","doi":"10.1109/HOTI.2006.6","DOIUrl":"https://doi.org/10.1109/HOTI.2006.6","url":null,"abstract":"As bit rates increase, optical interconnects based high-performance computing (HPC) systems improve performance by increasing the available bandwidth (using wavelength-division multiplexing (WDM) and space-division multiplexing (SDM)) and decreasing power dissipation as compared to traditional electrical interconnects. While static allocation of wavelengths (channels) in optical interconnects provide every node with equal opportunity for communication, it can lead to network congestion for non-uniform traffic patterns. In this paper, we propose an opto-electronic interconnect for designing a flexible, high-bandwidth, low-latency, dynamically reconfigurable architecture for scalable HPC systems. Reconfigurability is realized by monitoring traffic intensities, and implementing dynamic bandwidth re-allocation (DBR) technique that adapts to changes in communication patterns. We propose a DBR technique - lock-step (LS) that balances the load on each communication channel based on past utilization. Simulation results indicate that the reconfigured architecture shows 40% increased throughput and 20% reduced network latency as compared to HPC electrical networks","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122214484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Case Study in I/O Disaggregation using PCI Express Advanced Switching Interconnect (ASI) 基于PCI Express高级交换互连(ASI)的I/O分解案例研究
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.5
V. Krishnan, Todd Comins, R. Stalzer, David Wong
{"title":"A Case Study in I/O Disaggregation using PCI Express Advanced Switching Interconnect (ASI)","authors":"V. Krishnan, Todd Comins, R. Stalzer, David Wong","doi":"10.1109/HOTI.2006.5","DOIUrl":"https://doi.org/10.1109/HOTI.2006.5","url":null,"abstract":"Decoupling the processor and I/O subsystem provides immense benefits that include high availability, efficient allocation and cost-effective upgrade of system resources. Such a disaggregation model calls for a high-performance interconnect to isolate the processor and I/O subsystem domains, yet provide the veneer of a single system. PCI express (PCIe) is one such interconnect and is becoming the de-facto I/O fabric. However, PCIe, as specified currently, provides limited support for I/O disaggregation and does not yet natively support dynamic sharing of I/O resources amongst processor subsystems - this is the next major step in I/O disaggregation. PCI express advanced switching interconnect (ASI) is well-suited for enhancing the capabilities of PCIe in a non-disruptive manner. ASI is built upon PCIe and has the innate ability to co-exist with PCIe devices due to its commonality of the link/physical layer with PCIe as well as its native support for encapsulating PCIe packets. Towards a simple yet illustrative demonstration of ASI-based disaggregation of PCIe devices, we employed StarGen's ASI products for creating a basic ASI fabric and disaggregated a PCIe based GigE NIC from a host system. The initial set of results showed a marginal effect on the application's latency, but contrary to expectations, the throughput was significantly impacted. Further analysis revealed that this unexpected drop in throughput could be rectified easily and indeed, the final results confirm that the use of ASI for supporting I/O disaggregation does not result in sub-optimal utilization of the GigE NIC","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128136075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Scheduling Traffic Matrices On General Switch Fabrics 在一般交换结构上调度流量矩阵
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.22
Xiang Wu, A. Prakash, M. Mohiyuddin, A. Aziz
{"title":"Scheduling Traffic Matrices On General Switch Fabrics","authors":"Xiang Wu, A. Prakash, M. Mohiyuddin, A. Aziz","doi":"10.1109/HOTI.2006.22","DOIUrl":"https://doi.org/10.1109/HOTI.2006.22","url":null,"abstract":"A traffic matrix is an |S| times |T| matrix M, where Mij is a non-negative integer encoding the number of packets to be transferred from source i to sink j. Chang et al. (2001) have shown how to efficiently compute an optimum schedule for transferring packets from sources to sinks when the sources and sinks are connected via a rearrangeable fabric such as crossbar. We address the same problem when the switch fabric is not rearrangeable. Specifically, we (1) prove that the optimum scheduling problem is NP-hard for general switch fabrics, (2) identify a sub-class of fabrics for which the problem is polynomial-time solvable, and (3) develop a heuristic for the general case","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122321790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Strictly Nonblocking Multicasting WDM Optical Cross Connects Using Multiwavelength Converters 使用多波长转换器的严格无阻塞多播WDM光交叉连接
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.24
H. Hamza, J. Deogun
{"title":"Strictly Nonblocking Multicasting WDM Optical Cross Connects Using Multiwavelength Converters","authors":"H. Hamza, J. Deogun","doi":"10.1109/HOTI.2006.24","DOIUrl":"https://doi.org/10.1109/HOTI.2006.24","url":null,"abstract":"In this paper, we propose new strictly nonblocking multicast capable optical cross connects (MC-OXCs) architectures that exploit the potential of multi-wavelength converters (MWCs). An MWC is capable of simultaneously replicating a signal on an input wavelength to several output wavelengths. We investigate two families of MCOXCs based on use of full- and limited-range MWCs, and present a number of architectures in each of the two families. Proposed architectures present a trade-off between switching complexity, conversion cost, and signal loss","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131322621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Reconfigurable Architecture for Multi-Gigabit Speed Content-Based Routing 多千兆速度基于内容路由的可重构架构
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.7
J. Moscola, Young-Hee Cho, J. Lockwood
{"title":"A Reconfigurable Architecture for Multi-Gigabit Speed Content-Based Routing","authors":"J. Moscola, Young-Hee Cho, J. Lockwood","doi":"10.1109/HOTI.2006.7","DOIUrl":"https://doi.org/10.1109/HOTI.2006.7","url":null,"abstract":"This paper presents a reconfigurable architecture for high-speed content-based routing. Our architecture goes beyond simple pattern matching by implementing a parsing engine that defines the semantics of patterns that are parsed within the data stream. Defining the semantics of patterns allows for more accurate processing and routing of packets using any fields that appear within the payload of the packet. The architecture consists of several components, including a pattern matcher, a parsing structure, and a routing module. Both the pattern matcher and parsing structure are automatically generated using an application-specific compiler that is described in this paper. The compiler accepts a grammar specification as input and outputs a data parser in VHDL. The routing module receives control signals from both the pattern matcher and the parsing structure that aid in the routing of packets. We illustrate how a content-based router can be implemented with our technique using an XML parser as an example. The XML parser presented was designed, implemented, and tested in a Xilinx Virtex XCV2000E FPGA on the FPX platform. It is capable of processing 32-bits of data per clock cycle and runs at 100 MHz. This allows the system to process and route XML messages at 3.2 Gbps","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131119257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A Single-Cycle Multi-Match Packet Classification Engine Using TCAMs 基于tcam的单周期多匹配包分类引擎
14th IEEE Symposium on High-Performance Interconnects (HOTI'06) Pub Date : 2006-08-23 DOI: 10.1109/HOTI.2006.8
M. Nourani, M. Faezipour
{"title":"A Single-Cycle Multi-Match Packet Classification Engine Using TCAMs","authors":"M. Nourani, M. Faezipour","doi":"10.1109/HOTI.2006.8","DOIUrl":"https://doi.org/10.1109/HOTI.2006.8","url":null,"abstract":"Most conventional packet classifiers find the highest priority filter that matches the packet. However, new networking applications such as network intrusion detection systems and load balancers require all (or the first few) matching results in packet classification. An efficient TCAM-based architecture for multi-match search is introduced in this paper. We propose a novel partitioning scheme based on filters and their intersection properties. An efficient contention resolver unit is designed to enhance performance of the search by choosing only one partition. Our approach finds all matches in exactly one conventional TCAM cycle while reducing the power consumption by at least two orders of magnitude","PeriodicalId":288349,"journal":{"name":"14th IEEE Symposium on High-Performance Interconnects (HOTI'06)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124190861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信