{"title":"Empirical and theoretical lower bounds on energy consumption for networks on chip","authors":"George B. P. Bezerra, D. Arnold, S. Forrest","doi":"10.1145/2536522.2536535","DOIUrl":"https://doi.org/10.1145/2536522.2536535","url":null,"abstract":"This paper focuses on the network on chip of multi-core systems and proposes empirical and theoretical lower bounds on the energy consumption of applications. The empirical method consists of an linear programming model that simultaneously reduces communication distances and network traffic. When applied to standard benchmarks, our method shows that locality exploitation can lead to 50% energy reduction on average compared to no optimization. The theoretical lower bound is based on the Rent's rule model from VLSI design, and is obtained analytically from the communication graph structure of applications. The theoretical results show excellent agreement with the empirical lower bound.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125310342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards optimal adaptive routing in 3D NoC with limited vertical bandwidth","authors":"Gunhee Lee, Jinho Lee, Kiyoung Choi","doi":"10.1145/2536522.2536534","DOIUrl":"https://doi.org/10.1145/2536522.2536534","url":null,"abstract":"3D NoC is one of the most promising technologies that can overcome the performance barrier of traditional multi-core system. While TSV is the most popular method to realize vertical link for 3D IC, it has many fabrication challenges and the number of TSVs can be limited in practice, thereby limiting vertical bandwidth. Some adaptive routing algorithms have been studied to overcome the bandwidth mismatch. However, there has been no previous study on how much improvement can be achieved by such new algorithms. This paper explores 3D NoC's performance upper-bound beyond which we believe no practically realizable algorithm can reach. We also propose a practically realizable algorithm that achieves performance close to the upper-bound.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115030815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Millimeter (mm)-wave wireless NoC as interconnection backbone for multicore chips: promises and challenges","authors":"P. Pande","doi":"10.1145/2536522.2536524","DOIUrl":"https://doi.org/10.1145/2536522.2536524","url":null,"abstract":"The continuing progress and integration levels in silicon technologies make complete end-user systems on a single chip possible. This massive level of integration makes modern multi-core chips all pervasive in domains ranging from weather forecasting, astronomical data analysis, and biological applications to consumer electronics and smart phones. NoCs have emerged as communication backbones to enable a high degree of integration in multi-core SoCs. Despite their advantages, an important performance limitation in traditional NoCs arises from planar metal interconnect-based multi-hop communications, wherein the data transfer between far-apart blocks causes high latency and power consumption. The latency, power consumption, and interconnect routing problems of NoCs can be simultaneously addressed by replacing multi-hop wired paths with high-bandwidth single-hop long-range wireless links. Recent investigations have established that the silicon integrated on-chip antenna operating in the millimeter wave range of a few tens to one hundred GHz is now a viable technology. Coupled with significant advances in mm-wave transceiver design, this opens up new opportunities for detailed investigations into wireless NoCs (WiNoCs). This talk will present methodologies and challenges for designing WiNoC architectures.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127653793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exploiting emerging technologies for nanoscale photonic networks-on-chip","authors":"Jun Pang, C. Dwyer, A. Lebeck","doi":"10.1145/2536522.2536525","DOIUrl":"https://doi.org/10.1145/2536522.2536525","url":null,"abstract":"In this paper, we explore the use of emerging molecular scale devices to construct nanophotonic networks --- called Molecular-scale Network-on-Chip (mNoC). We leverage quantum dot LEDs, which provide electrical to optical signal modulation, and chromophores, which provide optical signal filtering for receivers. These devices replace the ring resonators and the external laser source used in contemporary nanophotonic NoCs. We present different crossbar structures such as Single Writer Multiple Reader (SWMR) etc. We also discuss implications of the new mNoC crossbar on overall system design. An mNoC SWMR crossbar can scale up to radix 256 and our preliminary evaluation shows that it reduces over 50% average packet latency and 40% power consumption compared with ring-based alternative.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"813 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116420019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design space exploration for streaming applications on multiprocessors with guaranteed service NoC","authors":"U. Mirza, F. Gruian, K. Kuchcinski","doi":"10.1145/2536522.2536527","DOIUrl":"https://doi.org/10.1145/2536522.2536527","url":null,"abstract":"This paper addresses design space exploration for streaming applications (such as MPEG) running on multi-processor platforms with guaranteed service interconnects. In particular, we solve mapping, path selection and router configuration problems. Given the complexity of these problems, state of the art approaches in this area largely rely on greedy heuristics, which do not guarantee optimality. Our approach is based on a constraint programming formulation that combines a number of steps, sequential in classical approaches. Thus, our method has the potential of finding optimal solutions with respect to resource usage under processing and bandwidth constraints. The experimental evaluation shows that our approach is capable of exploring a range of solutions while giving the designer the opportunity to emphasize the importance of various design metrics.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"15 10","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113964899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"LEF: long edge first routing for two-dimensional mesh network on chip","authors":"Ryosuke Sasakawa, Kenji Kise","doi":"10.1145/2536522.2536530","DOIUrl":"https://doi.org/10.1145/2536522.2536530","url":null,"abstract":"In 2D mesh Network-on-Chip, if dimensions have distinct lengths, there is the difference between the performance of DORs (XY-routing and YX-routing). Furthermore, when multiple parallel applications are mapped in a 2D mesh manycore processor at a time, each parallel application are not necessarily mapped as a square mesh (like n X n mesh). In this paper, we present the routing algorithm which selects a routing algorithm performing well from both of DORs even if applications were not mapped as a square mesh.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114358795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Costs and benefits of flexibility in spatial division circuit switched networks-on-chip","authors":"Ahsen Ejaz, A. Jantsch","doi":"10.1145/2536522.2536526","DOIUrl":"https://doi.org/10.1145/2536522.2536526","url":null,"abstract":"Although most Network-on-Chip (NoC) designs are based on Packet Switching (PS), the importance of Circuit Switching (CS) should not be underestimated. Many MPSoC executing real-time applications require an underlying communication backbone that can relay messages from one node to another with guaranteed throughput. Compared to PS, CS can provide guaranteed throughput with lower area and power overheads. It is also highly suited for applications where nodes transfer long messages. Spatial Division Multiplexing (SDM) can allow more efficient use of available network resources by dividing them among multiple simultaneous transactions. The network developed by Vali [1] has three design variations based on the number of sub-channels, has a predictable connection setup time, and uses CS to provide guaranteed throughput once a connection is established. In this paper we use this network as a basis to study the effect of flexibility based on SDM, on the performance of a CS networks. A network evaluation platform has been developed to configure and evaluate networks with a maximum of 8 sub-networks, with each subnetwork comprising of 1, 2 or 4 sub-channels. We show that under uniform traffic pattern with requests of uniform random bandwidth (BW) requirement, a less flexible network outperforms a network with higher flexibility due to a phenomenon we call 'stray requests'. We conclude this paper by showing that under high network traffic, performance of our flexible networks can be as much as 113% better than HAGAR [2] and Liu's [3] network.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129401129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
V. Catania, Andrea Mineo, Salvatore Monteleone, Davide Patti
{"title":"A first effort for a distributed segment-based approach on self-assembled nano networks","authors":"V. Catania, Andrea Mineo, Salvatore Monteleone, Davide Patti","doi":"10.1145/2536522.2536532","DOIUrl":"https://doi.org/10.1145/2536522.2536532","url":null,"abstract":"In this paper we present DiSR, a first effort for a distributed segment-based approach to routing and defect mapping in a nano-scale, topology agnostic scenario based on DNA self-assembly. The main aim is exploiting the already-proven qualities of segment-based routing without neither requiring a topology graph as input, nor needing a centralized algorithm to configure network paths. After introducing the conceptual elements and the execution model of DiSR, we show how the opensource Nanoxim platform has been used to evaluate the proposed approach in the process of discovering irregular network topology while establishing network segments. Results show how DiSR still preserves some important properties (coverage, defect tolerance, scalability) while avoiding centralized tree-based broadcasting and resource hungry solutions such as virtual channels and hardware redundancy. Finally, we analyzed a first, not yet optimised gate-level hardware implementation of the required control logic and storage for DiSR, demonstrating a relatively acceptable impact ranging from 10 to about 20% of the budget of transistors available for each node.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"28 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120879935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On multicast for dynamic and irregular on-chip networks using dynamic programming method","authors":"W. Zong, Xiaohang Wang, T. Mak","doi":"10.1145/2536522.2536529","DOIUrl":"https://doi.org/10.1145/2536522.2536529","url":null,"abstract":"Mutlicast is an intrinsic communication pattern in emerging applications including the Internet service, bio-inspired computing, online data analysis, etc. Providing hardware multicast largely boosts system performance and reduce power consumption for these applications running on many-core systems. However, many-core systems suffer from dynamically changing topologies, which can be caused by traffic isolation, power management and faults. Links and routers may be removed from a subnetwork or added to a subnetwork, this imposes the routers to alter the routing paths accordingly to make the routing energy efficient. Furthermore an energy efficient multicast scheme that fits to any topology is required in this scenario. Most existing fault tolerant routings cannot detect and found energy efficient communication paths effectively for both unicast and multicast. In this work, a lightweight network couples with on-chip routers is used to propagates topological information. The shortest path between any node pair is also computed in this network using dynamic programming method. Unicast packets are routed along a shortest path to its destination, and multicast packet copies at each router is minimized based a rule called minimal replication to reduce multicast link occupation and energy consumption. Simulation results show that the proposed NoC routes 96% packets to their destinations if there exists a path, and demonstrate 40% lower latency and 30% power consumption compared with Stochastic communication. The extra hardware cost to build optimal multicast path is estimated to occupy less than 5% of the total area of a router.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128440593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On heterogeneous network-on-chip design based on constraint programming","authors":"A. Demiriz, N. Bagherzadeh","doi":"10.1145/2536522.2536528","DOIUrl":"https://doi.org/10.1145/2536522.2536528","url":null,"abstract":"Core mapping and application scheduling problems coupled with routing schemes are essential design considerations for an efficient Network-on-Chip (NoC) design. This paper discusses heterogeneous NoC design from a Constraint Programming (CP) perspective using a two-stage solution. Given a Communication Task Graph (CTG) and subsequent task assignments for cores, cores are allocated to the best possible places on the chip in the first stage in order to minimize the overall communication cost among cores. We then solve the application scheduling problem in the second stage to determine the optimum core types from a list of technological alternatives and to minimize the makespan i.e. time to complete all tasks. As a design extension, surface area constraint can be introduced to the underlying problem. The paper reports results based on real benchmark datasets from the literature.","PeriodicalId":344147,"journal":{"name":"Network on Chip Architectures","volume":"251 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131555179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}