{"title":"Micronmesh for fault-tolerant GALS Multiprocessors on FPGA","authors":"H. Kariniemi, J. Nurmi","doi":"10.1109/ISSOC.2008.4694870","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694870","url":null,"abstract":"System-on-Chip (SoC) circuits have evolved to single chip Multiprocessor systems. Due to increasing variance of process parameters, which produces synchronization problems on large SoCs, a globally-asynchronous locally-synchronous (GALS) design style must have been mobilized. In addition, the large VLSI circuits are also becoming more susceptible to transient and intermittent faults which can corrupt their operation. This paper presents a new micronmesh network-on-chip (NoC) which is targeted to fault-tolerant communication of GALS Multiprocessor SoCs (MPSoC). It is fully synthesizable with current design tools and it can be used for prototyping MPSoCs on FPGA circuits. The Micronmesh incorporates a new improved fault-diagnosis-and-repair (FDAR) system which is able to diagnose and repair also buffer memories in addition to wire connections while fault-tolerant DOR (FTDOR) routing is used for routing packets to their destinations around defected parts. Owing to the FDAR system and the FTDOR Micronmesh degrades gracefully as permanent faults appear and it is able to recover from transient and intermittent faults. The fault-tolerance of the Micronmesh is also improved by switch-to-switch (S2S) level retransmissions which reduce the number of end-to-end (E2E) level retransmissions that produce considerably higher latencies. These methods targeted at improving the fault-tolerance are also becoming necessary for improving the manufacturability of the circuits in the future.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"779 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122626449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
V. Guzma, S. Bhattacharyya, Pertti Kellomäki, J. Takala
{"title":"Trade-offs in mapping high-level dataflow graphs onto ASIPs","authors":"V. Guzma, S. Bhattacharyya, Pertti Kellomäki, J. Takala","doi":"10.1109/ISSOC.2008.4694876","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694876","url":null,"abstract":"Data-flow based design environments bring advantages of specification, validation and synthesis to embedded systems design by decoupling computation from transfer of data. The former is performed by actors, and data transfer between actors and an execution order of actors is determined by scheduling and buffering strategies. In this work, we examine code sizes and cycle counts resulting from combinations of scheduling and buffering techniques. The experiments were carried out by designing an application specific instruction-set processor streamlined for each of the benchmarks, using a codesign environment called TCE. We also show what additional overhead is introduced when an architecture implemented using our approach is employed for an application outside its targeted domain.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"918 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116626304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Heikki Orsila, E. Salminen, Marko Hännikäinen, T. Hämäläinen
{"title":"Evaluation of heterogeneous multiprocessor architectures by energy and performance optimization","authors":"Heikki Orsila, E. Salminen, Marko Hännikäinen, T. Hämäläinen","doi":"10.1109/ISSOC.2008.4694884","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694884","url":null,"abstract":"Design space exploration aims to find an energy-efficient architecture with high performance. A trade-off is needed between these goals, and the optimization effort should also be minimized. In this paper, we evaluate heterogeneous multiprocessor architectures by optimizing both energy and performance for applications. Ten random task graphs are optimized for each architecture, and evaluated with simulations. The energy versus performance trade-off is analyzed by looking at Pareto optimal solutions. It is assumed that there is a variety of processing elements whose number, frequency and microarchitecture can be modified for exploration purposes. It is found that both energy-efficient and well performing solutions exist, and in general, performance is traded for energy-efficiency. Results indicate that automated exploration tools are needed when the complexity of the mapping problem grows, starting already with our experiment setup: 6 types of PEs to select from, and the system consists of 2 to 5 PEs. Our results indicate that our Simulated Annealing method can be used for energy optimization with heterogeneous architectures, in addition to performance optimization with homogeneous architectures.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128703723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"PER performance enhancement through antenna and transceiver co-design for multi-band OFDM UWB communication","authors":"Peng Wang, H. Tenhunen, Dian Zhou, Lirong Zheng","doi":"10.1109/ISSOC.2008.4694875","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694875","url":null,"abstract":"This paper investigates the packet error rate (PER) performance enhancement through the antenna and the transceiver co-design for MB-OFDM UWB system. Five different UWB antennas, covering the whole UWB spectrum, are selected for study. Through the link-margin analysis and PER performance simulation, radio transceiver design specifications are optimized according to different antennaspsila performance at different band groups. Transmitter pre-distortion and receiver equalization are applied at the front-end of the transceiver to co-design with the antenna, which further enhance PER performance. Our study reveals that, antenna is an important part of radio transceiver which has to be considered in advance during the chip design, particularly in the ultra-wide band radio system. Through the antenna and radio front-end co-design, not only could PER performance be enhanced, but also could design parameters be relaxed for the power amplifier (PA) and the low noise amplifier (LNA), particularly for higher band groups.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124202441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
H. Kooti, M. Mirza-Aghatabar, S. Hessabi, Arash Tavakkol
{"title":"Energy analysis of re-injection based deadlock recovery routing algorithms","authors":"H. Kooti, M. Mirza-Aghatabar, S. Hessabi, Arash Tavakkol","doi":"10.1109/ISSOC.2008.4694864","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694864","url":null,"abstract":"There are two strategies for deadlock handling in routing algorithms in NoC: deadlock avoidance and deadlock recovery. Some deadlock recovery routing algorithms are re-injection based, such as: Compressionless (CR), Software-Based (SW_TFAR) and AFBAR. In spite of the performance comparison, none of existing researches have focused on the energy consumption of various routing algorithms. We evaluate these routing algorithms according to their energy consumption and latency. Our experimental results show the better performance and worse energy consumption of deadlock recovery routing algorithms compared to deadlock avoidance routing algorithms. In addition, the best and worst energy consumption is dedicated to AFBAR and CR, respectively.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121565661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A state based framework for efficient system-level power estimation of of costum reconfigurable cores","authors":"A. Ahmadinia, Balal Ahmad, T. Arslan","doi":"10.1109/ISSOC.2008.4694889","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694889","url":null,"abstract":"This paper presents a new system level power estimation methodology based on transaction level modeling for costum reconfigurable cores. The methodology can lead to significant improvement in trade-off between accuracy and efficiency of power estimation at system level. A SystemC based simulation environment is presented that allows rapid introduction of a power model into the executable specification of a sophisticated reconfigurable hardware design. The proposed environment allows efficient power estimation of custom reconfigurable cores through state based power modeling, leading to a viable solution for early power aware design. The simulator has been applied to SystemC module of a custom reconfigurable core for Viterbi decoding. Power figures have been compared with the results obtained by state of the art industrial tools.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115039935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analyzing models of computation for software defined radio applications","authors":"H. Berg, C. Brunelli, Ulf Lücking","doi":"10.1109/ISSOC.2008.4694886","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694886","url":null,"abstract":"Applying design principles and methodologies constituted in the software domain and being adapted to the complete execution environment provides new perspectives for future multi-radio computers. In order to share the underlying hardware resources efficiently, the overall system architecture and related programming model has to support dynamic behavior and extensive changes in the configuration during run-time. The requirements for such a multi-radio computer are demanding, as there will be various radio access stacks with inhomogeneous characteristics executing in parallel. This implies a configuration and control framework, besides the different protocol stacks, that is aware of the managed system in every state and is capable of dynamically scheduling different dataflow graphs corresponding to the applications running on the underlying system. This paper presents the main concepts behind such a reactive system, focusing in particular on the proposed model of computation, giving an overview on the software architecture and related problems to be solved.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115920376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Stavros Georgiopoulos, G. Dimitroulakos, C. Goutis
{"title":"Integrating high speed multipliers in Coarse Grain Reconfigurable Arrays","authors":"Stavros Georgiopoulos, G. Dimitroulakos, C. Goutis","doi":"10.1109/ISSOC.2008.4694885","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694885","url":null,"abstract":"The efficiency of a Coarse Grained Reconfigurable Array architecture in terms of performance and hardware cost is hard to be determined. Until now, few case studies have been published to determine the impact of the architecture parameters on the Instructions per Cycle and the architecture area. However, none of those have considered the impact of multipliers embedded in the Processing Elements of Coarse Grain Reconfigurable Array architectures. This paper focuses on multipliers both from the compiler and the architecture perspective. An already existing exploration framework has been used for our study. It consists of two parts: a) an existing retargetable compiler from which the mapping efficiency is estimated and b) from the parametric realization of the coarse grained reconfigurable array in hardware description language (VHDL). The latter is used as input in the Synopsys Design Compiler for the estimation of the area and clock frequency of each architecture instance. The system has been realized using the 0.13 mum process of ASIC technology. The experiments report the system area, clock frequency and performance for different embedded multipliers.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130031569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Congestion-aware task mapping in heterogeneous MPSoCs","authors":"Ewerson Carvalho, F. Moraes","doi":"10.1109/ISSOC.2008.4694878","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694878","url":null,"abstract":"Multiprocessors systems-on-chip (MPSoCs) are a trend in VLSI design, since they minimize the design crisis represented by the gap between the silicon technology and the actual SoC design capacity. MPSoCs may employ NoCs to integrate several programmable processors, specialized memories, and other specific IPs in a scalable way. Besides communication infrastructure, another important issue in MPSoCs is task mapping. Dynamic task mapping is needed, since the number of tasks running in the MPSoC may exceed the available resources. Most works in literature present static mapping solutions, not appropriate for this scenario. This paper investigates the performance of mapping algorithms in NoC-based heterogeneous MPSoCs, targeting NoC congestion minimization. The use of the proposed congestion-aware heuristics reduces the NoC channel load, congestion, and packet latency.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130246424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Taeyoon Kim, Wonki Park, Hee-Sun Ahn, Kyongwon Min, Sangyong Lee, Jongchan Choi, Chulwoo Kim, Kynnyun Kim, Sungchul Lee
{"title":"A 110 dB, 3-mW fourth-order Σ-Δ modulator for atmospheric pressure sensor","authors":"Taeyoon Kim, Wonki Park, Hee-Sun Ahn, Kyongwon Min, Sangyong Lee, Jongchan Choi, Chulwoo Kim, Kynnyun Kim, Sungchul Lee","doi":"10.1109/ISSOC.2008.4694888","DOIUrl":"https://doi.org/10.1109/ISSOC.2008.4694888","url":null,"abstract":"In this paper, a 110 dB, 1.024 MHz fourth-order single-loop sigma-delta modulator has been presented with an over-sampling ratio of 128 and an overload factor of -6 dB for a bandwidth of 4 kHz. In particular, this Sigma-Delta modulator is well suited for atmospheric pressure sensor. The whole modulator consumes only 3-mW from a single 3.3 V supply in a 0.35-mum CMOS technology and chip size is 1.68 mm2.","PeriodicalId":168022,"journal":{"name":"2008 International Symposium on System-on-Chip","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133190868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}