{"title":"A New LMMSE Receiver Architecture With Dynamic Filter Length Optimisation","authors":"M. Tennant, A. Erdogan, T. Arslan, J. Thompson","doi":"10.1109/ISSOC.2007.4427439","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427439","url":null,"abstract":"This paper presents a novel architecture for tap-length optimisation of the linear LMS adaptive filter within an LMMSE receiver architecture. No investigation has previously been carried out to determine the suitability of this concept or the power saving that can be achieved. A low-complexity length update algorithm is employed to dynamically adjust and optimise the number of taps in the adaptive filter present within the LMMSE receiver according to channel conditions. The results show that the chosen algorithm presents minimal overhead and reduces power consumed due to optimisation of the filter length. This paper presents the first architectural VLSI implementation of the LMMSE receiver using the length optimised adaptive filter and includes a performance study in terms of area and power.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"302 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115651398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rendezvous-based MoC for untimed TLM","authors":"R. Jindal, L. Maillet-Contoz","doi":"10.1109/ISSOC.2007.4427449","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427449","url":null,"abstract":"Transaction level modeling (TLM) is increasingly been touted as the next abstraction level for SoC design after the register-transfer level. The benefits of this new approach are many and find its applications in verification, software validation and architectural exploration. As the TLM systems are modeled as a set of parallel processes, issues of communication, synchronization, and timing are essential to understand, analyze and verify for overall system behavior -choosing an appropriate Model of Computation (MoC) for this purpose becomes important.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114222698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3-Gb/s, Single-ended Adaptive Equalization of Bidirectional Data over a Multi-drop Bus","authors":"H. Fredriksson, C. Svensson","doi":"10.1109/ISSOC.2007.4427445","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427445","url":null,"abstract":"This paper presents a design for single-ended adaptive equalization. The design enables mitigation of inter-symbol interference in communication systems where it is desirable to utilize signal processing resources on only one side of a communication channel. Utilizing the reciprocity principle we show that this idea is suitable for both point-to-point and point-to-multi-point links. Simulation results show that the presented design can mitigate ISI generated by a four drop memory bus at 3 Gb/s.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"8 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123541823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Control and datapath decoupling in the design of a NoC switch: area, power and performance implications","authors":"S. Medardoni, D. Bertozzi, L. Benini, E. Macii","doi":"10.1109/ISSOC.2007.4427438","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427438","url":null,"abstract":"Networks on chip are emerging as a disruptive technology to tackle the problem of scalable on-chip communication. An intensive research effort is being devoted to customizing generic network building blocks for specific design objectives such as low-latency or low-power. In this work, we identify in control and datapath decoupling inside a switch architecture an effective means of achieving the needed flexibility, while taking into account the switching, buffering and flow control implications of each design point. We deploy a 65 nm low-power technology library to explore the performance-power trade-off in the design of a NoC switch with area awareness, while leveraging placement-aware logic synthesis tools to deal with the predictability challenges posed by nanoscale designs.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124790741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Configurable Hardware/Software Support for Single Processor Real-Time Kernels","authors":"S. Nordstrom, L. Asplund","doi":"10.1109/ISSOC.2007.4427426","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427426","url":null,"abstract":"Today's software based real-time operating systems (RTOS) have the ability to be configured in order to decrease memory footprint, important when the RTOS is used in resource restricted embedded environments. When the kernel is implemented in hardware, not only memory footprint is motivation for configuration for minimization, the number of logic cells occupied in the PPGA has to be considered as well. We have modified existing hardware support, the real-time unit (RTU) for increased configurability. The modified RTU was compared regarding configuration and footprint with a corresponding system based on the commercial software RTOS MicroC/OS-ll. Four different configuration settings were compared and the results show that the RTU memory footprint was 24% to 38% of the size of the MicroC/OS-II footprint. In FPGA area, the smallest configuration of the RTU occupied only 50% of the logic cells used by the largest configuration. Configurability of an RTOS with hardware support, like the RTU, decreases project cost since it is more likely to be able to use a low cost FPGA when adjusting implemented functionality to only include what is required by the application.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116343369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Managing Reconfigurable Resources in Heterogeneous Cores Using Portable Pre-Synthesized Templates","authors":"M. Santambrogio, M. Giani, S. Memik","doi":"10.1109/ISSOC.2007.4427446","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427446","url":null,"abstract":"In this paper we consider multi-FPGAs, reconfiguration and system description portability as the processes of specifying and modeling a complete system before it is partitioned and committed to a style/flow of implementation. In the case of a high performance computing cluster employing FPGAs, the reconfigurable elements need to be dynamically re-allocated and reconfigured based on the prevailing workload at a given instance. We particularly target fast configuration and task migration in high performance computing systems, such as server farms. This paper presents a design flow, based in resource sharing across configurations on the reconfigurable devices, which can manage the resource allocation and reconfiguration quickly when resources need to be migrated between different applications. The resources shared across configuration are defined using an isomorphic-driven partitioning approach. This technique detects recurrent structures and produces a partitioned specification in which the identified clusters are instances of repeating templates in the original graph used to describe the input application. Two algorithms for regularity extraction were implemented. The performances of the algorithms were compared with regard to the size and number of regular structures identified by the two approaches, as well as the lime taken to perform their task.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"128 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130801952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Bulk Built In Current Sensor Approach for Single Event Transient Detection","authors":"G. Wirth, C. Fayomi","doi":"10.1109/ISSOC.2007.4427422","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427422","url":null,"abstract":"Radiation effects, particularly single event transients (SETs), are increasingly affecting the reliability of integrated circuits as device dimensions are scaling down. This paper presents the use of bulk built in current sensors (Bulk-BICS) for SET detection. The efficiency and applicability of the bulk-BICS approach for Single Event Transient detection is demonstrated through device and circuit level simulations.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121078556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"MiGra: A Task Migration Algorithm for Reducing Temperature Gradient in Multiprocessor Systems on Chip","authors":"S. Carta, Fabio Mereu, A. Acquaviva, G. Micheli","doi":"10.1109/ISSOC.2007.4427441","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427441","url":null,"abstract":"Increasing operating temperature of Multiprocessor Systems on Chip coupled with uneven distribution of power dissipation may lead to large spatial and temporal temperature gradient that strongly impact system reliability. In this paper we propose a novel technique for reducing on-chip temperature gradient based on task migration between processing cores. The proposed algorithms exploit run-time temperature information to balance the chip temperature without impacting energy consumption. We demonstrate the effectiveness of the proposed algorithms by comparing the temperature variance obtained using the proposed algorithm with simple balancing strategies considering only load and energy as metrics. Migration costs are also considered as a metric for comparison.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130169857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Heikki Orsila, E. Salminen, Marko Hännikäinen, T. Hämäläinen
{"title":"Optimal Subset Mapping And Convergence Evaluation of Mapping Algorithms for Distributing Task Graphs on Multiprocessor SoC","authors":"Heikki Orsila, E. Salminen, Marko Hännikäinen, T. Hämäläinen","doi":"10.1109/ISSOC.2007.4427433","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427433","url":null,"abstract":"Mapping an application on multiprocessor system-on-chip (MPSoC) is a crucial step in architecture exploration. The problem is to minimize optimization effort and application execution time. Applications are modeled as static acyclic task graphs which are mapped to an MPSoC. The analysis is based on extensive simulations with 300 node graphs from the standard graph set. We present a new algorithm, optimal subset mapping (OSM), that rapidly evaluates task distribution mapping space, and then compare it to simulated annealing (SA) and group migration (GM) algorithms. OSM was developed to make architecture exploration faster. Efficiency of OSM is 5.0 times and 2.4 times than that of GM and SA, respectively, when efficiency is measured as the application speedup divided by the number of iterations needed for optimization. This saves 81% and 62% in wall clock optimization time, respectively. However, this is a tradeoff because OSM reaches 96% and 89% application speedup compared to GM and SA. Results show that OSM and GM have opposite convergence behavior and SA comes between these two.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131475803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Is Your Low Power Design Switched On?","authors":"M. Croft, Stephen Bailey","doi":"10.1109/ISSOC.2007.4427451","DOIUrl":"https://doi.org/10.1109/ISSOC.2007.4427451","url":null,"abstract":"Power is the key driver in most electronic systems design. Managing system power to increase battery life and reduce electricity consumption and heat generation are key considerations in developing many types of systems from mobile handsets, portable navigation systems and compute servers. New design techniques including clock gating, power gating, multi-voltage and dynamic voltage and frequency scaling have been employed to reduce both dynamic and static energy consumption. The design of low power electronic systems creates new design and verification challenges. Today's systems-on-chip are composed of large IP blocks approaching and exceeding 1 million gates. How can the system low power design intent be applied without requiring modification to and time consuming re-verification of the IP's golden RTL code? How can the functional impacts of power gating, retention and isolation be verified early in the design process when it is cheaper and easier to fix problems? Can implementation-dependent low power design functionality such as accurate retention save and restore protocols be verified prior to synthesis? In this presentation, we will discuss Accellcra's Unified Power Format standard for low power design intent and using UPF for verification.","PeriodicalId":244119,"journal":{"name":"2007 International Symposium on System-on-Chip","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124161251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}