M. K. Hung, Yaoyao Ye, Xiaowen Wu, Wei Zhang, Weichen Liu, Jiang Xu
{"title":"A Hierarchical Hybrid Optical-Electronic Network-on-Chip","authors":"M. K. Hung, Yaoyao Ye, Xiaowen Wu, Wei Zhang, Weichen Liu, Jiang Xu","doi":"10.1109/ISVLSI.2010.17","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.17","url":null,"abstract":"Network-on-chip (NoC) can improve the performance, power efficiency, and scalability of multiprocessor system-on-chip (MPSoC). However, traditional NoCs using metallic interconnects consume significant amount of power to deliver even higher communication bandwidth required in the near future. Optical NoCs are based on CMOS-compatible optical waveguides and micro resonators, and promise significant bandwidth and power advantages. In this paper, we propose a hybrid optical mesh NoC, HOME, which utilizes optical waveguides as well as metallic interconnects in a hierarchical manner. HOME employs a new set of protocols to improve the network throughput and latency. We compared HOME with a matched optical mesh NoC for a 64-core MPSoC in 45nm, using SPICE simulations and our cycle-accurate multi-objective NoC simulation platform, MoLab. Comparing with the optical mesh NoC, HOME uses 75% less optical/electronic interfaces and laser diodes. Simulation results show that HOME achieves 17% higher throughput and 40% less latency while consuming 42% less power.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"219 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115243994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"BBVC-3D-NoC: An Efficient 3D NoC Architecture Using Bidirectional Bisynchronous Vertical Channels","authors":"A. Rahmani, P. Liljeberg, J. Plosila, H. Tenhunen","doi":"10.1109/ISVLSI.2010.21","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.21","url":null,"abstract":"In this paper, a 3D NoC architecture based on Bidirectional Bisynchronous Vertical Channels (BBVC) is proposed as a solution to mitigate area footprints of vertical interconnects. BBVCs, which can be dynamically self-configured to transmit flits in either direction, enable the system to benefit from a high-speed bidirectional channel instead of a pair of unidirectional channels for inter-layer communication. By exploiting the high-speed nature of the vertical links in 3D ICs, this substitution indicates better bandwidth utilization, lower area footprint, and improved routability at each layer. Our results reveal that the proposed architecture helps to achieve up to 47% savings in TSV area footprint at the 65nm technology node.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"17 11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125766778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Defect and Variation Issues on Design Mapping of Reconfigurable Nanoscale Crossbars","authors":"B. Ghavami, A. Tajary, Mohsen Raji, H. Pedram","doi":"10.1109/ISVLSI.2010.43","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.43","url":null,"abstract":"High defect density and extreme process variation for nanoscale self-assembled crossbar-based architectures have been expected to be as fundamental design challenges. Consequently, defect and variation issues must be considered on logic mapping on nanoscale crossbars. In this paper, we investigate a greedy algorithm for the variation and defect aware logic mapping of crossbar arrays. Based on Mont-Carlo simulation, we compare the proposed technique with other logic mapping techniques such as variation unaware and exhaustive search mapping in terms of accuracy as well as runtime.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128223696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Master-Slave TMR Inspired Technique for Fault Tolerance of SRAM-Based FPGA","authors":"Farid Lahrach, A. Doumar, E. Châtelet, A. Abdaoui","doi":"10.1109/ISVLSI.2010.38","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.38","url":null,"abstract":"In order to increase reliability and availability of Static-RAM based field programmable gate arrays (SRAM-based FPGAs), several methods of tolerating defects and permanent faults have been developed and applied. These methods are not well adapted for handling high fault rates for SRAM based FPGAs. In this paper, both single and double faults affecting configurable logic blocks (CLBs) are addressed. We have developed a new fault-tolerance technique that capitalizes on the partial reconfiguration capabilities of SRAM-based FPGA. The proposed fault-tolerance method is based on triple modular redundancy (TMR) combined with master-slave technique, and exploiting partial reconfiguration to tolerate permanent faults. Simulation results on reliability improvement corroborate the efficiency of the proposed method and prove that it compares favorably to previous methods.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"46 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124415287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
C. Silvano, W. Fornaciari, S. Crespi-Reghizzi, G. Agosta, G. Palermo, V. Zaccaria, P. Bellasi, F. Castro, S. Corbetta, A. D. Biagio, E. Speziale, M. Tartara, D. Siorpaes, H. Hübert, B. Stabernack, J. Brandenburg, M. Palkovic, P. Raghavan, C. Ykman-Couvreur, A. Bartzas, S. Xydis, D. Soudris, T. Kempf, G. Ascheid, R. Leupers, H. Meyr, J. Ansari, P. Mähönen, B. Vanthournout
{"title":"2PARMA: Parallel Paradigms and Run-Time Management Techniques for Many-Core Architectures","authors":"C. Silvano, W. Fornaciari, S. Crespi-Reghizzi, G. Agosta, G. Palermo, V. Zaccaria, P. Bellasi, F. Castro, S. Corbetta, A. D. Biagio, E. Speziale, M. Tartara, D. Siorpaes, H. Hübert, B. Stabernack, J. Brandenburg, M. Palkovic, P. Raghavan, C. Ykman-Couvreur, A. Bartzas, S. Xydis, D. Soudris, T. Kempf, G. Ascheid, R. Leupers, H. Meyr, J. Ansari, P. Mähönen, B. Vanthournout","doi":"10.1109/ISVLSI.2010.93","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.93","url":null,"abstract":"The main goals of the 2PARMA project are: the definition of a parallel programming model combining component-based and single-instruction multiple-thread approaches, instruction set virtualisation based on portable byte-code, run-time resource management policies and mechanisms as well as design space exploration methodologies for many-core computing architectures.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134066825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
I. Mansouri, Camille Jalier, F. Clermidy, P. Benoit, L. Torres
{"title":"Implementation Analysis of a Dynamic Energy Management Approach Inspired by Game-Theory","authors":"I. Mansouri, Camille Jalier, F. Clermidy, P. Benoit, L. Torres","doi":"10.1109/ISVLSI.2010.61","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.61","url":null,"abstract":"MPSoC architectures bring flexibility and performance, but to avoid high power consumption, they still require a careful design. When taking into account variability in advanced CMOS technologies, individual optimization of each chip is necessary. In this paper, we present an approach based on Game Theory to address this issue at run-time in a distributed way, frequencies are dynamically adjusted according to changes in the system, so that power consumption is reduced while application constraints are fulfilled. Software and hardware solutions are proposed for different flexibility/performance trade-offs. Results show a latency of 5ms and an area of 0.014m² for the optimization stage in the hardware implementation. These figures are promising and allow envisaging run-time optimization on large-scale multi-cores.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130954714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Calibration Circuit for Reconfigurable Smart ADC for Biomedical Signal Processing","authors":"S. Mostafa, W. Qu, S. Islam, M. Mahfouz","doi":"10.1109/ISVLSI.2010.85","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.85","url":null,"abstract":"Reconfigurable analog-to-digital converters (ADC) have been receiving increased attention in the research community for the capability to adapt to continuously varying signal processing requirements. The reconfigurable ADC is particularly advantageous in implantable biomedical sensor signal processing systems that processes signal in different range of frequency and amplitude levels and has low power consumption constraints. In this work a calibration circuit is presented that analyses the input signal from biomedical sensors and based on the dynamic range and frequency of the signal generates a two-bit control signal to change the configuration of the ADC for optimum resolution and power consumption. The two-bit control signal reconfigures the 10-bit 40MSPS pipeline ADC to operate in four different configurations by changing resolution, sampling clock and bias current for optimized operation. The calibration circuit is designed in 90 nm process and requires only 36.9μA current with 1V power supply and occupies 0.0054mm2 chip area. The reconfigurable ADC is designed in a 0.35 μm bulk CMOS process occupying 1.9mm2 area and consumes 35.4mW power in maximum performance configuration and 7.9mW in minimum power consumption configuration.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131449763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Husemann, M. Majolo, A. Susin, V. Roesler, J. V. D. Lima
{"title":"Highly Efficient Transforms Module Solution for a H.264/SVC Encoder","authors":"R. Husemann, M. Majolo, A. Susin, V. Roesler, J. V. D. Lima","doi":"10.1109/ISVLSI.2010.87","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.87","url":null,"abstract":"The computational-intensive demands of H.264 video encoder normally imply to the use of high performance hardware solutions like dedicated multimedia DSP or programmable logic devices. These demands can be even more critical when it is necessary to implement a H.264/SVC (Scalable Video Coding) solution, an emergent encoder standard that provides the generation of flexible and adaptive multi-layer streams. The complexity of a SVC encoder increases proportionally with number of configured layers, introducing new challenges to multimedia market. In this work we propose an efficient hardware module, responsible for the transform algorithms (Hadamard and DCT) of a SVC encoder, processing up eight samples per clock. The proposed module take into account specific memory demands in order to produce an optimized solution with respect to encoder performance and complexity, aiming to reach a realizable SVC encoder.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133276745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An 8-Bit Voltage Mode Analog to Digital Converter Based on Integer Division","authors":"N. Petrellis, M. Birbas, J. Kikidis, A. Birbas","doi":"10.1109/ISVLSI.2010.27","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.27","url":null,"abstract":"An 8-bit subrange Analog to Digital Converter (ADC) designed in a CMOS TSMC90nm process consisting of a coarse 4-bit and a fine 4-bit ADC stage is presented in this paper. It occupies only 0.04mm2 of die area and dissipates less than 22mW without sacrificing speed since the sampling rate is higher than 500MS/s and the achieved SNDR is higher than 40dB. These features have been estimated by post-layout simulations. In ordinary subrange architectures a Digital to Analog Converter (DAC) generates an analog value from the digital coarse ADC output that is subtracted by the input in order to produce a residue that serves as input to the fine ADC stage. In such an approach, the linearity errors of the coarse ADC, the DAC and the subtractor are accumulated. In the present work, a voltage mode modulo-16 integer divider is used at the input of the subrange ADC and a “thermometer to binary encoder” that is driven directly by the comparator outputs of the integer divider is also employed instead of a full 4-bit coarse ADC, leading to an ultra low area and power design with less linearity error sources.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117272047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Analytical Framework with Bounded Deflection Adaptive Routing for Networks-on-Chip","authors":"P. Ghosh, Arvind Ravi, Arunabha Sen","doi":"10.1109/ISVLSI.2010.90","DOIUrl":"https://doi.org/10.1109/ISVLSI.2010.90","url":null,"abstract":"In a Multi-Processor System-on-Chip (MPSoC)-based embedded system with Network-on-chip (NoC) as the communication architecture, routing of the communication traffic among the Processing Elements (PEs) contributes significantly to the overall latency, throughput and energy consumption. Design of an efficient routing algorithm for NoC requires a thorough understanding of the role of individual components of NoC. Simulation based studies are time-consuming and do not provide adequate insight into the design parameters for performance improvement. In this paper, we provide a framework for the analytical study of the NoC components and design an adaptive routing algorithm. Based on the traffic pattern of the communication traffic among PEs, we perform analytical studies based on network calculus and probabilistic analysis. Analytical study relates the design parameters with the worst case and average case latency and buffer requirements. Knowledge obtained from the analytical study is utilized for resource allocation of NoC, which further constitutes the design philosophy of the proposed Bounded Deflection Adaptive Routing (BDAR) algorithm. Our routing algorithm is deadlock-live lock free and efficiently reacts to link congestions. Experimental results based on simulations show that our routing algorithm performs significantly better than some existing static and dynamic routing in terms of link utilization, average and maximum end-to-end latency.","PeriodicalId":187530,"journal":{"name":"2010 IEEE Computer Society Annual Symposium on VLSI","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124091066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}