Yen-Liang Chen, Chun-Yu Chen, Kai-Yuan Jheng, A. Wu
{"title":"A universal look-ahead algorithm for pipelining IIR filters","authors":"Yen-Liang Chen, Chun-Yu Chen, Kai-Yuan Jheng, A. Wu","doi":"10.1109/VDAT.2008.4542462","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542462","url":null,"abstract":"Look-ahead (LA) techniques can pipeline IIR digital filters to attain high sampling rate. In the literatures, however, only simple recursive formulas are derived for special cases. In this paper, we present a universal look-ahead (ULA) approach to find the coefficients systematically. This approach is based on that the polynomial multiplication can be expressed in a matrix form. Hence, some matrix operations, such as inversion and multiplication can be applied to calculate the coefficients directly. The proposed approach is suitable for the design of pipelined recursive digital filters with any LA scheme and arbitrary pipelining levels.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125676952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"High-throughput 12-mode CTC decoder for WiMAX standard","authors":"Cheng-Hung Lin, Chun-Yu Chen, A. Wu","doi":"10.1109/VDAT.2008.4542451","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542451","url":null,"abstract":"In this paper, the high-throughput 12-mode convolutional turbo decoder which employs the parallel-window enhanced Max-log-MAP decoding is presented for WiMAX standard. To satisfy the multiple transmissions, 10 parallel windows decoding and the basic parallel-window designs are realized to have the features of efficient reconfigurability and no memory collision. After the implementation using the TSMC 0.13 mum CMOS process, the proposed WiMAX-compliant 12-mode CTC decoder occupies silicon area of 5.21 mm2 and achieves the maximum throughput rate of 115.4 Mbps.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115891106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jia-Lung Cheng, Chao-Shiun Wang, Wei-Chang Li, Chorng-Kuang Wang
{"title":"A 1.1-V CMOS frequency synthesizer with pass-transistor-logic prescaler for U-NII band system","authors":"Jia-Lung Cheng, Chao-Shiun Wang, Wei-Chang Li, Chorng-Kuang Wang","doi":"10.1109/VDAT.2008.4542426","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542426","url":null,"abstract":"This paper presents a 1.1-V low-voltage frequency synthesizer for U-NII band system. In order to achieve low-voltage and high-speed operations, a new 32/33 dual-modulus prescaler with pass-transistor-logic (PTL) topology and folding-injection architecture is proposed. Fabricated in 0.18-mum CMOS technology, the measured phase noise of the 6.08-GHz frequency synthesizer at a 1 MHz offset is -114 dBc/Hz and the reference spur at an offset of 5 MHz is less than 61 dBc. The settling time within 20 ppm after 640 MHz jump (widest jump from low band to high band) is less than 40 mus. The frequency synthesizer dissipates 31 mW from a 1.1-V supply and occupies a chip area of 1.3 mm2.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123790446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design of an OFDMA baseband receiver for 3GPP long-term evolution","authors":"Chin-Wei Chu, Chia-Ching Lee, Yuan-Hao Huang","doi":"10.1109/VDAT.2008.4542446","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542446","url":null,"abstract":"In this paper, we propose a downlink OFDMA processor based on the 3GPP LTE. This processor supports scalable bandwidth for different OFDMA modes. A low-cost sampling rate conversion is employed to obtain different sampling rates for different bandwidths. Timing and frequency synchronization are then designed to recover the time and frequency mismatch between the transmitter and receiver. Afterward, sampling clock offset is tracked and compensated in the frequency domain. Finally, reconfigurable channel estimation and equalization are designed for optimal performance. The circuit architecture is designed and simulated to show that its performance meets the system requirement.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124985565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A hardware/software co-solution to achieving high throughput required by motion estimation part in H.264/AVC HDTV real-time application","authors":"Zhenxing Chen, T. Ikenaga, S. Goto","doi":"10.1109/VDAT.2008.4542429","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542429","url":null,"abstract":"The high throughput required by Motion Estimation (ME) part in H.264/AVC High Definition TV (HDTV) real-time application is very difficult to achieve. Currently one kind of the solutions to this problem is multipling the processing element (PE) array to construct redundant PE array structure. Although redundant structure can take advantages of parallel processing to achieve high throughput, meanwhile it bring in linear increasing of hardware cost. In this paper, a hardware/software co-solusion is proposed to achieve the required throughput of ME part in H.264/AVC HDTV realtime application. In software side, one adaptive search range (ASR) algorithm which is previously proposed by us [12] is firstly introduced and then experimentally proved can improve the throughput 11.48 times averagely in HDTV1080p video sequence. In hardware side, a previously proposed architecture called SAD-tree [17] is firstly introduced. Then based on this architecture, optimization that increase the frequence is proposed. The hardware implementing result of the optimized architecture shows the proposed optimization can triple the frequence. Finally, it is illustrated that the hardware/software co-solution can help to achieve the required throughput.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125556777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Impact quantification of the dummy metal fills on nanometer VLSI designs for DFM","authors":"Keh-Jeng Chang, Jyh-Jeng Chou, Hung-Chih Li, Kuo-Cheng Chang","doi":"10.1109/VDAT.2008.4542470","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542470","url":null,"abstract":"Dummy metal fills can cause systematic variations in capacitance and the impact on the parametric yields should be quantified rigorously. A new set of experimental nanometer circuit structures with close-to-reality dummy metal fills are designed and simulated using 3D electromagnetic field simulations and SPICE to quantify the impacts on the capacitance, the timing, and the crosstalk noise more realistically for design for manufacturability (DFM). With the proposed set of test structures scaled down for the upcoming advanced technology nodes, the foundries and the designers can incorporate the parasitic impacts by dummy metal fills as part of the systematic variation for LPE and SSTA tools. From our detailed analyses, the impacts are growing from 180 nm to 90 nm and are likely to become more prominent in 65 nm and 45 nm designs.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126252223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A 0.35-um CMOS switched-inductor dual-band LC-tank frequency divider","authors":"S. Jang, Che Yi Lin, Chien-Feng Lee","doi":"10.1109/VDAT.2008.4542457","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542457","url":null,"abstract":"This paper presents a switched-inductor dual-band injection locked frequency divider (ILFD) fabricated in a 0.35-um CMOS process. The circuit is made of a differential double cross-coupled CMOS LC oscillator with a switched-inductor LC-tank, which consists of a parallel LC tank in parallel with one switched inductor composite. The dual-band function is obtained by switching the MOS switch in the inductor composite. At the supply voltage of 2.4 V, the measured free-running frequencies of the divide-by-two dual band ILFD are from 2.52 GHz to 2.61 GHz for the low-frequency band and from 4.18 GHz to 4.39 GHz for the high-frequency band. At the incident power of 0 dBm. the locking ranges of low-frequency and high-frequency band are respectively from 4.0 GHz to 5.8 GHz and from 7.1 GHz to 9.3 GHz.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126887624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Y. Murachi, T. Kamino, J. Miyakoshi, H. Kawaguchi, M. Yoshimoto
{"title":"A power-efficient SRAM core architecture with segmentation-free and rectangular accessibility for super-parallel video processing","authors":"Y. Murachi, T. Kamino, J. Miyakoshi, H. Kawaguchi, M. Yoshimoto","doi":"10.1109/VDAT.2008.4542413","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542413","url":null,"abstract":"This paper describes a unique SRAM architecture for super- parallel video processing. It features one cycle functional access of a rectangular image data (n x m pixels) with segmentation-free. To achieve this accessibility, a local word-line select scheme and a merged X-decoder method are newly introduced with elimination of extra X-decoder employed in usage of the conventional divided SRAM macro. The proposed SRAM has been adopted to a search window buffer for H.264 motion estimation processor for HDTV resolution video. As a result, a power and area of the search window buffer are reduced by 49% and by 48%, respectively. Furthermore, it is shown that the proposed SRAM is more efficient for super-HDTV resolution video which requires more parallelism.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130366230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive digital techniques for efficiency and linearity enhancement of CMOS RF power amplifiers","authors":"D. Kwon, Hao Li, Y. Chili","doi":"10.1109/VDAT.2008.4542443","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542443","url":null,"abstract":"This paper presents a baseband adaptive digital compensation technique for cost-effective CMOS integration of next-generation OFDM wireless transmitters. High power efficiency and linearity are simultaneously achieved by using a class-B power amplifier (PA) with less back-off and treating the memoryless AM-AM and AM-PM distortions in the digital domain. When a 64-QAM OFDM signal is transmitted, computer simulations indicate that the post-compensation ACPR and EVM are improved by 13 dB and 30 dB, respectively. The average power efficiency of the PA is 10.9% as opposed to 3.2% of a class-A counterpart for typical usage patterns.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125584476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tong-Yu Hsieh, Kuen-Jong Lee, Chia-Lin Lu, M. Breuer
{"title":"A systematic methodology to employ error-tolerance for yield improvement","authors":"Tong-Yu Hsieh, Kuen-Jong Lee, Chia-Lin Lu, M. Breuer","doi":"10.1109/VDAT.2008.4542423","DOIUrl":"https://doi.org/10.1109/VDAT.2008.4542423","url":null,"abstract":"Error-tolerance is an innovative concept that can significantly improve the yield of integrated circuits (IC's) by identifying defective yet acceptable chips. A systematic method to employ this concept, however, has not been addressed. In this paper, we propose a general methodology to systematically utilize error- tolerance for practical applications. The proposed methodology explores the error-tolerance features of target designs, evaluates the acceptability of defective chips, and predicts the yield improvement that can be achieved. To illustrate and validate the proposed methodology, we employ a discrete cosine transform (DCT) circuit that has been widely used in multimedia compression systems in a case study. By applying the proposed methodology to the DCT, an error-tolerant design flow is established. Proper attributes are determined for acceptability evaluation, and corresponding test methods are developed to identify acceptable chips. Experimental results show that one can easily specify various acceptability thresholds of the identified error-tolerable attributes to obtain different degrees of yield improvement, which validates the efficiency and effectiveness of the proposed methodology.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122448256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}