{"title":"A new systolic array algorithm for memory-based VLSI array implementation of IDCT with high throughput rate and low complexity","authors":"D. Chiper","doi":"10.1109/DSPWS.1996.555490","DOIUrl":null,"url":null,"abstract":"A new systolic algorithm for memory-based parallel VLSI implementation of the inverse to discrete cosine transform (IDCT) is proposed. The new approach is based on a new formulation of an odd prime-length IDCT which uses two half-length cyclic convolutions with the same form which can be concurrently computed and were such reformulated that an efficient substitution of multipliers with small ROMs can be obtained. Using this algorithm, a new efficient VLSI implementation with outstanding performance in structural regularity, hardware cost of the PEs, average computation time, and I/O costs can be obtained. It has a much lower control complexity, and a simpler hardware structure.","PeriodicalId":131323,"journal":{"name":"1996 IEEE Digital Signal Processing Workshop Proceedings","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"1996 IEEE Digital Signal Processing Workshop Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSPWS.1996.555490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
A new systolic algorithm for memory-based parallel VLSI implementation of the inverse to discrete cosine transform (IDCT) is proposed. The new approach is based on a new formulation of an odd prime-length IDCT which uses two half-length cyclic convolutions with the same form which can be concurrently computed and were such reformulated that an efficient substitution of multipliers with small ROMs can be obtained. Using this algorithm, a new efficient VLSI implementation with outstanding performance in structural regularity, hardware cost of the PEs, average computation time, and I/O costs can be obtained. It has a much lower control complexity, and a simpler hardware structure.