{"title":"FPGA implementation of complex-valued QR decomposition","authors":"Abdulrahman Alhamed, S. Alshebeili","doi":"10.1109/ICEDSA.2016.7818557","DOIUrl":null,"url":null,"abstract":"This paper presents a hardware architecture for the QR decomposition (QRD) of a complex-valued matrix based on Modified Gram-Schmidt (MGS) algorithm. A high throughput iterative-pipelined design is implemented, which achieves similar performance of a fully parallel-pipelined design, with a significant reduction in hardware usage. For a fixed-point Field Programmable Gate Array (FPGA) implementation optimized to decompose a 4 × 4 complex-valued matrix, an 18.6% reduction in resources utilization is achieved, compared to the fully parallel-pipelined design. Extending the proposed architecture to decompose larger matrices is straight forward, and in such cases more savings with respect to resources utilization can be attained.","PeriodicalId":247318,"journal":{"name":"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEDSA.2016.7818557","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents a hardware architecture for the QR decomposition (QRD) of a complex-valued matrix based on Modified Gram-Schmidt (MGS) algorithm. A high throughput iterative-pipelined design is implemented, which achieves similar performance of a fully parallel-pipelined design, with a significant reduction in hardware usage. For a fixed-point Field Programmable Gate Array (FPGA) implementation optimized to decompose a 4 × 4 complex-valued matrix, an 18.6% reduction in resources utilization is achieved, compared to the fully parallel-pipelined design. Extending the proposed architecture to decompose larger matrices is straight forward, and in such cases more savings with respect to resources utilization can be attained.