{"title":"二维不可分四元数滤波器组的高性能无乘法器流水线FPGA架构","authors":"Eugene V. Rybenkov, N. Petrovsky","doi":"10.23919/spa50552.2020.9241273","DOIUrl":null,"url":null,"abstract":"This paper presents a systematic design of the 2-D non-separable quaternionic paraunitary filter banks $(Q -$PUFB) based on the integer-to-integer invertible quaternionic multiplier applied to image processing. In order to achieve higher transform coding gains in multidimensional domain with relatively low-complexity implementation, orthogonal transform 8-channel $Q -$PUFB factorize into two-dimensional non-separable structures called ”64in-64out” (2D NS $Q -$PUFB). The given structures can be mapped directly to parallel-pipelined processor architecture with minimal latency time $4 (N +1)$ quaternion multiplication operations, where N is transform order of $Q -$PUFB. The latency of parallel-pipelined processing does not depend on the size of the original image. Experimental design results on resource utilization and total throughput are obtained using a Xilinx Ultrascale + FPGA Series. System prototype total throughput variates from 13.8 up to 55 million pixels per second and depends on fixed point constraints.","PeriodicalId":157578,"journal":{"name":"2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"High performance multiplier-less pipelined FPGA architecture for 2-D non-separable quaternionic filter banks\",\"authors\":\"Eugene V. Rybenkov, N. Petrovsky\",\"doi\":\"10.23919/spa50552.2020.9241273\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a systematic design of the 2-D non-separable quaternionic paraunitary filter banks $(Q -$PUFB) based on the integer-to-integer invertible quaternionic multiplier applied to image processing. In order to achieve higher transform coding gains in multidimensional domain with relatively low-complexity implementation, orthogonal transform 8-channel $Q -$PUFB factorize into two-dimensional non-separable structures called ”64in-64out” (2D NS $Q -$PUFB). The given structures can be mapped directly to parallel-pipelined processor architecture with minimal latency time $4 (N +1)$ quaternion multiplication operations, where N is transform order of $Q -$PUFB. The latency of parallel-pipelined processing does not depend on the size of the original image. Experimental design results on resource utilization and total throughput are obtained using a Xilinx Ultrascale + FPGA Series. System prototype total throughput variates from 13.8 up to 55 million pixels per second and depends on fixed point constraints.\",\"PeriodicalId\":157578,\"journal\":{\"name\":\"2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/spa50552.2020.9241273\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/spa50552.2020.9241273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
High performance multiplier-less pipelined FPGA architecture for 2-D non-separable quaternionic filter banks
This paper presents a systematic design of the 2-D non-separable quaternionic paraunitary filter banks $(Q -$PUFB) based on the integer-to-integer invertible quaternionic multiplier applied to image processing. In order to achieve higher transform coding gains in multidimensional domain with relatively low-complexity implementation, orthogonal transform 8-channel $Q -$PUFB factorize into two-dimensional non-separable structures called ”64in-64out” (2D NS $Q -$PUFB). The given structures can be mapped directly to parallel-pipelined processor architecture with minimal latency time $4 (N +1)$ quaternion multiplication operations, where N is transform order of $Q -$PUFB. The latency of parallel-pipelined processing does not depend on the size of the original image. Experimental design results on resource utilization and total throughput are obtained using a Xilinx Ultrascale + FPGA Series. System prototype total throughput variates from 13.8 up to 55 million pixels per second and depends on fixed point constraints.