T. Adiono, Mahendra Drajat Adhinata, Novi Prihatiningrum, Ricky Disastra, Rachmad Vidya Wicaksana Putra, A. H. Salman
{"title":"基于模板匹配的快速队列计数器的FPGA结构设计","authors":"T. Adiono, Mahendra Drajat Adhinata, Novi Prihatiningrum, Ricky Disastra, Rachmad Vidya Wicaksana Putra, A. H. Salman","doi":"10.1109/ISPACS.2016.7824708","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an architecture design of Sum of Absolute Difference (SAD) based template matching for fast queue counter. The main idea of this architecture design are line delay and SAD processor array. Size of the line delay is 640×100, since the length of the source image is 640 pixels, the width of the template image is 100 pixels and specification of the template and source images are 640×480 and 40×100 pixels respectively. The processor array (PA) contains 40×100 SAD processing elements (PEs) that are mapped into horizontal structure. We synthesize and implement the proposed design in FPGA Altera DE2-115. It can reach 128.35 MHz as maximum frequency and occupies 59,899 memory bits; 17,158 registers and 17,411 combinationals. It needs 307,200 clock cycles to finish a single image. By using 100 MHz clock frequency, a single image processing can be conducted in 3.072 ms (325 fps). These results show that the proposed design can reach fast computational performance.","PeriodicalId":131543,"journal":{"name":"2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An architecture design of SAD based template matching for fast queue counter in FPGA\",\"authors\":\"T. Adiono, Mahendra Drajat Adhinata, Novi Prihatiningrum, Ricky Disastra, Rachmad Vidya Wicaksana Putra, A. H. Salman\",\"doi\":\"10.1109/ISPACS.2016.7824708\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose an architecture design of Sum of Absolute Difference (SAD) based template matching for fast queue counter. The main idea of this architecture design are line delay and SAD processor array. Size of the line delay is 640×100, since the length of the source image is 640 pixels, the width of the template image is 100 pixels and specification of the template and source images are 640×480 and 40×100 pixels respectively. The processor array (PA) contains 40×100 SAD processing elements (PEs) that are mapped into horizontal structure. We synthesize and implement the proposed design in FPGA Altera DE2-115. It can reach 128.35 MHz as maximum frequency and occupies 59,899 memory bits; 17,158 registers and 17,411 combinationals. It needs 307,200 clock cycles to finish a single image. By using 100 MHz clock frequency, a single image processing can be conducted in 3.072 ms (325 fps). These results show that the proposed design can reach fast computational performance.\",\"PeriodicalId\":131543,\"journal\":{\"name\":\"2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPACS.2016.7824708\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS.2016.7824708","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
本文提出了一种基于绝对差和(SAD)模板匹配的快速队列计数器体系结构设计。该架构设计的主要思想是线延迟和SAD处理器阵列。线延迟的大小为640×100,因为源图像的长度为640像素,模板图像的宽度为100像素,模板和源图像的规格分别为640×480和40×100像素。处理器阵列(PA)包含40×100 SAD处理元素(pe),这些处理元素映射为水平结构。我们在Altera DE2-115 FPGA上综合并实现了所提出的设计。最高频率可达128.35 MHz,占用内存59,899位;17,158个寄存器和17,411个组合。完成一张图像需要307,200个时钟周期。使用100 MHz时钟频率,可在3.072 ms (325 fps)内完成单幅图像处理。结果表明,所提出的设计能够达到快速的计算性能。
An architecture design of SAD based template matching for fast queue counter in FPGA
In this paper, we propose an architecture design of Sum of Absolute Difference (SAD) based template matching for fast queue counter. The main idea of this architecture design are line delay and SAD processor array. Size of the line delay is 640×100, since the length of the source image is 640 pixels, the width of the template image is 100 pixels and specification of the template and source images are 640×480 and 40×100 pixels respectively. The processor array (PA) contains 40×100 SAD processing elements (PEs) that are mapped into horizontal structure. We synthesize and implement the proposed design in FPGA Altera DE2-115. It can reach 128.35 MHz as maximum frequency and occupies 59,899 memory bits; 17,158 registers and 17,411 combinationals. It needs 307,200 clock cycles to finish a single image. By using 100 MHz clock frequency, a single image processing can be conducted in 3.072 ms (325 fps). These results show that the proposed design can reach fast computational performance.