Real-time Scale and Rotation Invariant Multiple Template Matching

Jung Rok Kim, J. Jeon
Published in: 2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM), 2022-01-03
DOI: 10.1109/IMCOM53663.2022.9721742
URL: https://doi.org/10.1109/IMCOM53663.2022.9721742
Citations: 0

Abstract

Object detection is an important component of computer vision, and real-time detection of objects at arbitrary scale and rotation remains a major challenge for industry. In this study, we propose a real-time scale- and rotation-invariant template matching algorithm and hardware structure, together with methods for image pyramid generation, patch orientation detection, descriptor generation, and descriptor matching. First, scale invariance is achieved by generating a pyramid of nine images from the input image so that objects of different scales are detected simultaneously. Next, a window equal in size to the template image is constructed from the original image, and the center point and direction of the window are obtained. Rotation invariance is achieved by creating a descriptor and rotating it according to the orientation of the window. Finally, the object is detected by matching this descriptor against the descriptor of the template. The proposed algorithm was implemented on a Xilinx Virtex-7 xc7v2000tflg1925-1 FPGA. Throughput was 187 frames/s regardless of the number of objects.
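The steps named in the abstract can be sketched in software as follows. This is only an illustrative sketch, not the authors' FPGA design. The abstract gives the number of pyramid levels (nine) and nothing more, so the scale step, the intensity-centroid orientation estimate, the ring-sampled descriptor, the bin count, and the sum-of-absolute-differences score are all assumptions made here for illustration.

```python
import math

NUM_LEVELS = 9    # from the abstract: a pyramid of nine images
SCALE_STEP = 1.2  # assumption: the abstract does not state the scale factor
NUM_BINS = 16     # assumption: number of samples in the hypothetical descriptor


def pyramid_sizes(width, height, levels=NUM_LEVELS, step=SCALE_STEP):
    """Return the (width, height) of each pyramid level, largest first."""
    return [(max(1, round(width / step ** i)), max(1, round(height / step ** i)))
            for i in range(levels)]


def patch_orientation(patch):
    """Angle in radians from the window centre to its intensity centroid
    (an assumed way to obtain the 'direction of the window')."""
    h, w = len(patch), len(patch[0])
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    m10 = sum((x - cx) * patch[y][x] for y in range(h) for x in range(w))
    m01 = sum((y - cy) * patch[y][x] for y in range(h) for x in range(w))
    return math.atan2(m01, m10)


def ring_descriptor(patch, bins=NUM_BINS):
    """Hypothetical descriptor: pixel values sampled on a circle
    around the window centre, starting at angle 0."""
    h, w = len(patch), len(patch[0])
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    r = min(cx, cy)
    desc = []
    for k in range(bins):
        a = 2 * math.pi * k / bins
        x = round(cx + r * math.cos(a))
        y = round(cy + r * math.sin(a))
        desc.append(patch[y][x])
    return desc


def rotate_descriptor(desc, angle):
    """Circularly shift the descriptor so the sample at `angle` becomes bin 0."""
    shift = round(angle / (2 * math.pi) * len(desc)) % len(desc)
    return desc[shift:] + desc[:shift]


def sad(a, b):
    """Sum of absolute differences between two descriptors (lower is better)."""
    return sum(abs(p - q) for p, q in zip(a, b))


def normalised_descriptor(patch):
    """Descriptor rotated by the window orientation, ready for matching."""
    return rotate_descriptor(ring_descriptor(patch), patch_orientation(patch))


# Template: one bright pixel to the right of the centre.
template = [[0] * 5 for _ in range(5)]
template[2][4] = 255
# Candidate window: the same pattern rotated by 90 degrees (bright pixel below).
window = [[0] * 5 for _ in range(5)]
window[4][2] = 255

score = sad(normalised_descriptor(template), normalised_descriptor(window))
```

In this toy case the rotated window and the template yield the same orientation-normalised descriptor, so `score` is zero. A full detector would repeat the window step at every position of every pyramid level, which is the part the paper maps to hardware.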