{"title":"Scale-invariant Convolutional Capsule Network","authors":"Zihan Li, Yuqiu Kong, Baocai Yin","doi":"10.1145/3495018.3495058","DOIUrl":null,"url":null,"abstract":"Scale-invariant feature detection plays an important role in computer vision. Inspired by capsule networks and Hough transform, we presented a scale-invariant feature detection and extraction module that is streamlined, lightweight, and interpretable. We reduced the overhead by combining the learnable feature detector with an efficient scale aware parameter voting mechanism. Also, a routing mechanism can be added to further refine the extracted features to boost performance. Compared with popular multicolumn or feature pyramid based methods, our proposed method is more lightweight both parameter wise and architectural wise, while maintaining good multiscale performance, especially in intensively scale transformed scenarios. Meanwhile, its streamlined sequential pipeline makes it easy to integrate into other models in a plug-and-play manner.","PeriodicalId":6873,"journal":{"name":"2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture","volume":"149 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3495018.3495058","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Scale-invariant feature detection plays an important role in computer vision. Inspired by capsule networks and Hough transform, we presented a scale-invariant feature detection and extraction module that is streamlined, lightweight, and interpretable. We reduced the overhead by combining the learnable feature detector with an efficient scale aware parameter voting mechanism. Also, a routing mechanism can be added to further refine the extracted features to boost performance. Compared with popular multicolumn or feature pyramid based methods, our proposed method is more lightweight both parameter wise and architectural wise, while maintaining good multiscale performance, especially in intensively scale transformed scenarios. Meanwhile, its streamlined sequential pipeline makes it easy to integrate into other models in a plug-and-play manner.