{"title":"Network generating network for multi-scale image classification","authors":"Han Dong, Liping Xiao, Longjian Cong, Bin Zhou","doi":"10.1117/12.2671561","DOIUrl":null,"url":null,"abstract":"Features extracted by the neural network do not have scale invariance, which makes multi-scale image recognition and classification a difficult problem. Recent studies have proposed many new ways to solve this problem, such as feature fusion, sensor field transformation, etc. However, none of them essentially solve the problem that the neural network does not have scale invariance. In this paper, we propose a network generating network (NGN) architecture and design the NGNResNet network, which is an improved version of the ResNet network. The network can identify images at three scales simultaneously and has scale invariance. The experimental results show that the NGN structure helps us to improve the classification accuracy of small-scale images by about 10 percentage points, and helps to improve the performance of the network in the face of small targets.","PeriodicalId":120866,"journal":{"name":"Artificial Intelligence and Big Data Forum","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence and Big Data Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2671561","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Features extracted by the neural network do not have scale invariance, which makes multi-scale image recognition and classification a difficult problem. Recent studies have proposed many new ways to solve this problem, such as feature fusion, sensor field transformation, etc. However, none of them essentially solve the problem that the neural network does not have scale invariance. In this paper, we propose a network generating network (NGN) architecture and design the NGNResNet network, which is an improved version of the ResNet network. The network can identify images at three scales simultaneously and has scale invariance. The experimental results show that the NGN structure helps us to improve the classification accuracy of small-scale images by about 10 percentage points, and helps to improve the performance of the network in the face of small targets.