{"title":"基于多尺度破坏与构建学习的细粒度目标识别","authors":"Jun Luo, Yingzhe Jiang, Jinyi Qiu","doi":"10.1145/3501409.3501625","DOIUrl":null,"url":null,"abstract":"With the development of image processing technology, how to effectively use computer vision technology for image classification has become an important research branch in the field of computer vision. Traditional image classification is to identify coarse-grained categories, such as using computer vision technology to distinguish meta-categories such as \"dog\", \"grape\" and \"bird\". In many practical applications, images need to be classified at a fine-grained level, such as distinguishing which subcategory the image belongs to in the \"dog\" category: \"husky\", \"Samoyed\" and \"Alaska\". Due to the characteristics of large intra-class differences and small inter-class differences in fine-grained image classification, fine-grained image classification has become a challenging research. This paper proposes multi-scale input (MS_DCL) and multi-scale detail boosting (MSDB_DCL) destruction and construction learning methods, so that the network can learn object and component-level discriminative information for fine-grained image classification. The experimental results show that MS_DCL is finer than other existing ones. The granular image classification method has higher recognition accuracy, and the test accuracy rate can reach 87.4% on the CUB200-2011 dataset.","PeriodicalId":191106,"journal":{"name":"Proceedings of the 2021 5th International Conference on Electronic Information Technology and Computer Engineering","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fine-grained Object Recognition based on Multi-Scale Destruction and Construction Learning\",\"authors\":\"Jun Luo, Yingzhe Jiang, Jinyi Qiu\",\"doi\":\"10.1145/3501409.3501625\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of image processing technology, how to effectively use computer vision technology for image classification has become an important research branch in the field of computer vision. Traditional image classification is to identify coarse-grained categories, such as using computer vision technology to distinguish meta-categories such as \\\"dog\\\", \\\"grape\\\" and \\\"bird\\\". In many practical applications, images need to be classified at a fine-grained level, such as distinguishing which subcategory the image belongs to in the \\\"dog\\\" category: \\\"husky\\\", \\\"Samoyed\\\" and \\\"Alaska\\\". Due to the characteristics of large intra-class differences and small inter-class differences in fine-grained image classification, fine-grained image classification has become a challenging research. This paper proposes multi-scale input (MS_DCL) and multi-scale detail boosting (MSDB_DCL) destruction and construction learning methods, so that the network can learn object and component-level discriminative information for fine-grained image classification. The experimental results show that MS_DCL is finer than other existing ones. The granular image classification method has higher recognition accuracy, and the test accuracy rate can reach 87.4% on the CUB200-2011 dataset.\",\"PeriodicalId\":191106,\"journal\":{\"name\":\"Proceedings of the 2021 5th International Conference on Electronic Information Technology and Computer Engineering\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 5th International Conference on Electronic Information Technology and Computer Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3501409.3501625\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 5th International Conference on Electronic Information Technology and Computer Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3501409.3501625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fine-grained Object Recognition based on Multi-Scale Destruction and Construction Learning
With the development of image processing technology, how to effectively use computer vision technology for image classification has become an important research branch in the field of computer vision. Traditional image classification is to identify coarse-grained categories, such as using computer vision technology to distinguish meta-categories such as "dog", "grape" and "bird". In many practical applications, images need to be classified at a fine-grained level, such as distinguishing which subcategory the image belongs to in the "dog" category: "husky", "Samoyed" and "Alaska". Due to the characteristics of large intra-class differences and small inter-class differences in fine-grained image classification, fine-grained image classification has become a challenging research. This paper proposes multi-scale input (MS_DCL) and multi-scale detail boosting (MSDB_DCL) destruction and construction learning methods, so that the network can learn object and component-level discriminative information for fine-grained image classification. The experimental results show that MS_DCL is finer than other existing ones. The granular image classification method has higher recognition accuracy, and the test accuracy rate can reach 87.4% on the CUB200-2011 dataset.