{"title":"Semantic guided level-category hybrid prediction network for hierarchical image classification","authors":"Peng Wang, Jingzhou Chen, Yuntao Qian","doi":"10.1142/s0219691323500236","DOIUrl":null,"url":null,"abstract":"Hierarchical classification (HC) assigns each object with multiple labels organized into a hierarchical structure. The existing deep learning-based HC methods usually predict an instance starting from the root node until a leaf node is reached. However, in the real world, images impaired by noise, occlusion, blur, or low resolution may not provide sufficient information for the classification at subordinate levels. To address this issue, we propose a novel Semantic Guided level-category Hybrid Prediction Network (SGHPN) that can jointly perform the level and category prediction in an end-to-end manner. SGHPN comprises two modules: a visual transformer that extracts feature vectors from the input images, and a semantic guided cross-attention module that uses categories word embeddings as queries to guide learning category-specific representations. In order to evaluate the proposed method, we construct two new datasets in which images are at a broad range of quality and thus are labeled to different levels (depths) in the hierarchy according to their individual quality. Experimental results demonstrate the effectiveness of our proposed HC method.","PeriodicalId":50282,"journal":{"name":"International Journal of Wavelets Multiresolution and Information Processing","volume":"77 1","pages":"0"},"PeriodicalIF":0.9000,"publicationDate":"2023-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Wavelets Multiresolution and Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219691323500236","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Hierarchical classification (HC) assigns each object with multiple labels organized into a hierarchical structure. The existing deep learning-based HC methods usually predict an instance starting from the root node until a leaf node is reached. However, in the real world, images impaired by noise, occlusion, blur, or low resolution may not provide sufficient information for the classification at subordinate levels. To address this issue, we propose a novel Semantic Guided level-category Hybrid Prediction Network (SGHPN) that can jointly perform the level and category prediction in an end-to-end manner. SGHPN comprises two modules: a visual transformer that extracts feature vectors from the input images, and a semantic guided cross-attention module that uses categories word embeddings as queries to guide learning category-specific representations. In order to evaluate the proposed method, we construct two new datasets in which images are at a broad range of quality and thus are labeled to different levels (depths) in the hierarchy according to their individual quality. Experimental results demonstrate the effectiveness of our proposed HC method.
期刊介绍:
International Journal of Wavelets, Multiresolution and Information Processing (hereafter referred to as IJWMIP) is a bi-monthly publication for theoretical and applied papers on the current state-of-the-art results of wavelet analysis, multiresolution and information processing.
Papers related to the IJWMIP theme are especially solicited, including theories, methodologies, algorithms and emerging applications. Topics of interest of the IJWMIP include, but are not limited to:
1. Wavelets:
Wavelets and operator theory
Frame and applications
Time-frequency analysis and applications
Sparse representation and approximation
Sampling theory and compressive sensing
Wavelet based algorithms and applications
2. Multiresolution:
Multiresolution analysis
Multiscale approximation
Multiresolution image processing and signal processing
Multiresolution representations
Deep learning and neural networks
Machine learning theory, algorithms and applications
High dimensional data analysis
3. Information Processing:
Data sciences
Big data and applications
Information theory
Information systems and technology
Information security
Information learning and processing
Artificial intelligence and pattern recognition
Image/signal processing.