{"title":"广义零次学习的属性对齐网络","authors":"Nannan Lu, Mingkai Qiu, Jiansheng Qian","doi":"10.1016/j.patrec.2025.09.010","DOIUrl":null,"url":null,"abstract":"<div><div>Part-based embedding methods with attention mechanism achieved outstanding results in zero-shot learning (ZSL). However, affected by intra-class variations in the datasets (i.e., different samples of the same class present different attribute characteristics), it is difficult for models based on traditional attention mechanisms to achieve accurate visual-attribute alignment. To tackle this problem, we propose a novel approach to fully utilize attributes information, referred to as attribute alignment networks (AAN). It consists of the attribute alignment (AA) pipeline and the attribute enhancement (AE) module. AA pipeline is a brand-new solution for part-based embedding method, which realizes visual-attribute alignment in both attribute space and attribute semantic space under the supervision of class attribute vectors and attribute word vectors, respectively. AE module employs the Graph Neural Networks (GNNs) to project visual features to the attribute semantic space. Based on the constructed attribute relation graph (ARG) and self-attention mechanism, AE module generates the enhanced representation of attributes to minimize the influence of intra-class variations. Experiments on standard datasets demonstrate that the enhanced attribute representation greatly improves the classification performance. Overall, AAN outperforms the other state-of-the-art performances in ZSL and GZSL tasks.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"198 ","pages":"Pages 50-56"},"PeriodicalIF":3.3000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attribute alignment networks for generalized zero-shot learning\",\"authors\":\"Nannan Lu, Mingkai Qiu, Jiansheng Qian\",\"doi\":\"10.1016/j.patrec.2025.09.010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Part-based embedding methods with attention mechanism achieved outstanding results in zero-shot learning (ZSL). However, affected by intra-class variations in the datasets (i.e., different samples of the same class present different attribute characteristics), it is difficult for models based on traditional attention mechanisms to achieve accurate visual-attribute alignment. To tackle this problem, we propose a novel approach to fully utilize attributes information, referred to as attribute alignment networks (AAN). It consists of the attribute alignment (AA) pipeline and the attribute enhancement (AE) module. AA pipeline is a brand-new solution for part-based embedding method, which realizes visual-attribute alignment in both attribute space and attribute semantic space under the supervision of class attribute vectors and attribute word vectors, respectively. AE module employs the Graph Neural Networks (GNNs) to project visual features to the attribute semantic space. Based on the constructed attribute relation graph (ARG) and self-attention mechanism, AE module generates the enhanced representation of attributes to minimize the influence of intra-class variations. Experiments on standard datasets demonstrate that the enhanced attribute representation greatly improves the classification performance. Overall, AAN outperforms the other state-of-the-art performances in ZSL and GZSL tasks.</div></div>\",\"PeriodicalId\":54638,\"journal\":{\"name\":\"Pattern Recognition Letters\",\"volume\":\"198 \",\"pages\":\"Pages 50-56\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167865525003204\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525003204","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Attribute alignment networks for generalized zero-shot learning
Part-based embedding methods with attention mechanism achieved outstanding results in zero-shot learning (ZSL). However, affected by intra-class variations in the datasets (i.e., different samples of the same class present different attribute characteristics), it is difficult for models based on traditional attention mechanisms to achieve accurate visual-attribute alignment. To tackle this problem, we propose a novel approach to fully utilize attributes information, referred to as attribute alignment networks (AAN). It consists of the attribute alignment (AA) pipeline and the attribute enhancement (AE) module. AA pipeline is a brand-new solution for part-based embedding method, which realizes visual-attribute alignment in both attribute space and attribute semantic space under the supervision of class attribute vectors and attribute word vectors, respectively. AE module employs the Graph Neural Networks (GNNs) to project visual features to the attribute semantic space. Based on the constructed attribute relation graph (ARG) and self-attention mechanism, AE module generates the enhanced representation of attributes to minimize the influence of intra-class variations. Experiments on standard datasets demonstrate that the enhanced attribute representation greatly improves the classification performance. Overall, AAN outperforms the other state-of-the-art performances in ZSL and GZSL tasks.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.