Khondaker Tasrif Noor , Wei Luo , Antonio Robles-Kelly , Leo Yu Zhang , Mohamed Reda Bouadjenek
{"title":"Taxonomy-guided routing in capsule network for hierarchical image classification","authors":"Khondaker Tasrif Noor , Wei Luo , Antonio Robles-Kelly , Leo Yu Zhang , Mohamed Reda Bouadjenek","doi":"10.1016/j.knosys.2025.114444","DOIUrl":null,"url":null,"abstract":"<div><div>Hierarchical multi-label classification in computer vision presents significant challenges in maintaining consistency across different levels of class granularity while capturing fine-grained visual details. This paper presents Taxonomy-aware Capsule Network (HT-CapsNet), a novel capsule network architecture that explicitly incorporates taxonomic relationships into its routing mechanism to address these challenges. Our key innovation lies in a taxonomy-aware routing algorithm that dynamically adjusts capsule connections based on known hierarchical relationships, enabling more effective learning of hierarchical features while enforcing taxonomic consistency. Extensive experiments on six benchmark datasets, including Fashion-MNIST, Marine-Tree, CIFAR-10, CIFAR-100, CUB-200-2011, and Stanford Cars, demonstrate that HT-CapsNet significantly outperforms existing methods across various hierarchical classification metrics. Notably, on CUB-200-2011, HT-CapsNet achieves absolute improvements of <span><math><mrow><mn>10.32</mn><mspace></mspace><mo>%</mo></mrow></math></span>, <span><math><mrow><mn>10.2</mn><mspace></mspace><mo>%</mo></mrow></math></span>, <span><math><mrow><mn>10.3</mn><mspace></mspace><mo>%</mo></mrow></math></span>, and <span><math><mrow><mn>8.55</mn><mspace></mspace><mo>%</mo></mrow></math></span> in hierarchical accuracy, F1-score, consistency, and exact match, respectively, compared to the best-performing baseline. On the Stanford Cars dataset, the model improves upon the best baseline by <span><math><mrow><mn>21.69</mn><mspace></mspace><mo>%</mo></mrow></math></span>, <span><math><mrow><mn>18.29</mn><mspace></mspace><mo>%</mo></mrow></math></span>, <span><math><mrow><mn>37.34</mn><mspace></mspace><mo>%</mo></mrow></math></span>, and <span><math><mrow><mn>19.95</mn><mspace></mspace><mo>%</mo></mrow></math></span> in the same metrics, demonstrating the robustness and effectiveness of our approach for complex hierarchical classification tasks.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"329 ","pages":"Article 114444"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125014832","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Hierarchical multi-label classification in computer vision presents significant challenges in maintaining consistency across different levels of class granularity while capturing fine-grained visual details. This paper presents Taxonomy-aware Capsule Network (HT-CapsNet), a novel capsule network architecture that explicitly incorporates taxonomic relationships into its routing mechanism to address these challenges. Our key innovation lies in a taxonomy-aware routing algorithm that dynamically adjusts capsule connections based on known hierarchical relationships, enabling more effective learning of hierarchical features while enforcing taxonomic consistency. Extensive experiments on six benchmark datasets, including Fashion-MNIST, Marine-Tree, CIFAR-10, CIFAR-100, CUB-200-2011, and Stanford Cars, demonstrate that HT-CapsNet significantly outperforms existing methods across various hierarchical classification metrics. Notably, on CUB-200-2011, HT-CapsNet achieves absolute improvements of , , , and in hierarchical accuracy, F1-score, consistency, and exact match, respectively, compared to the best-performing baseline. On the Stanford Cars dataset, the model improves upon the best baseline by , , , and in the same metrics, demonstrating the robustness and effectiveness of our approach for complex hierarchical classification tasks.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.