{"title":"带标签置信度优化的弱监督学习的正交和球面四元数特征","authors":"Heng Zhou, Ping Zhong","doi":"10.1007/s10489-025-06575-2","DOIUrl":null,"url":null,"abstract":"<div><p>Weakly supervised learning (WSL) addresses the challenge of incomplete or noisy labels, but current methods often fail to capture the complexities introduced by weak labels in feature extraction, revealing the limitations of neural networks in modeling the intricate relationships between features and labels. To address these issues, we introduce the Orthogonal and Spherical Quaternion Neural Network (OSQNN), which utilizes quaternion feature embedding with an orthogonal constraint to map real-valued features into quaternion space. This approach improves the understanding of feature-label relationships by overcoming the challenge of embedding real-world data into quaternion spaces. OSQNN maps quaternion features onto a sphere and estimates label reliability through nearest neighbors, maintaining a coherent geometric structure in feature distributions. Furthermore, quaternion convolutions are transformed into parallel grouped real-valued convolutions, enhancing processing efficiency without sacrificing the benefits of quaternion-based computations. Additionally, we propose the Label Confidence Guided Expectation-Maximization (LCGEM) algorithm, integrated into OSQNN, to more effectively capture the complex relationships between weak labels and feature distributions. Experimental results across eight datasets demonstrate the superiority of OSQNN. For instance, in SSL on CIFAR10 (20% labeled data) and CIFAR100, it achieved 91.06% and 69.16% accuracy respectively; in NSL with 40% incorrect labels on CIFAR10 and CIFAR100, the accuracies were 80.84% and 51.98%, showing its high accuracy and robustness. The ablation study highlights the role of the orthogonal constraint and spherical feature mapping in improving performance, while t-SNE visualization confirms the ability of OSQNN to learn discriminative feature representations.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 10","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Orthogonal and spherical quaternion features for weakly supervised learning with label confidence optimization\",\"authors\":\"Heng Zhou, Ping Zhong\",\"doi\":\"10.1007/s10489-025-06575-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Weakly supervised learning (WSL) addresses the challenge of incomplete or noisy labels, but current methods often fail to capture the complexities introduced by weak labels in feature extraction, revealing the limitations of neural networks in modeling the intricate relationships between features and labels. To address these issues, we introduce the Orthogonal and Spherical Quaternion Neural Network (OSQNN), which utilizes quaternion feature embedding with an orthogonal constraint to map real-valued features into quaternion space. This approach improves the understanding of feature-label relationships by overcoming the challenge of embedding real-world data into quaternion spaces. OSQNN maps quaternion features onto a sphere and estimates label reliability through nearest neighbors, maintaining a coherent geometric structure in feature distributions. Furthermore, quaternion convolutions are transformed into parallel grouped real-valued convolutions, enhancing processing efficiency without sacrificing the benefits of quaternion-based computations. Additionally, we propose the Label Confidence Guided Expectation-Maximization (LCGEM) algorithm, integrated into OSQNN, to more effectively capture the complex relationships between weak labels and feature distributions. Experimental results across eight datasets demonstrate the superiority of OSQNN. For instance, in SSL on CIFAR10 (20% labeled data) and CIFAR100, it achieved 91.06% and 69.16% accuracy respectively; in NSL with 40% incorrect labels on CIFAR10 and CIFAR100, the accuracies were 80.84% and 51.98%, showing its high accuracy and robustness. The ablation study highlights the role of the orthogonal constraint and spherical feature mapping in improving performance, while t-SNE visualization confirms the ability of OSQNN to learn discriminative feature representations.</p></div>\",\"PeriodicalId\":8041,\"journal\":{\"name\":\"Applied Intelligence\",\"volume\":\"55 10\",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10489-025-06575-2\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06575-2","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Orthogonal and spherical quaternion features for weakly supervised learning with label confidence optimization
Weakly supervised learning (WSL) addresses the challenge of incomplete or noisy labels, but current methods often fail to capture the complexities introduced by weak labels in feature extraction, revealing the limitations of neural networks in modeling the intricate relationships between features and labels. To address these issues, we introduce the Orthogonal and Spherical Quaternion Neural Network (OSQNN), which utilizes quaternion feature embedding with an orthogonal constraint to map real-valued features into quaternion space. This approach improves the understanding of feature-label relationships by overcoming the challenge of embedding real-world data into quaternion spaces. OSQNN maps quaternion features onto a sphere and estimates label reliability through nearest neighbors, maintaining a coherent geometric structure in feature distributions. Furthermore, quaternion convolutions are transformed into parallel grouped real-valued convolutions, enhancing processing efficiency without sacrificing the benefits of quaternion-based computations. Additionally, we propose the Label Confidence Guided Expectation-Maximization (LCGEM) algorithm, integrated into OSQNN, to more effectively capture the complex relationships between weak labels and feature distributions. Experimental results across eight datasets demonstrate the superiority of OSQNN. For instance, in SSL on CIFAR10 (20% labeled data) and CIFAR100, it achieved 91.06% and 69.16% accuracy respectively; in NSL with 40% incorrect labels on CIFAR10 and CIFAR100, the accuracies were 80.84% and 51.98%, showing its high accuracy and robustness. The ablation study highlights the role of the orthogonal constraint and spherical feature mapping in improving performance, while t-SNE visualization confirms the ability of OSQNN to learn discriminative feature representations.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.