{"title":"基于生成对抗网络和Kolmogorov-Arnold网络的甲状腺疾病分类。","authors":"Aysel Topşir, Ferdi Güler, Ecesu Çetin, Mehmet Furkan Burak, Melih Ağraz","doi":"10.1186/s12911-025-03014-7","DOIUrl":null,"url":null,"abstract":"<p><p>Thyroid disease classification is a critical challenge in medical diagnostics, requiring accurate differentiation between hyperthyroidism, hypothyroidism, and normal thyroid function. This study introduces an advanced machine learning approach that integrates generative adversarial networks (GANs) for data augmentation and Kolmogorov-Arnold networks (KANs) for classification. Various machine learning models including logistic regression, random forest, support vector machines, multilayer perceptrons, and KANs were trained and evaluated. The results indicate that the application of GAN-based data augmentation has significantly improved classification accuracy, particularly for minority classes. Specifically, the KAN model achieved an accuracy of 98.68% and random forest (RF) F1-score of 98.00%, outperforming traditional neural network applications. The results demonstrate that GAN-augmented datasets significantly improve classification accuracy, and the KAN model achieves superior performance and generalization capabilities compared to traditional neural networks. Additionally, the SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) were employed to ensure model transparency and interpretability. These explainability methods highlight thyroid stimulating hormone as the most prominent feature in classification, further supporting its clinical utility in the diagnosis of thyroid diseases. The findings underscore the potential of advanced AI-driven techniques in improving thyroid disease classification, addressing class imbalance, and enhancing explainability in healthcare applications. By leveraging synthetic data generation, this study provides a feasible framework for actual clinical application, particularly in situations where clinical data are limited or imbalanced. The integration of GANs and KANs enhances diagnostic accuracy while preserving robustness and generalizability to diverse patient populations. Besides, the approach fosters the deployment of explainable AI models in clinical decision support systems so that healthcare practitioners can make improved and more reliable decisions, thus leading to better patient outcomes and resource allocation.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"284"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12315378/pdf/","citationCount":"0","resultStr":"{\"title\":\"Thyroid disease classification using generative adversarial networks and Kolmogorov-Arnold network for three-class classification.\",\"authors\":\"Aysel Topşir, Ferdi Güler, Ecesu Çetin, Mehmet Furkan Burak, Melih Ağraz\",\"doi\":\"10.1186/s12911-025-03014-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Thyroid disease classification is a critical challenge in medical diagnostics, requiring accurate differentiation between hyperthyroidism, hypothyroidism, and normal thyroid function. This study introduces an advanced machine learning approach that integrates generative adversarial networks (GANs) for data augmentation and Kolmogorov-Arnold networks (KANs) for classification. Various machine learning models including logistic regression, random forest, support vector machines, multilayer perceptrons, and KANs were trained and evaluated. The results indicate that the application of GAN-based data augmentation has significantly improved classification accuracy, particularly for minority classes. Specifically, the KAN model achieved an accuracy of 98.68% and random forest (RF) F1-score of 98.00%, outperforming traditional neural network applications. The results demonstrate that GAN-augmented datasets significantly improve classification accuracy, and the KAN model achieves superior performance and generalization capabilities compared to traditional neural networks. Additionally, the SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) were employed to ensure model transparency and interpretability. These explainability methods highlight thyroid stimulating hormone as the most prominent feature in classification, further supporting its clinical utility in the diagnosis of thyroid diseases. The findings underscore the potential of advanced AI-driven techniques in improving thyroid disease classification, addressing class imbalance, and enhancing explainability in healthcare applications. By leveraging synthetic data generation, this study provides a feasible framework for actual clinical application, particularly in situations where clinical data are limited or imbalanced. The integration of GANs and KANs enhances diagnostic accuracy while preserving robustness and generalizability to diverse patient populations. Besides, the approach fosters the deployment of explainable AI models in clinical decision support systems so that healthcare practitioners can make improved and more reliable decisions, thus leading to better patient outcomes and resource allocation.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"25 1\",\"pages\":\"284\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12315378/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-025-03014-7\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03014-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
摘要
甲状腺疾病的分类是医学诊断中的一个关键挑战,需要准确区分甲状腺功能亢进、甲状腺功能减退和甲状腺功能正常。本研究引入了一种先进的机器学习方法,该方法集成了用于数据增强的生成对抗网络(gan)和用于分类的Kolmogorov-Arnold网络(KANs)。各种机器学习模型,包括逻辑回归、随机森林、支持向量机、多层感知器和KANs进行了训练和评估。结果表明,基于gan的数据增强的应用显著提高了分类精度,特别是对于少数类。具体而言,KAN模型的准确率为98.68%,随机森林(RF) f1得分为98.00%,优于传统的神经网络应用。结果表明,gan增强的数据集显著提高了分类精度,与传统神经网络相比,KAN模型具有更好的性能和泛化能力。此外,采用SHapley加性解释(SHapley Additive interpretation)和LIME (Local Interpretable model -agnostic interpretation)来保证模型的透明性和可解释性。这些解释方法突出了促甲状腺激素作为分类中最突出的特征,进一步支持了其在甲状腺疾病诊断中的临床应用。研究结果强调了先进的人工智能驱动技术在改善甲状腺疾病分类、解决类别不平衡和增强医疗保健应用中的可解释性方面的潜力。通过利用合成数据生成,本研究为实际临床应用提供了一个可行的框架,特别是在临床数据有限或不平衡的情况下。gan和KANs的集成提高了诊断准确性,同时保持了对不同患者群体的鲁棒性和通用性。此外,该方法促进在临床决策支持系统中部署可解释的人工智能模型,使医疗从业者能够做出更好、更可靠的决策,从而改善患者的治疗效果和资源分配。
Thyroid disease classification using generative adversarial networks and Kolmogorov-Arnold network for three-class classification.
Thyroid disease classification is a critical challenge in medical diagnostics, requiring accurate differentiation between hyperthyroidism, hypothyroidism, and normal thyroid function. This study introduces an advanced machine learning approach that integrates generative adversarial networks (GANs) for data augmentation and Kolmogorov-Arnold networks (KANs) for classification. Various machine learning models including logistic regression, random forest, support vector machines, multilayer perceptrons, and KANs were trained and evaluated. The results indicate that the application of GAN-based data augmentation has significantly improved classification accuracy, particularly for minority classes. Specifically, the KAN model achieved an accuracy of 98.68% and random forest (RF) F1-score of 98.00%, outperforming traditional neural network applications. The results demonstrate that GAN-augmented datasets significantly improve classification accuracy, and the KAN model achieves superior performance and generalization capabilities compared to traditional neural networks. Additionally, the SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) were employed to ensure model transparency and interpretability. These explainability methods highlight thyroid stimulating hormone as the most prominent feature in classification, further supporting its clinical utility in the diagnosis of thyroid diseases. The findings underscore the potential of advanced AI-driven techniques in improving thyroid disease classification, addressing class imbalance, and enhancing explainability in healthcare applications. By leveraging synthetic data generation, this study provides a feasible framework for actual clinical application, particularly in situations where clinical data are limited or imbalanced. The integration of GANs and KANs enhances diagnostic accuracy while preserving robustness and generalizability to diverse patient populations. Besides, the approach fosters the deployment of explainable AI models in clinical decision support systems so that healthcare practitioners can make improved and more reliable decisions, thus leading to better patient outcomes and resource allocation.
期刊介绍:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.