{"title":"基于启发式算法的大数据驱动最优加权融合特征集成学习甲状腺预测分类器","authors":"K. Hema Priya, K. Valarmathi","doi":"10.1007/s10878-025-01304-4","DOIUrl":null,"url":null,"abstract":"<p>Diagnosis of thyroid disease is a most important cause in the field of medicinal research and it is a complex onset axiom. Secretion of Thyroid hormone plays a major role in the regulation of metabolism. Hence, it is very significant to predict thyroid disease in the initial stage, which is helpful for preventing more serious health complications due to thyroid cancer. The diagnostic accuracy of machine leaning-based approaches is greater but these techniques require large amounts of data for the diagnosis process. In the conventional approaches, the time needed for the prediction process is also high. Feature engineering is less investigated in conventional models and hence error produced during the prediction process is high. Hence, in this research work, a machine learning-aided thyroid disease prediction technique is designed to provide higher prediction accuracy and reliability. Initially, the thyroid data is gathered from the standard benchmark resources. Next, the data transformation process is carried out to make the data usable for analysis and visualization. After, the features are extracted using Principal Component Analysis (PCA), “One-Dimensional Convolutional Neural Network Model (1DCNN). Moreover, the statistical features are also extracted for getting more relevant information from the data. The three sets of features such as PCA-based, 1DCNN-based and statistical are concatenated and fed to the “optimal weighted feature selection” process, where the optimal features and weights are tuned by an Improved Archimedes Optimization Algorithm (IAOA). Next, the selected optimally fused features are given to the Ensemble Learning (EL) for predicting the thyroid diseases, where the EL with be suggested by incorporating stacking classifier, XGboost, and Multivariate regression classifier. Ensembling of three different classifiers provides higher thyroid disease prediction accuracy and it makes the decision about normal and abnormal classes. Here, the same IAOA is used for optimizing the parameters of every classifier. The investigational outcomes demonstrate that the proposed ensemble classifier provides higher performance than others. Experimental results prove that the thyroid prediction accuracy of the developed EL approach is 96.30%, precision is 99.67% and F1-score is 97.93%, which is more extensive than the state-of-the-art approaches.</p>","PeriodicalId":50231,"journal":{"name":"Journal of Combinatorial Optimization","volume":"17 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Big data-driven optimal weighted fused features-based ensemble learning classifier for thyroid prediction with heuristic algorithm\",\"authors\":\"K. Hema Priya, K. Valarmathi\",\"doi\":\"10.1007/s10878-025-01304-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Diagnosis of thyroid disease is a most important cause in the field of medicinal research and it is a complex onset axiom. Secretion of Thyroid hormone plays a major role in the regulation of metabolism. Hence, it is very significant to predict thyroid disease in the initial stage, which is helpful for preventing more serious health complications due to thyroid cancer. The diagnostic accuracy of machine leaning-based approaches is greater but these techniques require large amounts of data for the diagnosis process. In the conventional approaches, the time needed for the prediction process is also high. Feature engineering is less investigated in conventional models and hence error produced during the prediction process is high. Hence, in this research work, a machine learning-aided thyroid disease prediction technique is designed to provide higher prediction accuracy and reliability. Initially, the thyroid data is gathered from the standard benchmark resources. Next, the data transformation process is carried out to make the data usable for analysis and visualization. After, the features are extracted using Principal Component Analysis (PCA), “One-Dimensional Convolutional Neural Network Model (1DCNN). Moreover, the statistical features are also extracted for getting more relevant information from the data. The three sets of features such as PCA-based, 1DCNN-based and statistical are concatenated and fed to the “optimal weighted feature selection” process, where the optimal features and weights are tuned by an Improved Archimedes Optimization Algorithm (IAOA). Next, the selected optimally fused features are given to the Ensemble Learning (EL) for predicting the thyroid diseases, where the EL with be suggested by incorporating stacking classifier, XGboost, and Multivariate regression classifier. Ensembling of three different classifiers provides higher thyroid disease prediction accuracy and it makes the decision about normal and abnormal classes. Here, the same IAOA is used for optimizing the parameters of every classifier. The investigational outcomes demonstrate that the proposed ensemble classifier provides higher performance than others. Experimental results prove that the thyroid prediction accuracy of the developed EL approach is 96.30%, precision is 99.67% and F1-score is 97.93%, which is more extensive than the state-of-the-art approaches.</p>\",\"PeriodicalId\":50231,\"journal\":{\"name\":\"Journal of Combinatorial Optimization\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2025-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Combinatorial Optimization\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1007/s10878-025-01304-4\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Combinatorial Optimization","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10878-025-01304-4","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Big data-driven optimal weighted fused features-based ensemble learning classifier for thyroid prediction with heuristic algorithm
Diagnosis of thyroid disease is a most important cause in the field of medicinal research and it is a complex onset axiom. Secretion of Thyroid hormone plays a major role in the regulation of metabolism. Hence, it is very significant to predict thyroid disease in the initial stage, which is helpful for preventing more serious health complications due to thyroid cancer. The diagnostic accuracy of machine leaning-based approaches is greater but these techniques require large amounts of data for the diagnosis process. In the conventional approaches, the time needed for the prediction process is also high. Feature engineering is less investigated in conventional models and hence error produced during the prediction process is high. Hence, in this research work, a machine learning-aided thyroid disease prediction technique is designed to provide higher prediction accuracy and reliability. Initially, the thyroid data is gathered from the standard benchmark resources. Next, the data transformation process is carried out to make the data usable for analysis and visualization. After, the features are extracted using Principal Component Analysis (PCA), “One-Dimensional Convolutional Neural Network Model (1DCNN). Moreover, the statistical features are also extracted for getting more relevant information from the data. The three sets of features such as PCA-based, 1DCNN-based and statistical are concatenated and fed to the “optimal weighted feature selection” process, where the optimal features and weights are tuned by an Improved Archimedes Optimization Algorithm (IAOA). Next, the selected optimally fused features are given to the Ensemble Learning (EL) for predicting the thyroid diseases, where the EL with be suggested by incorporating stacking classifier, XGboost, and Multivariate regression classifier. Ensembling of three different classifiers provides higher thyroid disease prediction accuracy and it makes the decision about normal and abnormal classes. Here, the same IAOA is used for optimizing the parameters of every classifier. The investigational outcomes demonstrate that the proposed ensemble classifier provides higher performance than others. Experimental results prove that the thyroid prediction accuracy of the developed EL approach is 96.30%, precision is 99.67% and F1-score is 97.93%, which is more extensive than the state-of-the-art approaches.
期刊介绍:
The objective of Journal of Combinatorial Optimization is to advance and promote the theory and applications of combinatorial optimization, which is an area of research at the intersection of applied mathematics, computer science, and operations research and which overlaps with many other areas such as computation complexity, computational biology, VLSI design, communication networks, and management science. It includes complexity analysis and algorithm design for combinatorial optimization problems, numerical experiments and problem discovery with applications in science and engineering.
The Journal of Combinatorial Optimization publishes refereed papers dealing with all theoretical, computational and applied aspects of combinatorial optimization. It also publishes reviews of appropriate books and special issues of journals.