{"title":"基于LIME和shap可解释性的假新闻检测自适应集成分类器","authors":"Ashima Kukkar, Gagandeep Kaur","doi":"10.1016/j.eswa.2025.127751","DOIUrl":null,"url":null,"abstract":"<div><div>The constant availability of fake news on social media and other information-sharing platforms has raised a demand for efficient and effective fake news detection models. Current solutions propose the principle of static feature extraction, the dependence on a single classifier, or modest results that do not meet the requirements in today’s environment with its complexity of data. As a result of these, this study presents the Adaptive Ensemble Classifier (AEC), a new ensemble system that consists of hybrid decision trees and Support Vector Machine (SVM) similar to margin optimization for the improvement of classification performance. The proposed AEC incorporates several innovative features: dynamic feature selection through adaptive neighbourhood selection based on feature importance, SVM-based refinement of decision boundaries for improved precision, and a weighted ensemble voting mechanism to ensure robust predictions. In addition, to ensuring explain ability, the system uses LIME and SHAP to provide probability-based explanations for the predictions and the features that influence the results. The performance of the AEC is evaluated using public datasets such as the Fake News dataset and cross-domain performance using COVID-19 Fake News Dataset. Experimental results confirmed that the proposed model achieved an impressive accuracy of 99.74% compared to traditional Machine Learning (ML) and Deep Learning (DL) models in particular in aspects of accuracy, precision, recall, and F1 score. The computational efficiency is also evaluated by comparing training time, memory usage, peak memory usage, inference time, and model size with existing models. The interpretability offered by LIME focuses on the most important features that affect predictions, which makes the system more useful in real-world situations. Finally, the different statistical analysis tests are also employed on proposed AEC and existing ML and DL models such as Paired <em>t</em>-test, Kruskal-Wallis, Dunn’s, Bootstrap Resampling, Cohen’s d Effect Size and Confidence Intervals. The results showed that significant performance difference between proposed AEC and other models.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"281 ","pages":"Article 127751"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AEC: A novel adaptive ensemble classifier with LIME and SHAP-Based interpretability for fake news detection\",\"authors\":\"Ashima Kukkar, Gagandeep Kaur\",\"doi\":\"10.1016/j.eswa.2025.127751\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The constant availability of fake news on social media and other information-sharing platforms has raised a demand for efficient and effective fake news detection models. Current solutions propose the principle of static feature extraction, the dependence on a single classifier, or modest results that do not meet the requirements in today’s environment with its complexity of data. As a result of these, this study presents the Adaptive Ensemble Classifier (AEC), a new ensemble system that consists of hybrid decision trees and Support Vector Machine (SVM) similar to margin optimization for the improvement of classification performance. The proposed AEC incorporates several innovative features: dynamic feature selection through adaptive neighbourhood selection based on feature importance, SVM-based refinement of decision boundaries for improved precision, and a weighted ensemble voting mechanism to ensure robust predictions. In addition, to ensuring explain ability, the system uses LIME and SHAP to provide probability-based explanations for the predictions and the features that influence the results. The performance of the AEC is evaluated using public datasets such as the Fake News dataset and cross-domain performance using COVID-19 Fake News Dataset. Experimental results confirmed that the proposed model achieved an impressive accuracy of 99.74% compared to traditional Machine Learning (ML) and Deep Learning (DL) models in particular in aspects of accuracy, precision, recall, and F1 score. The computational efficiency is also evaluated by comparing training time, memory usage, peak memory usage, inference time, and model size with existing models. The interpretability offered by LIME focuses on the most important features that affect predictions, which makes the system more useful in real-world situations. Finally, the different statistical analysis tests are also employed on proposed AEC and existing ML and DL models such as Paired <em>t</em>-test, Kruskal-Wallis, Dunn’s, Bootstrap Resampling, Cohen’s d Effect Size and Confidence Intervals. The results showed that significant performance difference between proposed AEC and other models.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"281 \",\"pages\":\"Article 127751\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425013739\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425013739","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
AEC: A novel adaptive ensemble classifier with LIME and SHAP-Based interpretability for fake news detection
The constant availability of fake news on social media and other information-sharing platforms has raised a demand for efficient and effective fake news detection models. Current solutions propose the principle of static feature extraction, the dependence on a single classifier, or modest results that do not meet the requirements in today’s environment with its complexity of data. As a result of these, this study presents the Adaptive Ensemble Classifier (AEC), a new ensemble system that consists of hybrid decision trees and Support Vector Machine (SVM) similar to margin optimization for the improvement of classification performance. The proposed AEC incorporates several innovative features: dynamic feature selection through adaptive neighbourhood selection based on feature importance, SVM-based refinement of decision boundaries for improved precision, and a weighted ensemble voting mechanism to ensure robust predictions. In addition, to ensuring explain ability, the system uses LIME and SHAP to provide probability-based explanations for the predictions and the features that influence the results. The performance of the AEC is evaluated using public datasets such as the Fake News dataset and cross-domain performance using COVID-19 Fake News Dataset. Experimental results confirmed that the proposed model achieved an impressive accuracy of 99.74% compared to traditional Machine Learning (ML) and Deep Learning (DL) models in particular in aspects of accuracy, precision, recall, and F1 score. The computational efficiency is also evaluated by comparing training time, memory usage, peak memory usage, inference time, and model size with existing models. The interpretability offered by LIME focuses on the most important features that affect predictions, which makes the system more useful in real-world situations. Finally, the different statistical analysis tests are also employed on proposed AEC and existing ML and DL models such as Paired t-test, Kruskal-Wallis, Dunn’s, Bootstrap Resampling, Cohen’s d Effect Size and Confidence Intervals. The results showed that significant performance difference between proposed AEC and other models.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.