{"title":"CredBERT: Credibility-aware BERT model for fake news detection","authors":"Anju R., Nargis Pervin","doi":"10.1016/j.datak.2025.102461","DOIUrl":null,"url":null,"abstract":"<div><div>The spread of fake news on social media poses significant challenges, especially in distinguishing credible sources from unreliable ones. Existing methods primarily rely on text analysis, often neglecting user credibility, a key factor in enhancing detection accuracy. To address this, we propose CredBERT, a framework that combines credibility scores derived from user interactions and domain expertise with BERT-based text embeddings. CredBERT employs a multi-classifier ensemble, integrating Multi-Layer Perceptron (MLP), Convolutional Neural Networks (CNN), BiLSTM, Logistic Regression, and k-Nearest Neighbors, with predictions aggregated using majority voting, ensuring robust performance across both balanced and imbalanced class datasets. This approach effectively merges user credibility with content-based features, improving prediction accuracy and reducing biases. Compared to state-of-the art baselines FakeBERT and BiLSTM, CredBERT achieves 6.45% and 4.21% higher accuracy, respectively. By evaluating user credibility and content features, our model not only enhances fake news detection but also contributes to mitigating misinformation by identifying unreliable sources.</div></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"160 ","pages":"Article 102461"},"PeriodicalIF":2.7000,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data & Knowledge Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169023X25000564","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The spread of fake news on social media poses significant challenges, especially in distinguishing credible sources from unreliable ones. Existing methods primarily rely on text analysis, often neglecting user credibility, a key factor in enhancing detection accuracy. To address this, we propose CredBERT, a framework that combines credibility scores derived from user interactions and domain expertise with BERT-based text embeddings. CredBERT employs a multi-classifier ensemble, integrating Multi-Layer Perceptron (MLP), Convolutional Neural Networks (CNN), BiLSTM, Logistic Regression, and k-Nearest Neighbors, with predictions aggregated using majority voting, ensuring robust performance across both balanced and imbalanced class datasets. This approach effectively merges user credibility with content-based features, improving prediction accuracy and reducing biases. Compared to state-of-the art baselines FakeBERT and BiLSTM, CredBERT achieves 6.45% and 4.21% higher accuracy, respectively. By evaluating user credibility and content features, our model not only enhances fake news detection but also contributes to mitigating misinformation by identifying unreliable sources.
期刊介绍:
Data & Knowledge Engineering (DKE) stimulates the exchange of ideas and interaction between these two related fields of interest. DKE reaches a world-wide audience of researchers, designers, managers and users. The major aim of the journal is to identify, investigate and analyze the underlying principles in the design and effective use of these systems.