机器学习算法检测假新闻的比较分析:现代信息生态系统中的有效性和准确性

Gregorius Airlangga
{"title":"机器学习算法检测假新闻的比较分析:现代信息生态系统中的有效性和准确性","authors":"Gregorius Airlangga","doi":"10.47709/cnahpc.v6i1.3466","DOIUrl":null,"url":null,"abstract":"In an era where the spread of fake news poses a significant threat to the integrity of the information landscape, the need for effective detection tools is paramount. This study evaluates the efficacy of three machine learning algorithms—Multinomial Naive Bayes, Passive Aggressive Classifier, and Logistic Regression—in distinguishing fake news from genuine articles. Leveraging a balanced dataset, meticulously processed and vectorized through Term Frequency-Inverse Document Frequency (TF-IDF), we subjected each algorithm to a rigorous classification process. The algorithms were evaluated on metrics such as precision, recall, and F1-score, with the Passive Aggressive Classifier outperforming others, achieving a remarkable 0.99 in both precision and recall. Logistic Regression followed with an accuracy of 0.98, while Multinomial Naive Bayes displayed robust recall at 1.00 but lower precision at 0.91, resulting in an accuracy of 0.95. These metrics underscored the nuanced capabilities of each algorithm in correctly identifying fake and real news, with the Passive Aggressive Classifier demonstrating superior balance in performance. The study's findings highlight the potential of employing machine learning techniques in the fight against fake news, with the Passive Aggressive Classifier showing promise due to its high accuracy and balanced precision-recall trade-off. These insights contribute to the ongoing efforts in digital media to develop advanced, ethical, and accurate tools for maintaining information veracity. Future research should continue to refine these models, ensuring their applicability in diverse and evolving news ecosystems.","PeriodicalId":15605,"journal":{"name":"Journal Of Computer Networks, Architecture and High Performance Computing","volume":"52 20","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative Analysis of Machine Learning Algorithms for Detecting Fake News: Efficacy and Accuracy in the Modern Information Ecosystem\",\"authors\":\"Gregorius Airlangga\",\"doi\":\"10.47709/cnahpc.v6i1.3466\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In an era where the spread of fake news poses a significant threat to the integrity of the information landscape, the need for effective detection tools is paramount. This study evaluates the efficacy of three machine learning algorithms—Multinomial Naive Bayes, Passive Aggressive Classifier, and Logistic Regression—in distinguishing fake news from genuine articles. Leveraging a balanced dataset, meticulously processed and vectorized through Term Frequency-Inverse Document Frequency (TF-IDF), we subjected each algorithm to a rigorous classification process. The algorithms were evaluated on metrics such as precision, recall, and F1-score, with the Passive Aggressive Classifier outperforming others, achieving a remarkable 0.99 in both precision and recall. Logistic Regression followed with an accuracy of 0.98, while Multinomial Naive Bayes displayed robust recall at 1.00 but lower precision at 0.91, resulting in an accuracy of 0.95. These metrics underscored the nuanced capabilities of each algorithm in correctly identifying fake and real news, with the Passive Aggressive Classifier demonstrating superior balance in performance. The study's findings highlight the potential of employing machine learning techniques in the fight against fake news, with the Passive Aggressive Classifier showing promise due to its high accuracy and balanced precision-recall trade-off. These insights contribute to the ongoing efforts in digital media to develop advanced, ethical, and accurate tools for maintaining information veracity. Future research should continue to refine these models, ensuring their applicability in diverse and evolving news ecosystems.\",\"PeriodicalId\":15605,\"journal\":{\"name\":\"Journal Of Computer Networks, Architecture and High Performance Computing\",\"volume\":\"52 20\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal Of Computer Networks, Architecture and High Performance Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47709/cnahpc.v6i1.3466\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal Of Computer Networks, Architecture and High Performance Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47709/cnahpc.v6i1.3466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在假新闻的传播对信息环境的完整性构成重大威胁的时代,最需要的是有效的检测工具。本研究评估了三种机器学习算法--多项式 Naive Bayes、被动攻击分类器和逻辑回归--在区分假新闻和真文章方面的功效。我们利用通过词频-反向文档频率(TF-IDF)精心处理和矢量化的平衡数据集,对每种算法进行了严格的分类处理。我们根据精确度、召回率和 F1 分数等指标对这些算法进行了评估,其中被动进取分类器的表现优于其他算法,在精确度和召回率方面都达到了惊人的 0.99。逻辑回归以 0.98 的准确率紧随其后,而多项式 Naive Bayes 的召回率为 1.00,表现强劲,但准确率较低,为 0.91,准确率为 0.95。这些指标强调了每种算法在正确识别假新闻和真新闻方面的细微差别,其中被动攻击分类器在性能平衡方面表现出色。研究结果凸显了机器学习技术在打击假新闻方面的潜力,其中被动攻击型分类器因其高精度和平衡的精度-召回权衡而显示出前景。这些见解有助于数字媒体不断努力开发先进、道德和准确的工具,以维护信息的真实性。未来的研究应继续完善这些模型,确保它们适用于多样化和不断发展的新闻生态系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comparative Analysis of Machine Learning Algorithms for Detecting Fake News: Efficacy and Accuracy in the Modern Information Ecosystem
In an era where the spread of fake news poses a significant threat to the integrity of the information landscape, the need for effective detection tools is paramount. This study evaluates the efficacy of three machine learning algorithms—Multinomial Naive Bayes, Passive Aggressive Classifier, and Logistic Regression—in distinguishing fake news from genuine articles. Leveraging a balanced dataset, meticulously processed and vectorized through Term Frequency-Inverse Document Frequency (TF-IDF), we subjected each algorithm to a rigorous classification process. The algorithms were evaluated on metrics such as precision, recall, and F1-score, with the Passive Aggressive Classifier outperforming others, achieving a remarkable 0.99 in both precision and recall. Logistic Regression followed with an accuracy of 0.98, while Multinomial Naive Bayes displayed robust recall at 1.00 but lower precision at 0.91, resulting in an accuracy of 0.95. These metrics underscored the nuanced capabilities of each algorithm in correctly identifying fake and real news, with the Passive Aggressive Classifier demonstrating superior balance in performance. The study's findings highlight the potential of employing machine learning techniques in the fight against fake news, with the Passive Aggressive Classifier showing promise due to its high accuracy and balanced precision-recall trade-off. These insights contribute to the ongoing efforts in digital media to develop advanced, ethical, and accurate tools for maintaining information veracity. Future research should continue to refine these models, ensuring their applicability in diverse and evolving news ecosystems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信