Data-driven approaches into political orientation and news outlet discrimination: The case of news articles in South Korea

IF 7.6 2区 管理学 Q1 INFORMATION SCIENCE & LIBRARY SCIENCE
Jungkyun Lee , Junyeop Cha , Eunil Park
{"title":"Data-driven approaches into political orientation and news outlet discrimination: The case of news articles in South Korea","authors":"Jungkyun Lee ,&nbsp;Junyeop Cha ,&nbsp;Eunil Park","doi":"10.1016/j.tele.2023.102066","DOIUrl":null,"url":null,"abstract":"<div><p>With the advancement of the internet, the public now has easy access to news from various media outlets. However, a number of news outlets tend to report their content based on their political orientations or affiliations, which may compromise the objectivity of the news. This research used machine learning to analyze whether it is possible to tell the political orientation and news outlet apart based on news articles in South Korea. We collected a lot of news articles spanning over five years and used the text data. We chose major conservative and progressive news outlets and tried classifying them into two groups. We even looked into classifying articles by each news outlet. We used different machine learning methods like Logistic Regression, Random Forest Classifier, and eXtreme Gradient Boosting, and tried to improve the performance by combining these models. The research found that the combined model had high accuracy, up to 91.9% for binary classification of news outlet political orientations and up to 84.0% for classifying news outlets in multiple categories. This shows that you can determine the political leaning of news outlets based on their articles, highlighting the importance of considering bias in news outlets when evaluating information instead of solely relying on the article content.</p></div>","PeriodicalId":48257,"journal":{"name":"Telematics and Informatics","volume":"85 ","pages":"Article 102066"},"PeriodicalIF":7.6000,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Telematics and Informatics","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736585323001302","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

With the advancement of the internet, the public now has easy access to news from various media outlets. However, a number of news outlets tend to report their content based on their political orientations or affiliations, which may compromise the objectivity of the news. This research used machine learning to analyze whether it is possible to tell the political orientation and news outlet apart based on news articles in South Korea. We collected a lot of news articles spanning over five years and used the text data. We chose major conservative and progressive news outlets and tried classifying them into two groups. We even looked into classifying articles by each news outlet. We used different machine learning methods like Logistic Regression, Random Forest Classifier, and eXtreme Gradient Boosting, and tried to improve the performance by combining these models. The research found that the combined model had high accuracy, up to 91.9% for binary classification of news outlet political orientations and up to 84.0% for classifying news outlets in multiple categories. This shows that you can determine the political leaning of news outlets based on their articles, highlighting the importance of considering bias in news outlets when evaluating information instead of solely relying on the article content.

政治取向和新闻媒体歧视的数据驱动方法:以韩国新闻文章为例
随着互联网的发展,公众现在可以很容易地从各种媒体渠道获取新闻。然而,一些新闻媒体倾向于根据他们的政治取向或从属关系报道他们的内容,这可能会损害新闻的客观性。该研究利用机器学习,分析了以韩国的新闻报道为基础,区分政治倾向和新闻媒体的可能性。我们收集了五年来的大量新闻文章,并使用了文本数据。我们选择了主要的保守派和进步派新闻媒体,并尝试将它们分为两组。我们甚至研究了按每个新闻媒体对文章进行分类。我们使用了不同的机器学习方法,如逻辑回归、随机森林分类器和极端梯度增强,并试图通过组合这些模型来提高性能。研究发现,该组合模型具有较高的准确率,对新闻媒体政治倾向的二元分类准确率高达91.9%,对新闻媒体多类分类准确率高达84.0%。这表明,你可以根据新闻媒体的文章来确定其政治倾向,突出了在评估信息时考虑新闻媒体偏见的重要性,而不是仅仅依靠文章内容。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Telematics and Informatics
Telematics and Informatics INFORMATION SCIENCE & LIBRARY SCIENCE-
CiteScore
17.00
自引率
4.70%
发文量
104
审稿时长
24 days
期刊介绍: Telematics and Informatics is an interdisciplinary journal that publishes cutting-edge theoretical and methodological research exploring the social, economic, geographic, political, and cultural impacts of digital technologies. It covers various application areas, such as smart cities, sensors, information fusion, digital society, IoT, cyber-physical technologies, privacy, knowledge management, distributed work, emergency response, mobile communications, health informatics, social media's psychosocial effects, ICT for sustainable development, blockchain, e-commerce, and e-government.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信