{"title":"Data-driven approaches into political orientation and news outlet discrimination: The case of news articles in South Korea","authors":"Jungkyun Lee , Junyeop Cha , Eunil Park","doi":"10.1016/j.tele.2023.102066","DOIUrl":null,"url":null,"abstract":"<div><p>With the advancement of the internet, the public now has easy access to news from various media outlets. However, a number of news outlets tend to report their content based on their political orientations or affiliations, which may compromise the objectivity of the news. This research used machine learning to analyze whether it is possible to tell the political orientation and news outlet apart based on news articles in South Korea. We collected a lot of news articles spanning over five years and used the text data. We chose major conservative and progressive news outlets and tried classifying them into two groups. We even looked into classifying articles by each news outlet. We used different machine learning methods like Logistic Regression, Random Forest Classifier, and eXtreme Gradient Boosting, and tried to improve the performance by combining these models. The research found that the combined model had high accuracy, up to 91.9% for binary classification of news outlet political orientations and up to 84.0% for classifying news outlets in multiple categories. This shows that you can determine the political leaning of news outlets based on their articles, highlighting the importance of considering bias in news outlets when evaluating information instead of solely relying on the article content.</p></div>","PeriodicalId":48257,"journal":{"name":"Telematics and Informatics","volume":"85 ","pages":"Article 102066"},"PeriodicalIF":7.6000,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Telematics and Informatics","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736585323001302","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
With the advancement of the internet, the public now has easy access to news from various media outlets. However, a number of news outlets tend to report their content based on their political orientations or affiliations, which may compromise the objectivity of the news. This research used machine learning to analyze whether it is possible to tell the political orientation and news outlet apart based on news articles in South Korea. We collected a lot of news articles spanning over five years and used the text data. We chose major conservative and progressive news outlets and tried classifying them into two groups. We even looked into classifying articles by each news outlet. We used different machine learning methods like Logistic Regression, Random Forest Classifier, and eXtreme Gradient Boosting, and tried to improve the performance by combining these models. The research found that the combined model had high accuracy, up to 91.9% for binary classification of news outlet political orientations and up to 84.0% for classifying news outlets in multiple categories. This shows that you can determine the political leaning of news outlets based on their articles, highlighting the importance of considering bias in news outlets when evaluating information instead of solely relying on the article content.
期刊介绍:
Telematics and Informatics is an interdisciplinary journal that publishes cutting-edge theoretical and methodological research exploring the social, economic, geographic, political, and cultural impacts of digital technologies. It covers various application areas, such as smart cities, sensors, information fusion, digital society, IoT, cyber-physical technologies, privacy, knowledge management, distributed work, emergency response, mobile communications, health informatics, social media's psychosocial effects, ICT for sustainable development, blockchain, e-commerce, and e-government.