Mykola Makhortykh, Ernesto de León, Clara Christner, Maryna Sydorova, Aleksandra Urman, Silke Adam, Michaela Maier, Teresa Gil-Lopez
{"title":"Is a single model enough? The systematic comparison of computational approaches for detecting populist radical right content.","authors":"Mykola Makhortykh, Ernesto de León, Clara Christner, Maryna Sydorova, Aleksandra Urman, Silke Adam, Michaela Maier, Teresa Gil-Lopez","doi":"10.1007/s11135-024-02034-1","DOIUrl":null,"url":null,"abstract":"<p><p>The rise of populist radical right (PRR) ideas stresses the importance of understanding how individuals engage with PRR content online. However, this task is complicated by the variety of channels through which such engagement can take place. In this article, we systematically compare computational approaches for detecting PRR content in textual data. Using 66 dictionary, classic supervised machine learning, and deep learning (DL) models, we compare how these distinct approaches perform on the PRR detection task for three Germanophone test datasets and how their performance is affected by different modes of text preprocessing. In addition to individual models, we examine the performance of 330 ensemble models combining the above-mentioned approaches for the dataset with a particularly high volume of noise. Our findings demonstrate that the DL models, in combination with more computationally intense forms of preprocessing, show the best performance among the individual models, but it remains suboptimal in the case of more noisy datasets. While the use of ensemble models shows some improvement for specific modes of preprocessing, overall, it mostly remains on par with individual DL models, thus stressing the challenging nature of computational detection of PRR content.</p>","PeriodicalId":49649,"journal":{"name":"Quality & Quantity","volume":"59 Suppl 2","pages":"1163-1207"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12055619/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quality & Quantity","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11135-024-02034-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/29 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0
Abstract
The rise of populist radical right (PRR) ideas stresses the importance of understanding how individuals engage with PRR content online. However, this task is complicated by the variety of channels through which such engagement can take place. In this article, we systematically compare computational approaches for detecting PRR content in textual data. Using 66 dictionary, classic supervised machine learning, and deep learning (DL) models, we compare how these distinct approaches perform on the PRR detection task for three Germanophone test datasets and how their performance is affected by different modes of text preprocessing. In addition to individual models, we examine the performance of 330 ensemble models combining the above-mentioned approaches for the dataset with a particularly high volume of noise. Our findings demonstrate that the DL models, in combination with more computationally intense forms of preprocessing, show the best performance among the individual models, but it remains suboptimal in the case of more noisy datasets. While the use of ensemble models shows some improvement for specific modes of preprocessing, overall, it mostly remains on par with individual DL models, thus stressing the challenging nature of computational detection of PRR content.
期刊介绍:
Quality and Quantity constitutes a point of reference for European and non-European scholars to discuss instruments of methodology for more rigorous scientific results in the social sciences. In the era of biggish data, the journal also provides a publication venue for data scientists who are interested in proposing a new indicator to measure the latent aspects of social, cultural, and political events. Rather than leaning towards one specific methodological school, the journal publishes papers on a mixed method of quantitative and qualitative data. Furthermore, the journal’s key aim is to tackle some methodological pluralism across research cultures. In this context, the journal is open to papers addressing some general logic of empirical research and analysis of the validity and verification of social laws. Thus The journal accepts papers on science metrics and publication ethics and, their related issues affecting methodological practices among researchers.
Quality and Quantity is an interdisciplinary journal which systematically correlates disciplines such as data and information sciences with the other humanities and social sciences. The journal extends discussion of interesting contributions in methodology to scholars worldwide, to promote the scientific development of social research.