{"title":"我们一起努力有效处理宣传相关任务的路线图","authors":"Raquel Rodríguez-García, Roberto Centeno, Álvaro Rodrigo","doi":"10.1108/intr-05-2024-0785","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>In this paper, we address the need to study automatic propaganda detection to establish a course of action when faced with such a complex task. Although many isolated tasks have been proposed, a roadmap on how to best approach a new task from the perspective of text formality or the leverage of existing resources has not been explored yet.</p><!--/ Abstract__block -->\n<h3>Design/methodology/approach</h3>\n<p>We present a comprehensive study using several datasets on textual propaganda and different techniques to tackle it. We explore diverse collections with varied characteristics and analyze methodologies, from classic machine learning algorithms, to multi-task learning to utilize the available data in such models.</p><!--/ Abstract__block -->\n<h3>Findings</h3>\n<p>Our results show that transformer-based approaches are the best option with high-quality collections, and emotionally enriched inputs improve the results for Twitter content. Additionally, MTL achieves the best results in two of the five scenarios we analyzed. Notably, in one of the scenarios, the model achieves an F1 score of 0.78, significantly surpassing the transformer baseline model’s F1 score of 0.68.</p><!--/ Abstract__block -->\n<h3>Research limitations/implications</h3>\n<p>After finding a positive impact when leveraging propaganda’s emotional content, we propose further research into exploiting other complex dimensions, such as moral issues or logical reasoning.</p><!--/ Abstract__block -->\n<h3>Originality/value</h3>\n<p>Based on our findings, we provide a roadmap for tackling propaganda-related tasks, depending on the types of training data available and the task to solve. This includes the application of MTL, which has yet to be fully exploited in propaganda detection.</p><!--/ Abstract__block -->","PeriodicalId":54925,"journal":{"name":"Internet Research","volume":null,"pages":null},"PeriodicalIF":5.9000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Together we can do it! A roadmap to effectively tackle propaganda-related tasks\",\"authors\":\"Raquel Rodríguez-García, Roberto Centeno, Álvaro Rodrigo\",\"doi\":\"10.1108/intr-05-2024-0785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3>Purpose</h3>\\n<p>In this paper, we address the need to study automatic propaganda detection to establish a course of action when faced with such a complex task. Although many isolated tasks have been proposed, a roadmap on how to best approach a new task from the perspective of text formality or the leverage of existing resources has not been explored yet.</p><!--/ Abstract__block -->\\n<h3>Design/methodology/approach</h3>\\n<p>We present a comprehensive study using several datasets on textual propaganda and different techniques to tackle it. We explore diverse collections with varied characteristics and analyze methodologies, from classic machine learning algorithms, to multi-task learning to utilize the available data in such models.</p><!--/ Abstract__block -->\\n<h3>Findings</h3>\\n<p>Our results show that transformer-based approaches are the best option with high-quality collections, and emotionally enriched inputs improve the results for Twitter content. Additionally, MTL achieves the best results in two of the five scenarios we analyzed. Notably, in one of the scenarios, the model achieves an F1 score of 0.78, significantly surpassing the transformer baseline model’s F1 score of 0.68.</p><!--/ Abstract__block -->\\n<h3>Research limitations/implications</h3>\\n<p>After finding a positive impact when leveraging propaganda’s emotional content, we propose further research into exploiting other complex dimensions, such as moral issues or logical reasoning.</p><!--/ Abstract__block -->\\n<h3>Originality/value</h3>\\n<p>Based on our findings, we provide a roadmap for tackling propaganda-related tasks, depending on the types of training data available and the task to solve. This includes the application of MTL, which has yet to be fully exploited in propaganda detection.</p><!--/ Abstract__block -->\",\"PeriodicalId\":54925,\"journal\":{\"name\":\"Internet Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.9000,\"publicationDate\":\"2024-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet Research\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1108/intr-05-2024-0785\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BUSINESS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet Research","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1108/intr-05-2024-0785","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 0
摘要
目的在本文中,我们探讨了研究自动宣传检测的必要性,以便在面对如此复杂的任务时确定行动方案。虽然已经提出了许多孤立的任务,但如何从文本形式或利用现有资源的角度以最佳方式完成一项新任务的路线图尚未得到探索。设计/方法/途径我们使用多个文本宣传数据集和不同的技术进行了一项综合研究。我们探索了具有不同特征的各种数据集,并分析了从经典机器学习算法到多任务学习的各种方法,以便在此类模型中利用可用数据。研究结果我们的研究结果表明,基于转换器的方法是高质量数据集的最佳选择,而情感丰富的输入可改善 Twitter 内容的结果。此外,在我们分析的五个场景中,MTL 在两个场景中取得了最佳结果。值得注意的是,在其中一个场景中,该模型的 F1 得分为 0.78,大大超过了转换器基线模型 0.68 的 F1 得分。研究局限/启示在发现利用宣传的情感内容会产生积极影响后,我们建议进一步研究利用其他复杂维度,如道德问题或逻辑推理。其中包括 MTL 的应用,该方法在宣传检测中尚未得到充分利用。
Together we can do it! A roadmap to effectively tackle propaganda-related tasks
Purpose
In this paper, we address the need to study automatic propaganda detection to establish a course of action when faced with such a complex task. Although many isolated tasks have been proposed, a roadmap on how to best approach a new task from the perspective of text formality or the leverage of existing resources has not been explored yet.
Design/methodology/approach
We present a comprehensive study using several datasets on textual propaganda and different techniques to tackle it. We explore diverse collections with varied characteristics and analyze methodologies, from classic machine learning algorithms, to multi-task learning to utilize the available data in such models.
Findings
Our results show that transformer-based approaches are the best option with high-quality collections, and emotionally enriched inputs improve the results for Twitter content. Additionally, MTL achieves the best results in two of the five scenarios we analyzed. Notably, in one of the scenarios, the model achieves an F1 score of 0.78, significantly surpassing the transformer baseline model’s F1 score of 0.68.
Research limitations/implications
After finding a positive impact when leveraging propaganda’s emotional content, we propose further research into exploiting other complex dimensions, such as moral issues or logical reasoning.
Originality/value
Based on our findings, we provide a roadmap for tackling propaganda-related tasks, depending on the types of training data available and the task to solve. This includes the application of MTL, which has yet to be fully exploited in propaganda detection.
期刊介绍:
This wide-ranging interdisciplinary journal looks at the social, ethical, economic and political implications of the internet. Recent issues have focused on online and mobile gaming, the sharing economy, and the dark side of social media.