标签文本的“超级无监督”分类：以网络政治敌意为例

IF 4.6 1区社会学 Q1 POLITICAL SCIENCE

British Journal of Political Science Pub Date : 2023-04-24 DOI:10.1017/s0007123423000042

Stig Hebbelstrup Rye Rasmussen, A. Bor, Mathias Osmundsen, M. B. Petersen

{"title":"标签文本的“超级无监督”分类：以网络政治敌意为例","authors":"Stig Hebbelstrup Rye Rasmussen, A. Bor, Mathias Osmundsen, M. B. Petersen","doi":"10.1017/s0007123423000042","DOIUrl":null,"url":null,"abstract":"\n We live in a world of text. Yet the sheer magnitude of social media data, coupled with a need to measure complex psychological constructs, has made this important source of data difficult to use. Researchers often engage in costly hand coding of thousands of texts using supervised techniques or rely on unsupervised techniques where the measurement of predefined constructs is difficult. We propose a novel approach that we call ‘super-unsupervised’ learning and demonstrate its usefulness by measuring the psychologically complex construct of online political hostility based on a large corpus of tweets. This approach accomplishes the feat by combining the best features of supervised and unsupervised learning techniques: measurements of complex psychological constructs without a single labelled data source. We first outline the approach before conducting a diverse series of tests that include: (i) face validity, (ii) convergent and discriminant validity, (iii) criterion validity, (iv) external validity, and (v) ecological validity.","PeriodicalId":48301,"journal":{"name":"British Journal of Political Science","volume":" ","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2023-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"‘Super-Unsupervised’ Classification for Labelling Text: Online Political Hostility as an Illustration\",\"authors\":\"Stig Hebbelstrup Rye Rasmussen, A. Bor, Mathias Osmundsen, M. B. Petersen\",\"doi\":\"10.1017/s0007123423000042\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n We live in a world of text. Yet the sheer magnitude of social media data, coupled with a need to measure complex psychological constructs, has made this important source of data difficult to use. Researchers often engage in costly hand coding of thousands of texts using supervised techniques or rely on unsupervised techniques where the measurement of predefined constructs is difficult. We propose a novel approach that we call ‘super-unsupervised’ learning and demonstrate its usefulness by measuring the psychologically complex construct of online political hostility based on a large corpus of tweets. This approach accomplishes the feat by combining the best features of supervised and unsupervised learning techniques: measurements of complex psychological constructs without a single labelled data source. We first outline the approach before conducting a diverse series of tests that include: (i) face validity, (ii) convergent and discriminant validity, (iii) criterion validity, (iv) external validity, and (v) ecological validity.\",\"PeriodicalId\":48301,\"journal\":{\"name\":\"British Journal of Political Science\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2023-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"British Journal of Political Science\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1017/s0007123423000042\",\"RegionNum\":1,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"POLITICAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Political Science","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1017/s0007123423000042","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLITICAL SCIENCE","Score":null,"Total":0}

引用次数: 1

摘要

我们生活在一个文本的世界里。然而，社交媒体数据的巨大规模，加上需要衡量复杂的心理结构，使得这一重要的数据来源难以使用。研究人员经常使用监督技术对数千篇文本进行昂贵的手工编码，或者在难以测量预定义结构的情况下依赖于无监督技术。我们提出了一种新的方法，我们称之为“超级无监督”学习，并通过测量基于大量推文的网络政治敌意的心理复杂结构来证明其有用性。这种方法通过结合有监督和无监督学习技术的最佳特征来实现这一壮举：在没有单个标记数据源的情况下测量复杂的心理结构。在进行一系列不同的测试之前，我们首先概述了该方法，这些测试包括：（i）面部有效性，（ii）收敛和判别有效性，以及（iii）标准有效性、（iv）外部有效性和（v）生态有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

‘Super-Unsupervised’ Classification for Labelling Text: Online Political Hostility as an Illustration

We live in a world of text. Yet the sheer magnitude of social media data, coupled with a need to measure complex psychological constructs, has made this important source of data difficult to use. Researchers often engage in costly hand coding of thousands of texts using supervised techniques or rely on unsupervised techniques where the measurement of predefined constructs is difficult. We propose a novel approach that we call ‘super-unsupervised’ learning and demonstrate its usefulness by measuring the psychologically complex construct of online political hostility based on a large corpus of tweets. This approach accomplishes the feat by combining the best features of supervised and unsupervised learning techniques: measurements of complex psychological constructs without a single labelled data source. We first outline the approach before conducting a diverse series of tests that include: (i) face validity, (ii) convergent and discriminant validity, (iii) criterion validity, (iv) external validity, and (v) ecological validity.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

British Journal of Political Science POLITICAL SCIENCE-

CiteScore

8.70

自引率

4.00%

发文量

期刊介绍： The British Journal of Political Science is a broadly based journal aiming to cover developments across a wide range of countries and specialisms. Contributions are drawn from all fields of political science (including political theory, political behaviour, public policy and international relations), and articles from scholars in related disciplines (sociology, social psychology, economics and philosophy) appear frequently. With a reputation established over nearly 40 years of publication, the British Journal of Political Science is widely recognised as one of the premier journals in its field.