‘Super-Unsupervised’ Classification for Labelling Text: Online Political Hostility as an Illustration

IF 4.6 1区 社会学 Q1 POLITICAL SCIENCE
Stig Hebbelstrup Rye Rasmussen, A. Bor, Mathias Osmundsen, M. B. Petersen
{"title":"‘Super-Unsupervised’ Classification for Labelling Text: Online Political Hostility as an Illustration","authors":"Stig Hebbelstrup Rye Rasmussen, A. Bor, Mathias Osmundsen, M. B. Petersen","doi":"10.1017/s0007123423000042","DOIUrl":null,"url":null,"abstract":"\n We live in a world of text. Yet the sheer magnitude of social media data, coupled with a need to measure complex psychological constructs, has made this important source of data difficult to use. Researchers often engage in costly hand coding of thousands of texts using supervised techniques or rely on unsupervised techniques where the measurement of predefined constructs is difficult. We propose a novel approach that we call ‘super-unsupervised’ learning and demonstrate its usefulness by measuring the psychologically complex construct of online political hostility based on a large corpus of tweets. This approach accomplishes the feat by combining the best features of supervised and unsupervised learning techniques: measurements of complex psychological constructs without a single labelled data source. We first outline the approach before conducting a diverse series of tests that include: (i) face validity, (ii) convergent and discriminant validity, (iii) criterion validity, (iv) external validity, and (v) ecological validity.","PeriodicalId":48301,"journal":{"name":"British Journal of Political Science","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2023-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Political Science","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1017/s0007123423000042","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLITICAL SCIENCE","Score":null,"Total":0}
引用次数: 1

Abstract

We live in a world of text. Yet the sheer magnitude of social media data, coupled with a need to measure complex psychological constructs, has made this important source of data difficult to use. Researchers often engage in costly hand coding of thousands of texts using supervised techniques or rely on unsupervised techniques where the measurement of predefined constructs is difficult. We propose a novel approach that we call ‘super-unsupervised’ learning and demonstrate its usefulness by measuring the psychologically complex construct of online political hostility based on a large corpus of tweets. This approach accomplishes the feat by combining the best features of supervised and unsupervised learning techniques: measurements of complex psychological constructs without a single labelled data source. We first outline the approach before conducting a diverse series of tests that include: (i) face validity, (ii) convergent and discriminant validity, (iii) criterion validity, (iv) external validity, and (v) ecological validity.
标签文本的“超级无监督”分类:以网络政治敌意为例
我们生活在一个文本的世界里。然而,社交媒体数据的巨大规模,加上需要衡量复杂的心理结构,使得这一重要的数据来源难以使用。研究人员经常使用监督技术对数千篇文本进行昂贵的手工编码,或者在难以测量预定义结构的情况下依赖于无监督技术。我们提出了一种新的方法,我们称之为“超级无监督”学习,并通过测量基于大量推文的网络政治敌意的心理复杂结构来证明其有用性。这种方法通过结合有监督和无监督学习技术的最佳特征来实现这一壮举:在没有单个标记数据源的情况下测量复杂的心理结构。在进行一系列不同的测试之前,我们首先概述了该方法,这些测试包括:(i)面部有效性,(ii)收敛和判别有效性,以及(iii)标准有效性、(iv)外部有效性和(v)生态有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.70
自引率
4.00%
发文量
64
期刊介绍: The British Journal of Political Science is a broadly based journal aiming to cover developments across a wide range of countries and specialisms. Contributions are drawn from all fields of political science (including political theory, political behaviour, public policy and international relations), and articles from scholars in related disciplines (sociology, social psychology, economics and philosophy) appear frequently. With a reputation established over nearly 40 years of publication, the British Journal of Political Science is widely recognised as one of the premier journals in its field.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信