Better, Faster, Stronger: Using Machine Learning to Analyse South African Police-recorded Protest Data

IF 0.5 Q4 SOCIOLOGY
M. Bekker
{"title":"Better, Faster, Stronger: Using Machine Learning to Analyse South African Police-recorded Protest Data","authors":"M. Bekker","doi":"10.1080/21528586.2021.1982762","DOIUrl":null,"url":null,"abstract":"ABSTRACT A long-important tool for quantitative analysis of protests, the potential power of Protest Event Analysis (PEA) has only increased with the rise of Machine Learning technologies and the ubiquity of big data. PEA coders also present an advantage over contemporary Natural Language Programming innovations by being customisable to incorporate locally appropriate terms and vernaculars, expressed as personalised ontologies. As such, there is a need to develop a standard process for deploying machine learning tools that can draw on the local. This paper introduces such a tool, innovating the numeration of abstract indicators. “Machine Learning Protest Event Analysis Keyword Enumerated Recoding” is a protocol that enables PEA coders to read and classify large “event databases”, incorporating local terms and abstract indicators into the analysis. Applying this protocol to 150,000 records in a police-recorded database of crowd events in South Africa, protest events could be individually rated by levels of “tumult”—a feat hitherto inhibited by conventional PEA methods. Innovations in estimating crowd sizes, as well as an updated view of post-apartheid protest, showing that protests tend to be more common but less prone to violence than previous theories concluded, speaks to the potential for this protocol to unearth novel insights on even bigger data sets.","PeriodicalId":44730,"journal":{"name":"South African Review of Sociology","volume":"67 1","pages":"4 - 23"},"PeriodicalIF":0.5000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"South African Review of Sociology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/21528586.2021.1982762","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"SOCIOLOGY","Score":null,"Total":0}
引用次数: 2

Abstract

ABSTRACT A long-important tool for quantitative analysis of protests, the potential power of Protest Event Analysis (PEA) has only increased with the rise of Machine Learning technologies and the ubiquity of big data. PEA coders also present an advantage over contemporary Natural Language Programming innovations by being customisable to incorporate locally appropriate terms and vernaculars, expressed as personalised ontologies. As such, there is a need to develop a standard process for deploying machine learning tools that can draw on the local. This paper introduces such a tool, innovating the numeration of abstract indicators. “Machine Learning Protest Event Analysis Keyword Enumerated Recoding” is a protocol that enables PEA coders to read and classify large “event databases”, incorporating local terms and abstract indicators into the analysis. Applying this protocol to 150,000 records in a police-recorded database of crowd events in South Africa, protest events could be individually rated by levels of “tumult”—a feat hitherto inhibited by conventional PEA methods. Innovations in estimating crowd sizes, as well as an updated view of post-apartheid protest, showing that protests tend to be more common but less prone to violence than previous theories concluded, speaks to the potential for this protocol to unearth novel insights on even bigger data sets.
更好,更快,更强:使用机器学习分析南非警方记录的抗议数据
作为长期以来重要的抗议定量分析工具,随着机器学习技术的兴起和大数据的无处不在,抗议事件分析(PEA)的潜在力量只会增加。与当代自然语言编程创新相比,PEA编码器还具有一个优势,即可定制,以个性化本体的形式表达本地适当的术语和方言。因此,有必要开发一个标准流程来部署可以利用本地资源的机器学习工具。本文介绍了这样一个工具,创新了抽象指标的计算方法。“机器学习抗议事件分析关键字枚举重编码”是一种协议,它使PEA编码器能够读取和分类大型“事件数据库”,并将局部术语和抽象指标纳入分析。将这一协议应用到南非警方记录的人群事件数据库中的15万份记录中,抗议事件可以按照“骚乱”的程度进行单独评级——这是迄今为止传统PEA方法所无法做到的。估计人群规模的创新,以及对后种族隔离抗议的更新看法,表明抗议活动往往比以前的理论结论更常见,但更不容易发生暴力,说明该协议有可能在更大的数据集上挖掘出新的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.90
自引率
25.00%
发文量
26
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信