Predicting the tolerance level of religious discourse through computational linguistics

2016 IEEE Systems and Information Engineering Design Symposium (SIEDS) Pub Date : 2016-04-29 DOI:10.1109/SIEDS.2016.7489320

Nicholas Venuti, Brian Sachtjen, Hope McIntyre, Chetan Mishra, M. Hays, Donald E. Brown

{"title":"Predicting the tolerance level of religious discourse through computational linguistics","authors":"Nicholas Venuti, Brian Sachtjen, Hope McIntyre, Chetan Mishra, M. Hays, Donald E. Brown","doi":"10.1109/SIEDS.2016.7489320","DOIUrl":null,"url":null,"abstract":"Religious violence is one of the biggest and most complicated problems facing the world today. The number of incidents has been increasing in recent years and, unfortunately, scalable and accurate systems to predict which groups are likely to engage in such actions are not keeping pace. Additionally, this problem is compounded by lingual and cultural differences, which limit the effectiveness of understanding how tolerant or intolerant a group is without bias. To circumvent this challenge, recent studies indicate promise in the analysis of the performative character of discourse (how words are used) to estimate the tolerance level, rather than using the semantic or emotive character of text (what the words mean or imply). Using expert estimates of linguistic flexibility, a representation of the performative character of text, and thus also predictive of a text's tolerance level, this paper describes (a) new approaches to automating the quantification of the performative character of words and (b) the predictive efficacy of these approaches versus traditional semantic indicators of tolerance or intolerance. To implement the pipeline, a judgment identifier was developed along with multiple semantic density algorithms to extract the frequency of judgments and flexibility of keyword contexts, respectively. Test results show that text mining algorithms can accurately estimate the language flexibility of religious discourse. These results provide evidence that the performative characteristics of language better predict tolerance level than the semantic characteristics of language.","PeriodicalId":426864,"journal":{"name":"2016 IEEE Systems and Information Engineering Design Symposium (SIEDS)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Systems and Information Engineering Design Symposium (SIEDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIEDS.2016.7489320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Religious violence is one of the biggest and most complicated problems facing the world today. The number of incidents has been increasing in recent years and, unfortunately, scalable and accurate systems to predict which groups are likely to engage in such actions are not keeping pace. Additionally, this problem is compounded by lingual and cultural differences, which limit the effectiveness of understanding how tolerant or intolerant a group is without bias. To circumvent this challenge, recent studies indicate promise in the analysis of the performative character of discourse (how words are used) to estimate the tolerance level, rather than using the semantic or emotive character of text (what the words mean or imply). Using expert estimates of linguistic flexibility, a representation of the performative character of text, and thus also predictive of a text's tolerance level, this paper describes (a) new approaches to automating the quantification of the performative character of words and (b) the predictive efficacy of these approaches versus traditional semantic indicators of tolerance or intolerance. To implement the pipeline, a judgment identifier was developed along with multiple semantic density algorithms to extract the frequency of judgments and flexibility of keyword contexts, respectively. Test results show that text mining algorithms can accurately estimate the language flexibility of religious discourse. These results provide evidence that the performative characteristics of language better predict tolerance level than the semantic characteristics of language.

查看原文本刊更多论文

用计算语言学预测宗教话语的容忍度

宗教暴力是当今世界面临的最大和最复杂的问题之一。近年来，此类事件的数量一直在增加，不幸的是，用于预测哪些组织可能参与此类行动的可扩展和准确的系统并没有跟上。此外，语言和文化差异使这个问题更加复杂，这限制了理解一个群体的宽容或不宽容程度的有效性。为了规避这一挑战，最近的研究表明，在分析话语的行为特征(词语是如何使用的)来估计容忍水平方面有希望，而不是使用文本的语义或情感特征(词语的意思或暗示)。本文利用专家对语言灵活性的估计，即文本的表现特征，从而也预测文本的容忍水平，描述了(a)自动化量化单词的表现特征的新方法，以及(b)这些方法与传统的容忍或不容忍语义指标的预测效果。为了实现该管道，开发了判断标识符以及多种语义密度算法，分别提取判断的频率和关键字上下文的灵活性。测试结果表明，文本挖掘算法可以准确地估计宗教话语的语言灵活性。这些结果证明语言的行为特征比语言的语义特征更能预测容忍水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 IEEE Systems and Information Engineering Design Symposium (SIEDS)

自引率

0.00%

发文量