In search of hate speech in Lithuanian public discourse: A corpus-assisted analysis of online comments

Q2 Arts and Humanities

Lodz Papers in Pragmatics Pub Date : 2018-06-26 DOI:10.1515/lpp-2018-0005

Jurate Ruzaite

{"title":"In search of hate speech in Lithuanian public discourse: A corpus-assisted analysis of online comments","authors":"Jurate Ruzaite","doi":"10.1515/lpp-2018-0005","DOIUrl":null,"url":null,"abstract":"Abstract The present paper aims to report on the preliminary findings from the initial stages of ongoing research on hate speech in Lithuanian online comments. Comments are marked strongly by such phenomena as flaming and trolling; therefore, in this genre we can expect a high degree of hostility, obscenity, high incidence of insults and aggressive lexis, which can inflict harm to individuals or organizations. The goal of the current research is thus to make an attempt to identify some features of verbal aggression in Lithuanian by applying the principles and instruments of corpus linguistics, which proved to be a useful approach when dealing with such issues as trolling. It is expected that further analysis of those features will help to identify and define formal linguistic criteria that could facilitate identification of hate speech in public discourse. The data has been obtained from the Lithuanian corpus of user-generated comments collected from one major Lithuanian portal, www.delfi.lt. The corpus consists of all the comments posted in the year 2014 and in total includes 17,909 comments, which make up 1,160,109 words. For the initial data analysis, linguistic aspects, such as wordlists, collocations, and formulaic language, were analysed by using the AntConc software. The interpretations of the results are still very tentative, but what the initial findings show is that overt aggression does not feature among the most frequent and most salient features of comments. Aggression is, in our data, indirectly expressed through creative language use, which can mainly be studied through qualitative analysis.","PeriodicalId":39423,"journal":{"name":"Lodz Papers in Pragmatics","volume":"14 1","pages":"116 - 93"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/lpp-2018-0005","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lodz Papers in Pragmatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/lpp-2018-0005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Arts and Humanities","Score":null,"Total":0}

引用次数: 6

Abstract

Abstract The present paper aims to report on the preliminary findings from the initial stages of ongoing research on hate speech in Lithuanian online comments. Comments are marked strongly by such phenomena as flaming and trolling; therefore, in this genre we can expect a high degree of hostility, obscenity, high incidence of insults and aggressive lexis, which can inflict harm to individuals or organizations. The goal of the current research is thus to make an attempt to identify some features of verbal aggression in Lithuanian by applying the principles and instruments of corpus linguistics, which proved to be a useful approach when dealing with such issues as trolling. It is expected that further analysis of those features will help to identify and define formal linguistic criteria that could facilitate identification of hate speech in public discourse. The data has been obtained from the Lithuanian corpus of user-generated comments collected from one major Lithuanian portal, www.delfi.lt. The corpus consists of all the comments posted in the year 2014 and in total includes 17,909 comments, which make up 1,160,109 words. For the initial data analysis, linguistic aspects, such as wordlists, collocations, and formulaic language, were analysed by using the AntConc software. The interpretations of the results are still very tentative, but what the initial findings show is that overt aggression does not feature among the most frequent and most salient features of comments. Aggression is, in our data, indirectly expressed through creative language use, which can mainly be studied through qualitative analysis.

查看原文本刊更多论文

寻找立陶宛公共话语中的仇恨言论:在线评论的语料库辅助分析

摘要本文旨在报告立陶宛网络评论中仇恨言论研究的初步结果。评论带有强烈的标记，如火焰和巨魔；因此，在这一类型中，我们可以期待高度的敌意、淫秽、侮辱和攻击性词汇的高发生率，这可能会对个人或组织造成伤害。因此，本研究的目的是试图通过应用语料库语言学的原则和工具来识别立陶宛语中言语攻击的一些特征，这被证明是处理诸如巨魔等问题的有用方法。预计对这些特征的进一步分析将有助于确定和定义正式的语言标准，从而有助于识别公共话语中的仇恨言论。数据来自立陶宛一家主要门户网站www.delfi.lt收集的用户生成评论的立陶宛语料库。该语料库由2014年发布的所有评论组成，共包括17909条评论，共1160109个单词。在最初的数据分析中，使用AntConc软件对语言方面进行了分析，如单词表、搭配和公式化语言。对结果的解释仍然是非常试探性的，但初步发现表明，公开的攻击性并不是评论中最常见和最显著的特征。在我们的数据中，攻击性是通过创造性的语言使用间接表达的，这主要可以通过定性分析来研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Lodz Papers in Pragmatics Arts and Humanities-Language and Linguistics

CiteScore

1.10

自引率

0.00%

发文量