In search of hate speech in Lithuanian public discourse: A corpus-assisted analysis of online comments

Q2 Arts and Humanities
Jurate Ruzaite
{"title":"In search of hate speech in Lithuanian public discourse: A corpus-assisted analysis of online comments","authors":"Jurate Ruzaite","doi":"10.1515/lpp-2018-0005","DOIUrl":null,"url":null,"abstract":"Abstract The present paper aims to report on the preliminary findings from the initial stages of ongoing research on hate speech in Lithuanian online comments. Comments are marked strongly by such phenomena as flaming and trolling; therefore, in this genre we can expect a high degree of hostility, obscenity, high incidence of insults and aggressive lexis, which can inflict harm to individuals or organizations. The goal of the current research is thus to make an attempt to identify some features of verbal aggression in Lithuanian by applying the principles and instruments of corpus linguistics, which proved to be a useful approach when dealing with such issues as trolling. It is expected that further analysis of those features will help to identify and define formal linguistic criteria that could facilitate identification of hate speech in public discourse. The data has been obtained from the Lithuanian corpus of user-generated comments collected from one major Lithuanian portal, www.delfi.lt. The corpus consists of all the comments posted in the year 2014 and in total includes 17,909 comments, which make up 1,160,109 words. For the initial data analysis, linguistic aspects, such as wordlists, collocations, and formulaic language, were analysed by using the AntConc software. The interpretations of the results are still very tentative, but what the initial findings show is that overt aggression does not feature among the most frequent and most salient features of comments. Aggression is, in our data, indirectly expressed through creative language use, which can mainly be studied through qualitative analysis.","PeriodicalId":39423,"journal":{"name":"Lodz Papers in Pragmatics","volume":"14 1","pages":"116 - 93"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/lpp-2018-0005","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lodz Papers in Pragmatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/lpp-2018-0005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Arts and Humanities","Score":null,"Total":0}
引用次数: 6

Abstract

Abstract The present paper aims to report on the preliminary findings from the initial stages of ongoing research on hate speech in Lithuanian online comments. Comments are marked strongly by such phenomena as flaming and trolling; therefore, in this genre we can expect a high degree of hostility, obscenity, high incidence of insults and aggressive lexis, which can inflict harm to individuals or organizations. The goal of the current research is thus to make an attempt to identify some features of verbal aggression in Lithuanian by applying the principles and instruments of corpus linguistics, which proved to be a useful approach when dealing with such issues as trolling. It is expected that further analysis of those features will help to identify and define formal linguistic criteria that could facilitate identification of hate speech in public discourse. The data has been obtained from the Lithuanian corpus of user-generated comments collected from one major Lithuanian portal, www.delfi.lt. The corpus consists of all the comments posted in the year 2014 and in total includes 17,909 comments, which make up 1,160,109 words. For the initial data analysis, linguistic aspects, such as wordlists, collocations, and formulaic language, were analysed by using the AntConc software. The interpretations of the results are still very tentative, but what the initial findings show is that overt aggression does not feature among the most frequent and most salient features of comments. Aggression is, in our data, indirectly expressed through creative language use, which can mainly be studied through qualitative analysis.
寻找立陶宛公共话语中的仇恨言论:在线评论的语料库辅助分析
摘要本文旨在报告立陶宛网络评论中仇恨言论研究的初步结果。评论带有强烈的标记,如火焰和巨魔;因此,在这一类型中,我们可以期待高度的敌意、淫秽、侮辱和攻击性词汇的高发生率,这可能会对个人或组织造成伤害。因此,本研究的目的是试图通过应用语料库语言学的原则和工具来识别立陶宛语中言语攻击的一些特征,这被证明是处理诸如巨魔等问题的有用方法。预计对这些特征的进一步分析将有助于确定和定义正式的语言标准,从而有助于识别公共话语中的仇恨言论。数据来自立陶宛一家主要门户网站www.delfi.lt收集的用户生成评论的立陶宛语料库。该语料库由2014年发布的所有评论组成,共包括17909条评论,共1160109个单词。在最初的数据分析中,使用AntConc软件对语言方面进行了分析,如单词表、搭配和公式化语言。对结果的解释仍然是非常试探性的,但初步发现表明,公开的攻击性并不是评论中最常见和最显著的特征。在我们的数据中,攻击性是通过创造性的语言使用间接表达的,这主要可以通过定性分析来研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Lodz Papers in Pragmatics
Lodz Papers in Pragmatics Arts and Humanities-Language and Linguistics
CiteScore
1.10
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信