乌尔都语推文中的暴力观点检测

Muhammad Hammad Akram, Khurram Shahzad
{"title":"乌尔都语推文中的暴力观点检测","authors":"Muhammad Hammad Akram, Khurram Shahzad","doi":"10.1109/ICOSST53930.2021.9683934","DOIUrl":null,"url":null,"abstract":"The widespread use of social media has led to substantial increase in the global connectivity. Consequently, the content shared on social media has the potential to become viral in a short span of time. While some content is desired to become viral, there is a high risk that the inappropriate messages can also become viral which could be disastrous for the society. For instance, spreading violent views may lead to riots and unrest in the society. Therefore, it is desired to detect violent views to ensure stopping them from spreading. To that end, this study has scrapped Twitter to develop and publicly release the first-ever Violent Views Detection corpus for Urdu (VVD-21). The corpus is composed of 3297 Urdu tweets which are manually classified into Violent and Non-Violent views. Furthermore, experiments are performed using six traditional and two deep learning techniques to evaluate their effectiveness of these techniques for automatically detecting violent views in Urdu text. The results of the experiments show that Logistic Regression is the most effective technique as it achieved the highest F1 score of 0.881.","PeriodicalId":325357,"journal":{"name":"2021 15th International Conference on Open Source Systems and Technologies (ICOSST)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Violent Views Detection in Urdu Tweets\",\"authors\":\"Muhammad Hammad Akram, Khurram Shahzad\",\"doi\":\"10.1109/ICOSST53930.2021.9683934\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The widespread use of social media has led to substantial increase in the global connectivity. Consequently, the content shared on social media has the potential to become viral in a short span of time. While some content is desired to become viral, there is a high risk that the inappropriate messages can also become viral which could be disastrous for the society. For instance, spreading violent views may lead to riots and unrest in the society. Therefore, it is desired to detect violent views to ensure stopping them from spreading. To that end, this study has scrapped Twitter to develop and publicly release the first-ever Violent Views Detection corpus for Urdu (VVD-21). The corpus is composed of 3297 Urdu tweets which are manually classified into Violent and Non-Violent views. Furthermore, experiments are performed using six traditional and two deep learning techniques to evaluate their effectiveness of these techniques for automatically detecting violent views in Urdu text. The results of the experiments show that Logistic Regression is the most effective technique as it achieved the highest F1 score of 0.881.\",\"PeriodicalId\":325357,\"journal\":{\"name\":\"2021 15th International Conference on Open Source Systems and Technologies (ICOSST)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 15th International Conference on Open Source Systems and Technologies (ICOSST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOSST53930.2021.9683934\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 15th International Conference on Open Source Systems and Technologies (ICOSST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSST53930.2021.9683934","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

社交媒体的广泛使用导致全球连通性大幅提高。因此,在社交媒体上分享的内容有可能在短时间内成为病毒式传播。虽然有些内容希望成为病毒,但不适当的信息也很有可能成为病毒,这对社会来说可能是灾难性的。例如,传播暴力观点可能会导致社会的骚乱和动荡。因此,需要发现暴力观点,以确保阻止其传播。为此,这项研究放弃了Twitter,开发并公开发布了第一个乌尔都语暴力观点检测语料库(VVD-21)。语料库由3297条乌尔都语推文组成,这些推文被手动分为暴力和非暴力观点。此外,使用六种传统和两种深度学习技术进行了实验,以评估这些技术在自动检测乌尔都语文本中的暴力观点方面的有效性。实验结果表明,Logistic回归是最有效的方法,其F1得分最高,为0.881。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Violent Views Detection in Urdu Tweets
The widespread use of social media has led to substantial increase in the global connectivity. Consequently, the content shared on social media has the potential to become viral in a short span of time. While some content is desired to become viral, there is a high risk that the inappropriate messages can also become viral which could be disastrous for the society. For instance, spreading violent views may lead to riots and unrest in the society. Therefore, it is desired to detect violent views to ensure stopping them from spreading. To that end, this study has scrapped Twitter to develop and publicly release the first-ever Violent Views Detection corpus for Urdu (VVD-21). The corpus is composed of 3297 Urdu tweets which are manually classified into Violent and Non-Violent views. Furthermore, experiments are performed using six traditional and two deep learning techniques to evaluate their effectiveness of these techniques for automatically detecting violent views in Urdu text. The results of the experiments show that Logistic Regression is the most effective technique as it achieved the highest F1 score of 0.881.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信