{"title":"Hate Speech Detection using ML algorithms","authors":"Aditya Razdan, S. S","doi":"10.1109/aimv53313.2021.9670987","DOIUrl":null,"url":null,"abstract":"Social media is a growing platform where different users share their ideas and sentiments towards different topics because users spend a lot of time expressing their thoughts and views. There are various researches going on in detecting the sentiments of the user’s comments but the main sentiment factor remain undiagnosed. In this paper, the aim is to detect hate speeches. The dataset was preprocessed and cleaned and cleaned text was explored to get a better understanding. Salient features were extracted from the data to train our model and to identify the hate sentiments of tweets. The vector model is created using genism to learn the relationship between words and based on that sentence are labeled. Stop words and port stemmer are used to filter unwanted data to build the vocabulary using CountVectorizer before it is used for model building. Using various machine algorithms, comparative study is done to check the performance of algorithms and promising results are attained.","PeriodicalId":135318,"journal":{"name":"2021 International Conference on Artificial Intelligence and Machine Vision (AIMV)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Artificial Intelligence and Machine Vision (AIMV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/aimv53313.2021.9670987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Social media is a growing platform where different users share their ideas and sentiments towards different topics because users spend a lot of time expressing their thoughts and views. There are various researches going on in detecting the sentiments of the user’s comments but the main sentiment factor remain undiagnosed. In this paper, the aim is to detect hate speeches. The dataset was preprocessed and cleaned and cleaned text was explored to get a better understanding. Salient features were extracted from the data to train our model and to identify the hate sentiments of tweets. The vector model is created using genism to learn the relationship between words and based on that sentence are labeled. Stop words and port stemmer are used to filter unwanted data to build the vocabulary using CountVectorizer before it is used for model building. Using various machine algorithms, comparative study is done to check the performance of algorithms and promising results are attained.