犯罪信息检索中情感分析的机器学习方法

T. Mantoro, M. A. Ayu, R. Handayanto
{"title":"犯罪信息检索中情感分析的机器学习方法","authors":"T. Mantoro, M. A. Ayu, R. Handayanto","doi":"10.1109/IC2IE50715.2020.9274607","DOIUrl":null,"url":null,"abstract":"Crime information is usually announced periodically. To get the information in real-time, the information should be retrieved through the information retrieval system automatically. The system should choose some appropriate keywords for retrieving the crime data. Eight keywords have been chosen which represented the most viral topic. The keywords in this study were analyzed regarding their sentiment from the hashtags in twitter posts. The Machine Learning algorithms were utilised such as Multinomal Naive Bayes, Random Forest Classifier, Linear SVM, and Nearestneighborhood (kNN) finding a better classifier. Sentiments, both positive and negative, are usually have been used by the website content designer to attract the reader. Most keywords showed negative sentiment which showed the negative reaction from the people. The sentiment analysis in Bahasa Indonesia also useful for understanding the people’s view on the types of crime as well as for keyword selection in the crime information retrieval system. As the result, near-repeat crime effect as a condition where criminal activity tends to repeat in the near place and time, can be predicted.","PeriodicalId":211983,"journal":{"name":"2020 3rd International Conference on Computer and Informatics Engineering (IC2IE)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Machine Learning Approach for Sentiment Analysis in Crime Information Retrieval\",\"authors\":\"T. Mantoro, M. A. Ayu, R. Handayanto\",\"doi\":\"10.1109/IC2IE50715.2020.9274607\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crime information is usually announced periodically. To get the information in real-time, the information should be retrieved through the information retrieval system automatically. The system should choose some appropriate keywords for retrieving the crime data. Eight keywords have been chosen which represented the most viral topic. The keywords in this study were analyzed regarding their sentiment from the hashtags in twitter posts. The Machine Learning algorithms were utilised such as Multinomal Naive Bayes, Random Forest Classifier, Linear SVM, and Nearestneighborhood (kNN) finding a better classifier. Sentiments, both positive and negative, are usually have been used by the website content designer to attract the reader. Most keywords showed negative sentiment which showed the negative reaction from the people. The sentiment analysis in Bahasa Indonesia also useful for understanding the people’s view on the types of crime as well as for keyword selection in the crime information retrieval system. As the result, near-repeat crime effect as a condition where criminal activity tends to repeat in the near place and time, can be predicted.\",\"PeriodicalId\":211983,\"journal\":{\"name\":\"2020 3rd International Conference on Computer and Informatics Engineering (IC2IE)\",\"volume\":\"76 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 3rd International Conference on Computer and Informatics Engineering (IC2IE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC2IE50715.2020.9274607\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 3rd International Conference on Computer and Informatics Engineering (IC2IE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC2IE50715.2020.9274607","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

犯罪信息通常定期公布。为了实现信息的实时获取,需要通过信息检索系统对信息进行自动检索。系统应该选择一些合适的关键词来检索犯罪数据。8个关键词被选出来代表最具病毒性的话题。本研究中的关键词是根据twitter帖子中的标签来分析他们的情绪的。利用多项朴素贝叶斯、随机森林分类器、线性支持向量机和最近邻(kNN)等机器学习算法寻找更好的分类器。情绪,积极的和消极的,通常已经被网站内容设计师用来吸引读者。大多数关键词都表现出负面情绪,反映了人们的负面反应。印尼语的情感分析也有助于了解人们对犯罪类型的看法,以及在犯罪信息检索系统中选择关键字。因此,近似重复犯罪效应是指犯罪活动在最近的地点和时间内倾向于重复的一种情况,可以预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning Approach for Sentiment Analysis in Crime Information Retrieval
Crime information is usually announced periodically. To get the information in real-time, the information should be retrieved through the information retrieval system automatically. The system should choose some appropriate keywords for retrieving the crime data. Eight keywords have been chosen which represented the most viral topic. The keywords in this study were analyzed regarding their sentiment from the hashtags in twitter posts. The Machine Learning algorithms were utilised such as Multinomal Naive Bayes, Random Forest Classifier, Linear SVM, and Nearestneighborhood (kNN) finding a better classifier. Sentiments, both positive and negative, are usually have been used by the website content designer to attract the reader. Most keywords showed negative sentiment which showed the negative reaction from the people. The sentiment analysis in Bahasa Indonesia also useful for understanding the people’s view on the types of crime as well as for keyword selection in the crime information retrieval system. As the result, near-repeat crime effect as a condition where criminal activity tends to repeat in the near place and time, can be predicted.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信