使用数据挖掘技术探测和预测犯罪:比较研究

S. Zahran, Eman M. Mohamed, Hamdy M. Mousa
{"title":"使用数据挖掘技术探测和预测犯罪:比较研究","authors":"S. Zahran, Eman M. Mohamed, Hamdy M. Mousa","doi":"10.21608/ijci.2021.207749","DOIUrl":null,"url":null,"abstract":"Crime is a major problem in our society where the highest priority is concerned with individuals, society, and government. Thus, it seems important to study factors and relations between the occurrence of different crimes to avoid more upcoming crimes. Crime prediction is a method of trying to study the causes and motives of crime and predict the times and places of its occurrence to reduce the commission of crimes that are expected to occur in the future. Data mining is an important way to facilitate the solution of future crime problems by investigating hidden crime patterns and historical crime data. Therefore, this study aims to analyze and discuss the various factors affecting the commission of crimes and the methods that are applied to predict future crimes and analyze their results. In this study, the model of crime prediction is proposed which is based on some classification algorithms such as (NB, KNN, Decision Tree, random forest, Linear Regression, Logistic Regression, SVM), these classification algorithms are applied to four real data sets (Chicago dataset, Los Angeles dataset, Egypt dataset, United States dataset), Egypt data set was extracted primarily from the online website (Zabatak.com) and comparing between their scores. The experimental results showed that the Random Forest classifier achieves a high score on four data sets compared with other classifiers. Random Forest achieves %88 on the Los Angeles dataset, %92 on the Egypt dataset, %97 on the Chicago dataset, and 81.7% on the United States dataset. Keywords— Crime prediction, data mining, classification, clustering, KNN, NB, SVM","PeriodicalId":137729,"journal":{"name":"IJCI. International Journal of Computers and Information","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Detecting and Predicting Crimes using Data Mining Techniques: Comparative Study\",\"authors\":\"S. Zahran, Eman M. Mohamed, Hamdy M. Mousa\",\"doi\":\"10.21608/ijci.2021.207749\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crime is a major problem in our society where the highest priority is concerned with individuals, society, and government. Thus, it seems important to study factors and relations between the occurrence of different crimes to avoid more upcoming crimes. Crime prediction is a method of trying to study the causes and motives of crime and predict the times and places of its occurrence to reduce the commission of crimes that are expected to occur in the future. Data mining is an important way to facilitate the solution of future crime problems by investigating hidden crime patterns and historical crime data. Therefore, this study aims to analyze and discuss the various factors affecting the commission of crimes and the methods that are applied to predict future crimes and analyze their results. In this study, the model of crime prediction is proposed which is based on some classification algorithms such as (NB, KNN, Decision Tree, random forest, Linear Regression, Logistic Regression, SVM), these classification algorithms are applied to four real data sets (Chicago dataset, Los Angeles dataset, Egypt dataset, United States dataset), Egypt data set was extracted primarily from the online website (Zabatak.com) and comparing between their scores. The experimental results showed that the Random Forest classifier achieves a high score on four data sets compared with other classifiers. Random Forest achieves %88 on the Los Angeles dataset, %92 on the Egypt dataset, %97 on the Chicago dataset, and 81.7% on the United States dataset. Keywords— Crime prediction, data mining, classification, clustering, KNN, NB, SVM\",\"PeriodicalId\":137729,\"journal\":{\"name\":\"IJCI. International Journal of Computers and Information\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IJCI. International Journal of Computers and Information\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21608/ijci.2021.207749\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IJCI. International Journal of Computers and Information","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/ijci.2021.207749","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

犯罪是个人、社会和政府最关心的社会问题。因此,研究不同犯罪之间发生的因素和关系,以避免更多的犯罪发生显得非常重要。犯罪预测是试图研究犯罪的原因和动机,预测其发生的时间和地点,以减少未来可能发生的犯罪的一种方法。数据挖掘是通过调查隐藏的犯罪模式和历史犯罪数据,促进解决未来犯罪问题的重要途径。因此,本研究旨在分析和讨论影响犯罪行为的各种因素以及用于预测未来犯罪和分析其结果的方法。本研究提出了基于NB、KNN、决策树、随机森林、线性回归、逻辑回归、支持向量机等分类算法的犯罪预测模型,并将这些分类算法应用于四个真实数据集(芝加哥数据集、洛杉矶数据集、埃及数据集、美国数据集),其中埃及数据集主要提取自在线网站(Zabatak.com),并对它们的得分进行比较。实验结果表明,与其他分类器相比,随机森林分类器在四个数据集上取得了较高的分数。随机森林在洛杉矶数据集上实现了%88,在埃及数据集上实现了%92,在芝加哥数据集上实现了%97,在美国数据集上实现了81.7%。关键词:犯罪预测,数据挖掘,分类,聚类,KNN, NB, SVM
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Detecting and Predicting Crimes using Data Mining Techniques: Comparative Study
Crime is a major problem in our society where the highest priority is concerned with individuals, society, and government. Thus, it seems important to study factors and relations between the occurrence of different crimes to avoid more upcoming crimes. Crime prediction is a method of trying to study the causes and motives of crime and predict the times and places of its occurrence to reduce the commission of crimes that are expected to occur in the future. Data mining is an important way to facilitate the solution of future crime problems by investigating hidden crime patterns and historical crime data. Therefore, this study aims to analyze and discuss the various factors affecting the commission of crimes and the methods that are applied to predict future crimes and analyze their results. In this study, the model of crime prediction is proposed which is based on some classification algorithms such as (NB, KNN, Decision Tree, random forest, Linear Regression, Logistic Regression, SVM), these classification algorithms are applied to four real data sets (Chicago dataset, Los Angeles dataset, Egypt dataset, United States dataset), Egypt data set was extracted primarily from the online website (Zabatak.com) and comparing between their scores. The experimental results showed that the Random Forest classifier achieves a high score on four data sets compared with other classifiers. Random Forest achieves %88 on the Los Angeles dataset, %92 on the Egypt dataset, %97 on the Chicago dataset, and 81.7% on the United States dataset. Keywords— Crime prediction, data mining, classification, clustering, KNN, NB, SVM
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信