使用数据挖掘技术探测和预测犯罪:比较研究

IJCI. International Journal of Computers and Information Pub Date : 2021-12-01 DOI:10.21608/ijci.2021.207749

S. Zahran, Eman M. Mohamed, Hamdy M. Mousa

{"title":"使用数据挖掘技术探测和预测犯罪:比较研究","authors":"S. Zahran, Eman M. Mohamed, Hamdy M. Mousa","doi":"10.21608/ijci.2021.207749","DOIUrl":null,"url":null,"abstract":"Crime is a major problem in our society where the highest priority is concerned with individuals, society, and government. Thus, it seems important to study factors and relations between the occurrence of different crimes to avoid more upcoming crimes. Crime prediction is a method of trying to study the causes and motives of crime and predict the times and places of its occurrence to reduce the commission of crimes that are expected to occur in the future. Data mining is an important way to facilitate the solution of future crime problems by investigating hidden crime patterns and historical crime data. Therefore, this study aims to analyze and discuss the various factors affecting the commission of crimes and the methods that are applied to predict future crimes and analyze their results. In this study, the model of crime prediction is proposed which is based on some classification algorithms such as (NB, KNN, Decision Tree, random forest, Linear Regression, Logistic Regression, SVM), these classification algorithms are applied to four real data sets (Chicago dataset, Los Angeles dataset, Egypt dataset, United States dataset), Egypt data set was extracted primarily from the online website (Zabatak.com) and comparing between their scores. The experimental results showed that the Random Forest classifier achieves a high score on four data sets compared with other classifiers. Random Forest achieves %88 on the Los Angeles dataset, %92 on the Egypt dataset, %97 on the Chicago dataset, and 81.7% on the United States dataset. Keywords— Crime prediction, data mining, classification, clustering, KNN, NB, SVM","PeriodicalId":137729,"journal":{"name":"IJCI. International Journal of Computers and Information","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Detecting and Predicting Crimes using Data Mining Techniques: Comparative Study\",\"authors\":\"S. Zahran, Eman M. Mohamed, Hamdy M. Mousa\",\"doi\":\"10.21608/ijci.2021.207749\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crime is a major problem in our society where the highest priority is concerned with individuals, society, and government. Thus, it seems important to study factors and relations between the occurrence of different crimes to avoid more upcoming crimes. Crime prediction is a method of trying to study the causes and motives of crime and predict the times and places of its occurrence to reduce the commission of crimes that are expected to occur in the future. Data mining is an important way to facilitate the solution of future crime problems by investigating hidden crime patterns and historical crime data. Therefore, this study aims to analyze and discuss the various factors affecting the commission of crimes and the methods that are applied to predict future crimes and analyze their results. In this study, the model of crime prediction is proposed which is based on some classification algorithms such as (NB, KNN, Decision Tree, random forest, Linear Regression, Logistic Regression, SVM), these classification algorithms are applied to four real data sets (Chicago dataset, Los Angeles dataset, Egypt dataset, United States dataset), Egypt data set was extracted primarily from the online website (Zabatak.com) and comparing between their scores. The experimental results showed that the Random Forest classifier achieves a high score on four data sets compared with other classifiers. Random Forest achieves %88 on the Los Angeles dataset, %92 on the Egypt dataset, %97 on the Chicago dataset, and 81.7% on the United States dataset. Keywords— Crime prediction, data mining, classification, clustering, KNN, NB, SVM\",\"PeriodicalId\":137729,\"journal\":{\"name\":\"IJCI. International Journal of Computers and Information\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IJCI. International Journal of Computers and Information\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21608/ijci.2021.207749\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IJCI. International Journal of Computers and Information","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/ijci.2021.207749","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

犯罪是个人、社会和政府最关心的社会问题。因此，研究不同犯罪之间发生的因素和关系，以避免更多的犯罪发生显得非常重要。犯罪预测是试图研究犯罪的原因和动机，预测其发生的时间和地点，以减少未来可能发生的犯罪的一种方法。数据挖掘是通过调查隐藏的犯罪模式和历史犯罪数据，促进解决未来犯罪问题的重要途径。因此，本研究旨在分析和讨论影响犯罪行为的各种因素以及用于预测未来犯罪和分析其结果的方法。本研究提出了基于NB、KNN、决策树、随机森林、线性回归、逻辑回归、支持向量机等分类算法的犯罪预测模型，并将这些分类算法应用于四个真实数据集(芝加哥数据集、洛杉矶数据集、埃及数据集、美国数据集)，其中埃及数据集主要提取自在线网站(Zabatak.com)，并对它们的得分进行比较。实验结果表明，与其他分类器相比，随机森林分类器在四个数据集上取得了较高的分数。随机森林在洛杉矶数据集上实现了%88，在埃及数据集上实现了%92，在芝加哥数据集上实现了%97，在美国数据集上实现了81.7%。关键词:犯罪预测，数据挖掘，分类，聚类，KNN, NB, SVM

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Detecting and Predicting Crimes using Data Mining Techniques: Comparative Study

Crime is a major problem in our society where the highest priority is concerned with individuals, society, and government. Thus, it seems important to study factors and relations between the occurrence of different crimes to avoid more upcoming crimes. Crime prediction is a method of trying to study the causes and motives of crime and predict the times and places of its occurrence to reduce the commission of crimes that are expected to occur in the future. Data mining is an important way to facilitate the solution of future crime problems by investigating hidden crime patterns and historical crime data. Therefore, this study aims to analyze and discuss the various factors affecting the commission of crimes and the methods that are applied to predict future crimes and analyze their results. In this study, the model of crime prediction is proposed which is based on some classification algorithms such as (NB, KNN, Decision Tree, random forest, Linear Regression, Logistic Regression, SVM), these classification algorithms are applied to four real data sets (Chicago dataset, Los Angeles dataset, Egypt dataset, United States dataset), Egypt data set was extracted primarily from the online website (Zabatak.com) and comparing between their scores. The experimental results showed that the Random Forest classifier achieves a high score on four data sets compared with other classifiers. Random Forest achieves %88 on the Los Angeles dataset, %92 on the Egypt dataset, %97 on the Chicago dataset, and 81.7% on the United States dataset. Keywords— Crime prediction, data mining, classification, clustering, KNN, NB, SVM

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IJCI. International Journal of Computers and Information

自引率

0.00%

发文量