从报警电话中衡量网络犯罪

IF 1.8 4区 社会学 Q2 CRIMINOLOGY & PENOLOGY
Doy Kwon, Hervé Borrion, Richard Wortley
{"title":"从报警电话中衡量网络犯罪","authors":"Doy Kwon,&nbsp;Hervé Borrion,&nbsp;Richard Wortley","doi":"10.1007/s11417-024-09432-2","DOIUrl":null,"url":null,"abstract":"<div><p>Conventional police databases contain much information on cybercrime, but extracting it remains a practical challenge. This is because these databases rarely contain labels that could be used to automatically retrieve all cybercrime incidents. In this article, we present a supervised machine learning method for extracting cybercrime incidents in calls for police service datasets. Data from the Korean National Police (2020, 9 months, <i>N</i> = 15 million call logs) is used for the demonstration. We combined methods of keyword query selection, minority oversampling, and majority voting techniques to develop a classifier. Three classification techniques, including Naïve Bayes, linear SVM, and kernel SVM, were tested, and the kernel model was chosen to build the final model (accuracy, 93.4%; F1-score, 92.4). We estimate that cybercrime only represents 4.6% of the cases in the selected dataset (excluding traffic-related incidents), but that it can be prevalent with some crime types. We found, for example, that about three quarters (76%) of all fraud incidents have a cyber dimension. We conclude that the cybercrime classification method proposed in this study can support further research on cybercrime and that it offers considerable advantages over manual or keyword-based approaches.</p></div>","PeriodicalId":45526,"journal":{"name":"Asian Journal of Criminology","volume":"19 3","pages":"329 - 351"},"PeriodicalIF":1.8000,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11417-024-09432-2.pdf","citationCount":"0","resultStr":"{\"title\":\"Measuring Cybercrime in Calls for Police Service\",\"authors\":\"Doy Kwon,&nbsp;Hervé Borrion,&nbsp;Richard Wortley\",\"doi\":\"10.1007/s11417-024-09432-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Conventional police databases contain much information on cybercrime, but extracting it remains a practical challenge. This is because these databases rarely contain labels that could be used to automatically retrieve all cybercrime incidents. In this article, we present a supervised machine learning method for extracting cybercrime incidents in calls for police service datasets. Data from the Korean National Police (2020, 9 months, <i>N</i> = 15 million call logs) is used for the demonstration. We combined methods of keyword query selection, minority oversampling, and majority voting techniques to develop a classifier. Three classification techniques, including Naïve Bayes, linear SVM, and kernel SVM, were tested, and the kernel model was chosen to build the final model (accuracy, 93.4%; F1-score, 92.4). We estimate that cybercrime only represents 4.6% of the cases in the selected dataset (excluding traffic-related incidents), but that it can be prevalent with some crime types. We found, for example, that about three quarters (76%) of all fraud incidents have a cyber dimension. We conclude that the cybercrime classification method proposed in this study can support further research on cybercrime and that it offers considerable advantages over manual or keyword-based approaches.</p></div>\",\"PeriodicalId\":45526,\"journal\":{\"name\":\"Asian Journal of Criminology\",\"volume\":\"19 3\",\"pages\":\"329 - 351\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-07-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s11417-024-09432-2.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Criminology\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s11417-024-09432-2\",\"RegionNum\":4,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CRIMINOLOGY & PENOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Criminology","FirstCategoryId":"90","ListUrlMain":"https://link.springer.com/article/10.1007/s11417-024-09432-2","RegionNum":4,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CRIMINOLOGY & PENOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

传统的警方数据库包含大量网络犯罪信息,但如何提取这些信息仍是一个实际挑战。这是因为这些数据库很少包含可用于自动检索所有网络犯罪事件的标签。在本文中,我们提出了一种监督机器学习方法,用于从报警服务数据集中提取网络犯罪事件。演示使用的数据来自韩国国家警察局(2020 年,9 个月,N = 1500 万个通话记录)。我们结合了关键词查询选择、少数超采样和多数投票技术等方法来开发分类器。我们测试了三种分类技术,包括奈伊夫贝叶斯、线性 SVM 和核 SVM,并选择核模型建立最终模型(准确率为 93.4%;F1 分数为 92.4)。我们估计,网络犯罪只占所选数据集中案件的 4.6%(不包括交通相关事件),但它可能在某些犯罪类型中普遍存在。例如,我们发现约四分之三(76%)的欺诈事件都涉及网络问题。我们的结论是,本研究中提出的网络犯罪分类方法可支持对网络犯罪的进一步研究,与人工或基于关键词的方法相比,它具有相当大的优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Measuring Cybercrime in Calls for Police Service

Conventional police databases contain much information on cybercrime, but extracting it remains a practical challenge. This is because these databases rarely contain labels that could be used to automatically retrieve all cybercrime incidents. In this article, we present a supervised machine learning method for extracting cybercrime incidents in calls for police service datasets. Data from the Korean National Police (2020, 9 months, N = 15 million call logs) is used for the demonstration. We combined methods of keyword query selection, minority oversampling, and majority voting techniques to develop a classifier. Three classification techniques, including Naïve Bayes, linear SVM, and kernel SVM, were tested, and the kernel model was chosen to build the final model (accuracy, 93.4%; F1-score, 92.4). We estimate that cybercrime only represents 4.6% of the cases in the selected dataset (excluding traffic-related incidents), but that it can be prevalent with some crime types. We found, for example, that about three quarters (76%) of all fraud incidents have a cyber dimension. We conclude that the cybercrime classification method proposed in this study can support further research on cybercrime and that it offers considerable advantages over manual or keyword-based approaches.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Asian Journal of Criminology
Asian Journal of Criminology CRIMINOLOGY & PENOLOGY-
CiteScore
3.00
自引率
10.50%
发文量
31
期刊介绍: Electronic submission now possible! Please see the Instructions for Authors. For general information about this new journal please contact the publisher at [welmoed.spahr@springer.com] The Asian Journal of Criminology aims to advance the study of criminology and criminal justice in Asia, to promote evidence-based public policy in crime prevention, and to promote comparative studies about crime and criminal justice. The Journal provides a platform for criminologists, policymakers, and practitioners and welcomes manuscripts relating to crime, crime prevention, criminal law, medico-legal topics and the administration of criminal justice in Asian countries. The Journal especially encourages theoretical and methodological papers with an emphasis on evidence-based, empirical research addressing crime in Asian contexts. It seeks to publish research arising from a broad variety of methodological traditions, including quantitative, qualitative, historical, and comparative methods. The Journal fosters a multi-disciplinary focus and welcomes manuscripts from a variety of disciplines, including criminology, criminal justice, law, sociology, psychology, forensic science, social work, urban studies, history, and geography.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信