犯罪分析中命名实体识别的两阶段方法

Priyanka Das, A. Das
{"title":"犯罪分析中命名实体识别的两阶段方法","authors":"Priyanka Das, A. Das","doi":"10.1109/ICCCNT.2017.8203949","DOIUrl":null,"url":null,"abstract":"Crime against women in India is on increase over the past few years and enormous crime reports are being generated everyday. But it is difficult to manually access the crime reports to derive useful information that can provide insights to the law enforcement officers for analysing the crime trends. The present work emphasizes on a simple yet efficient two stage approach for analysing crime against women in India. Initially, the proposed framework extracts crime reports from online newspaper articles. Once the data is collected, the first stage approach provides an interesting aspect by identifying named entities like name of states, cities, person etc. from the dataset and a collection of top ten entities of various categories is ranked according to their frequency of occurrence. The preliminary assessment shows feasible results which are also compared with crime records drawn from National Crime Records Bureau. However, the identified subtypes of entities are mostly ignored whereas dealing only with the basic entities fails to provide in-depth recognition of crime trends. So considering the subtypes can really provide the prerequisites for finer distinction in the field of crime data mining. The second stage approach in the present work considers the sub-types of named entities as ‘Modus Operandi’ features (mode of operation) of the crime that caters exquisite perception of the crime performed against women in India. Though lot of research exists on crime analysis, considering modus operandi features is very less. The present work demonstrates the effectiveness of the method with high recall and precision for the identified named entities.","PeriodicalId":6581,"journal":{"name":"2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT)","volume":"6 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"A two-stage approach of named-entity recognition for crime analysis\",\"authors\":\"Priyanka Das, A. Das\",\"doi\":\"10.1109/ICCCNT.2017.8203949\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crime against women in India is on increase over the past few years and enormous crime reports are being generated everyday. But it is difficult to manually access the crime reports to derive useful information that can provide insights to the law enforcement officers for analysing the crime trends. The present work emphasizes on a simple yet efficient two stage approach for analysing crime against women in India. Initially, the proposed framework extracts crime reports from online newspaper articles. Once the data is collected, the first stage approach provides an interesting aspect by identifying named entities like name of states, cities, person etc. from the dataset and a collection of top ten entities of various categories is ranked according to their frequency of occurrence. The preliminary assessment shows feasible results which are also compared with crime records drawn from National Crime Records Bureau. However, the identified subtypes of entities are mostly ignored whereas dealing only with the basic entities fails to provide in-depth recognition of crime trends. So considering the subtypes can really provide the prerequisites for finer distinction in the field of crime data mining. The second stage approach in the present work considers the sub-types of named entities as ‘Modus Operandi’ features (mode of operation) of the crime that caters exquisite perception of the crime performed against women in India. Though lot of research exists on crime analysis, considering modus operandi features is very less. The present work demonstrates the effectiveness of the method with high recall and precision for the identified named entities.\",\"PeriodicalId\":6581,\"journal\":{\"name\":\"2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT)\",\"volume\":\"6 1\",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCNT.2017.8203949\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCNT.2017.8203949","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

在过去的几年里,印度针对女性的犯罪在增加,每天都有大量的犯罪报告。但是,很难手动访问犯罪报告,以获得有用的信息,为执法人员分析犯罪趋势提供见解。目前的工作强调一个简单而有效的两阶段方法来分析印度针对妇女的犯罪。最初,拟议的框架从网上报纸文章中提取犯罪报道。一旦收集到数据,第一阶段的方法提供了一个有趣的方面,即从数据集中识别命名实体,如州名、城市名、人名等,并根据其出现频率对各种类别的前十个实体进行排序。初步评估结果可行,并与国家犯罪记录局的犯罪记录进行了比较。然而,所识别的实体亚型大多被忽视,而仅处理基本实体未能提供对犯罪趋势的深入认识。因此,考虑子类型确实可以为犯罪数据挖掘领域更精细的区分提供先决条件。本工作的第二阶段方法将命名实体的子类型视为犯罪的“作案手法”特征(操作模式),迎合了对印度妇女犯罪的精致感知。虽然在犯罪分析方面有大量的研究,但考虑作案手法特征的研究却很少。实验结果表明,该方法对已识别的命名实体具有较高的查全率和查准率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A two-stage approach of named-entity recognition for crime analysis
Crime against women in India is on increase over the past few years and enormous crime reports are being generated everyday. But it is difficult to manually access the crime reports to derive useful information that can provide insights to the law enforcement officers for analysing the crime trends. The present work emphasizes on a simple yet efficient two stage approach for analysing crime against women in India. Initially, the proposed framework extracts crime reports from online newspaper articles. Once the data is collected, the first stage approach provides an interesting aspect by identifying named entities like name of states, cities, person etc. from the dataset and a collection of top ten entities of various categories is ranked according to their frequency of occurrence. The preliminary assessment shows feasible results which are also compared with crime records drawn from National Crime Records Bureau. However, the identified subtypes of entities are mostly ignored whereas dealing only with the basic entities fails to provide in-depth recognition of crime trends. So considering the subtypes can really provide the prerequisites for finer distinction in the field of crime data mining. The second stage approach in the present work considers the sub-types of named entities as ‘Modus Operandi’ features (mode of operation) of the crime that caters exquisite perception of the crime performed against women in India. Though lot of research exists on crime analysis, considering modus operandi features is very less. The present work demonstrates the effectiveness of the method with high recall and precision for the identified named entities.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信