A Rule and Graph-Based Approach for Targeted Identity Resolution on Policing Data

Michael Phillips, Mohammad Hossein Amirhosseini, H. Kazemian
{"title":"A Rule and Graph-Based Approach for Targeted Identity Resolution on Policing Data","authors":"Michael Phillips, Mohammad Hossein Amirhosseini, H. Kazemian","doi":"10.1109/SSCI47803.2020.9308182","DOIUrl":null,"url":null,"abstract":"In criminal records, intentional manipulation of data is prevalent to create ambiguous identity and mislead authorities. Registering data electronically can result in misspelled data, variations in naming order, case sensitive data and inconsistencies in abbreviations and terminology. Therefore, trying to obtain the true identity (or identities) of a suspect can be a challenge for law enforcement agencies. We have developed a targeted approach to identity resolution which uses a rule-based scoring system on physical and official identity attributes and a graph-based analysis on social identity attributes to interrogate policing data and resolve whether a specific target is using multiple identities. The approach has been tested on an anonymized policing dataset, used in the SPIRIT project, funded by the European Union’s Horizon 2020. The dataset contains four ‘known’ identities using a total of five false identities. 23 targets were inputted into the methodology with no knowledge of how many or which had false identities. The rule-based scoring system ranked four of the five false identities with the joint highest score for the relevant target name with the remaining false identity holding the joint second highest score for its target. Moreover, when using graph analysis, 51 suspected false identities were found for the 23 targets with four of the five false identities linked through the crimes they had been involved in. Therefore, an identity resolution approach using both a rule-based scoring system and graph analysis, could be effective in facilitating the investigation process for law enforcement agencies and assisting them in finding criminals using false identities.","PeriodicalId":413489,"journal":{"name":"2020 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI47803.2020.9308182","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In criminal records, intentional manipulation of data is prevalent to create ambiguous identity and mislead authorities. Registering data electronically can result in misspelled data, variations in naming order, case sensitive data and inconsistencies in abbreviations and terminology. Therefore, trying to obtain the true identity (or identities) of a suspect can be a challenge for law enforcement agencies. We have developed a targeted approach to identity resolution which uses a rule-based scoring system on physical and official identity attributes and a graph-based analysis on social identity attributes to interrogate policing data and resolve whether a specific target is using multiple identities. The approach has been tested on an anonymized policing dataset, used in the SPIRIT project, funded by the European Union’s Horizon 2020. The dataset contains four ‘known’ identities using a total of five false identities. 23 targets were inputted into the methodology with no knowledge of how many or which had false identities. The rule-based scoring system ranked four of the five false identities with the joint highest score for the relevant target name with the remaining false identity holding the joint second highest score for its target. Moreover, when using graph analysis, 51 suspected false identities were found for the 23 targets with four of the five false identities linked through the crimes they had been involved in. Therefore, an identity resolution approach using both a rule-based scoring system and graph analysis, could be effective in facilitating the investigation process for law enforcement agencies and assisting them in finding criminals using false identities.
基于规则和图的警务数据目标身份解析方法
在犯罪记录中,故意操纵数据以制造模糊身份和误导当局是普遍存在的。以电子方式注册数据可能导致数据拼写错误、命名顺序变化、区分大小写的数据以及缩写和术语的不一致。因此,试图获得嫌疑人的真实身份(或身份)对执法机构来说可能是一个挑战。我们已经开发了一种有针对性的身份解决方法,该方法使用基于规则的物理和官方身份属性评分系统,以及基于图形的社会身份属性分析来查询警务数据并确定特定目标是否使用多个身份。该方法已经在一个匿名的警务数据集上进行了测试,该数据集被用于SPIRIT项目,该项目由欧盟的“地平线2020”资助。该数据集包含四个“已知”身份,共使用五个假身份。方法中输入了23个目标,但不知道有多少目标或哪些目标拥有虚假身份。基于规则的评分系统将5个假身份中的4个与相关目标名称的联合得分最高,其余假身份为其目标的联合得分第二高。此外,当使用图形分析时,在23个目标中发现了51个可疑的假身份,其中4个假身份与他们所参与的犯罪有关。因此,使用基于规则的评分系统和图形分析的身份解决方法可以有效地促进执法机构的调查过程,并帮助他们找到使用虚假身份的罪犯。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信