Which are the best identifiers for record linkage?

Catherine Quantin, Christine Binquet, Karima Bourquard, Ronny Pattisina, Béatrice Gouyon-Cornet, Cyril Ferdynus, Jean-Bernard Gouyon, Allaert François-André
Medical Informatics and the Internet in Medicine, vol. 29, no. 3-4, pp. 221-7. Published 2004-09-01. Journal article. DOI: https://doi.org/10.1080/14639230400005974
Citations: 32

Abstract

Because linkage based on less informative identifiers can lead to linkage errors, it is essential to quantify the information associated with each identifier. The aim of this study was to estimate the discriminating power of different identifiers that could be used in a record linkage process. This work showed the value of three identifiers when linking data concerning the same patient using an automatic procedure based on the method proposed by Jaro: the date of birth, the first name, and the last name seemed to be the most appropriate identifiers. Including a poorly discriminating identifier such as gender did not improve the results. Moreover, adding a second Christian name, which is often missing, increased linkage errors. In contrast, a phonetic treatment adapted to the French language seemed to improve linkage results compared with Soundex. However, whatever the method used, it seems necessary to improve the quality of identifier collection, as it could greatly influence linkage results.
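One common way to put a number on how informative an identifier is uses the Shannon entropy of its observed values, reported together with its missing-value rate. The sketch below is only an illustration under that assumption: the abstract does not say which measure the authors used, and the function names and toy records are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Entropy, in bits, of the empirical distribution of non-missing values.

    Higher entropy means the identifier splits records into more, and more
    evenly sized, groups, so it discriminates better between individuals.
    """
    observed = [v for v in values if v is not None]
    if not observed:
        return 0.0
    total = len(observed)
    counts = Counter(observed)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def missing_rate(values):
    """Fraction of records in which the identifier is missing (None)."""
    return sum(v is None for v in values) / len(values) if values else 0.0


# Hypothetical toy records, not data from the study.
records = [
    {"gender": "F", "birth_date": "1970-01-02", "second_name": None},
    {"gender": "M", "birth_date": "1982-05-17", "second_name": None},
    {"gender": "F", "birth_date": "1991-11-30", "second_name": "Marie"},
    {"gender": "M", "birth_date": "1965-08-09", "second_name": None},
]

for field in ("gender", "birth_date", "second_name"):
    column = [r[field] for r in records]
    print(field, shannon_entropy(column), missing_rate(column))
```

In this toy data the date of birth carries more bits than gender, and the second name is missing in most records, which mirrors the qualitative pattern the abstract describes without reproducing any of its results.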