New Methods of Census Record Linking.

IF 1.6 | Zone 2, History | JCR Q1, HISTORY
Ron Goeken, Lap Huynh, Thomas Lenius, Rebecca Vick
Historical Methods, 44(1), pp. 7-14. Published 2011-01-01. Journal Article.
DOI: https://doi.org/10.1080/01615440.2010.517152
Citations: 62

Abstract

The Minnesota Population Center (MPC) has released linked datasets through its NAPP and IPUMS projects, making them readily accessible to researchers. Prior to the availability of complete-count census microdata from the MPC, researchers applied various forms of record-linking software. This essay describes the techniques used in the MPC's linking program and briefly compares them with those used by other researchers. The key feature of the MPC linking method is the construction of cumulative name similarity scores, based on approximately 2.5 billion record comparisons; we also use support vector machines to classify potential links. This article explains modifications made for the final linked datasets and includes a discussion of the role of weighting variables when using linked data.
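
As a rough illustration of what a name similarity score between two census records can look like, the sketch below computes Jaro-Winkler similarity for first and last names and bundles the scores with an age-consistency measure into a feature tuple of the kind a downstream classifier could consume. The abstract does not say which string comparator the MPC uses or how its cumulative scores are built, so the choice of Jaro-Winkler, the field names (`first`, `last`, `birth_year`), and the function `pair_features` are all assumptions made for this example, not the authors' implementation.

```python
from typing import Dict, Tuple


def jaro(s1: str, s2: str) -> float:
    """Jaro similarity in [0, 1]; 1.0 means identical strings."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters match only if they are within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    t = out_of_order // 2
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1: str, s2: str, prefix_scale: float = 0.1) -> float:
    """Jaro similarity boosted for a shared prefix of up to 4 characters."""
    base = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return base + prefix * prefix_scale * (1.0 - base)


def pair_features(rec_a: Dict, rec_b: Dict) -> Tuple[float, float, int]:
    """Hypothetical feature tuple for one candidate record pair:
    (first-name similarity, last-name similarity, birth-year gap)."""
    return (
        jaro_winkler(rec_a["first"].upper(), rec_b["first"].upper()),
        jaro_winkler(rec_a["last"].upper(), rec_b["last"].upper()),
        abs(rec_a["birth_year"] - rec_b["birth_year"]),
    )


if __name__ == "__main__":
    # Two made-up records for the same hypothetical person in two censuses.
    a = {"first": "Martha", "last": "Johnson", "birth_year": 1852}
    b = {"first": "Marhta", "last": "Johnson", "birth_year": 1853}
    print(pair_features(a, b))
```

In a pipeline like the one the abstract outlines, tuples such as these would be computed for every candidate pair and passed to a trained classifier that decides which pairs are links; that step is omitted here because the abstract gives no details of the features or training data.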

Source journal
Historical Methods
CiteScore: 3.20
Self-citation rate: 7.10%
Articles published: 13
Journal description: Historical Methods reaches an international audience of social scientists concerned with historical problems. It explores interdisciplinary approaches to new data sources, new approaches to older questions and material, and practical discussions of computer and statistical methodology, data collection, and sampling procedures. The journal includes the following features: “Evidence Matters” emphasizes how to find, decipher, and analyze evidence whether or not that evidence is meant to be quantified. “Database Developments” announces major new public databases or large alterations in older ones, discusses innovative ways to organize them, and explains new ways of categorizing information.