1996年3月当期人口调查与1995年全国健康访问调查的统计匹配。

Q1 Mathematics
D. D. Ingram, C. Moriarity, John F. O'Hare, Joan L. Turek
{"title":"1996年3月当期人口调查与1995年全国健康访问调查的统计匹配。","authors":"D. D. Ingram, C. Moriarity, John F. O'Hare, Joan L. Turek","doi":"10.1037/e414732008-001","DOIUrl":null,"url":null,"abstract":"OBJECTIVES Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. METHODS Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. RESULTS The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.","PeriodicalId":23577,"journal":{"name":"Vital and health statistics. Series 2, Data evaluation and methods research","volume":"144 1","pages":"1-50"},"PeriodicalIF":0.0000,"publicationDate":"2008-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey.\",\"authors\":\"D. D. Ingram, C. Moriarity, John F. O'Hare, Joan L. Turek\",\"doi\":\"10.1037/e414732008-001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"OBJECTIVES Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. METHODS Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. RESULTS The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.\",\"PeriodicalId\":23577,\"journal\":{\"name\":\"Vital and health statistics. Series 2, Data evaluation and methods research\",\"volume\":\"144 1\",\"pages\":\"1-50\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Vital and health statistics. Series 2, Data evaluation and methods research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1037/e414732008-001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vital and health statistics. Series 2, Data evaluation and methods research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1037/e414732008-001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 2

摘要

目的统计匹配是一种用于组合两个文件的方法,当一个文件上的个人不太可能也在另一个文件上时。本报告的目的是记录和评价1996年3月当前人口调查(CPS)和1995年全国健康访谈调查(NHIS)的统计匹配情况,并提出改进今后匹配情况的建议。CPS-NHIS匹配的动机是需要一套包含卫生措施和家庭资源数据的数据集,以便用于政策分析。方法对1996年3月全国人口统计系统与1995年全国人口统计系统进行了三次统计匹配。所有三个匹配都使用带有分区的人级约束匹配和预测平均匹配算法来链接两个文件上的记录。在其中的两个配对中,CPS作为宿主文件,NHIS作为供体文件;对于第三个匹配,NHIS是主机文件,CPS是供体文件。结果1996年3月CPS和1995年NHIS的约束预测平均匹配成功地结合了两个文件的部分信息,但匹配文件中某些Host和Donor变量之间的关系可能存在扭曲。对匹配的评估表明,在匹配之前用于划分宿主和供体文件的变量以及涉及预测平均匹配的变量在确定匹配文件上的变量之间的关系是否正确地表示总体中这些变量之间的关系方面发挥了重要作用。评估还表明,对小群体的估计可能特别容易出错。结果表明,在统计匹配的文件上探索宿主和供体变量之间的关系时,需要谨慎行事。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey.
OBJECTIVES Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. METHODS Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. RESULTS The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
13.20
自引率
0.00%
发文量
0
期刊介绍: Studies of new statistical methodology including experimental tests of new survey methods, studies of vital statistics collection methods, new analytical techniques, objective evaluations of reliability of collected data, and contributions to statistical theory. Studies also include comparison of U.S. methodology with those of other countries.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信