Baozhen Qiao, April A Austin, Jamie Musco, Tabassum Insaf, Maria J Schymura
Using LexisNexis to Improve Social Security Number Information in the New York State Cancer Registry.
Journal of Registry Management, volume 50, issue 4, pages 138-143. Publication date: 2023-01-01.
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10945922/pdf/
Abstract
Background: Social Security numbers (SSNs) collected by cancer surveillance registries in the United States are used for patient matching, deduplication, follow-up, and linkage studies. However, for various reasons, a small proportion of patient records have missing or inaccurate SSNs. Recently, New York State Cancer Registry (NYSCR) data were linked to LexisNexis data to obtain patient demographic information, including SSNs. The current study evaluated the feasibility of using LexisNexis to improve SSN information in the NYSCR.
Materials and methods: NYSCR patients diagnosed during the years 2005-2016 at age 21 or older were linked to LexisNexis data. For the matched patients, LexisNexis returned demographic information, including SSNs where available. Percentages of patients without LexisNexis matches or without LexisNexis SSNs were examined by demographic characteristics. We used multivariate logistic regression analyses to further evaluate how patient demographic characteristics affected the likelihood of having no LexisNexis match or no SSN returned. For patients with SSNs returned, LexisNexis SSNs were compared with registry SSNs. If a patient's registry SSN was previously missing or if the LexisNexis SSN was inconsistent with the registry SSN, we used Match*Pro to review and verify match status. Registry SSNs were updated for those confirmed to be true matches. Improvement of SSNs was assessed based on percentage reduction of missingness.
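The comparison step described above can be sketched in code. This is a minimal illustration only, not the authors' implementation: the names `SsnStatus`, `normalize_ssn`, and `triage` are hypothetical, and the rule that any value other than nine digits counts as missing is an assumption. The review itself (performed with Match*Pro in the study) is not reproduced; the sketch only decides which records would be sent to review.

```python
from enum import Enum
from typing import Optional


class SsnStatus(Enum):
    """Hypothetical outcome labels for one registry/LexisNexis record pair."""
    NO_SSN_RETURNED = "no SSN returned by LexisNexis"
    CONSISTENT = "registry and LexisNexis SSNs agree"
    REVIEW_MISSING = "registry SSN missing; review match"
    REVIEW_INCONSISTENT = "SSNs disagree; review match"


def normalize_ssn(raw: Optional[str]) -> Optional[str]:
    """Keep digits only. Assumption: anything that is not nine digits is missing."""
    if raw is None:
        return None
    digits = "".join(ch for ch in raw if ch.isdigit())
    return digits if len(digits) == 9 else None


def triage(registry_ssn: Optional[str], lexisnexis_ssn: Optional[str]) -> SsnStatus:
    """Decide whether a matched record needs manual review before any update."""
    reg = normalize_ssn(registry_ssn)
    returned = normalize_ssn(lexisnexis_ssn)
    if returned is None:
        return SsnStatus.NO_SSN_RETURNED
    if reg is None:
        return SsnStatus.REVIEW_MISSING
    if reg != returned:
        return SsnStatus.REVIEW_INCONSISTENT
    return SsnStatus.CONSISTENT


if __name__ == "__main__":
    # Made-up example values, not real SSNs from any dataset.
    print(triage("123-45-6789", "123456789").name)
    print(triage(None, "123456789").name)
```

Only the two review categories would lead to a registry update, and only after the match is confirmed as true. A production version would also need a policy for placeholder values, which this sketch does not attempt.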
Results: Of 1,396,078 patient records submitted for LexisNexis linkage, 1.6% were not matched. Among those matched, 1.5% did not have SSNs returned. Multivariate logistic regression analyses indicated that patients who were female, Black, Asian Pacific Islander (API), Hispanic, born outside the United States, deceased, or living in poorer census tracts were more likely to have no LexisNexis match or no SSN returned. Among 47,271 patients with missing registry SSNs (3.4%), 26,895 had SSNs returned from LexisNexis, and 24,919 were confirmed to be true matches. After registry SSN updates, the percentage of SSN missingness was reduced to 1.7%, with a larger absolute reduction observed among those who were younger than 60 years, API, or alive. Of 33,057 patients with inconsistent SSNs, 11,474 inconsistencies were due to incorrect consolidations of SSNs in the registry, and those SSNs were subsequently fixed.
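The counts reported above can be turned into shares with simple arithmetic. The sketch below is a rough check, not the authors' calculation: the variable names and the helper `pct` are hypothetical, and it assumes that all submitted records form the denominator and that every confirmed true match led to an SSN update. The abstract states neither assumption.

```python
# Counts quoted in the Results paragraph.
submitted = 1_396_078             # records submitted for linkage
missing_before = 47_271           # patients with missing registry SSNs
returned_for_missing = 26_895     # of those, SSN returned by LexisNexis
confirmed_for_missing = 24_919    # of those, confirmed true matches
inconsistent = 33_057             # patients with inconsistent SSNs
misconsolidated = 11_474          # inconsistencies traced to registry consolidation


def pct(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`, rounded to one decimal."""
    return round(100 * part / whole, 1)


missing_before_pct = pct(missing_before, submitted)
returned_pct = pct(returned_for_missing, missing_before)
confirmed_pct = pct(confirmed_for_missing, returned_for_missing)
misconsolidated_pct = pct(misconsolidated, inconsistent)

# Assumption: each confirmed match removes one record from the missing pool.
missing_after = missing_before - confirmed_for_missing
missing_after_pct = pct(missing_after, submitted)

if __name__ == "__main__":
    print(f"missing before update: {missing_before_pct}%")
    print(f"SSN returned among missing: {returned_pct}%")
    print(f"confirmed among returned: {confirmed_pct}%")
    print(f"inconsistencies from consolidation: {misconsolidated_pct}%")
    print(f"missing after update (approx.): {missing_after_pct}%")
```

Under these assumptions the starting missingness reproduces the reported 3.4%, while the post-update figure comes out near 1.6%, slightly below the reported 1.7%. The abstract does not specify the denominator or exclusions behind the 1.7%, so the last number is an approximation only.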
Conclusions: LexisNexis is a valuable resource for improving the quality of SSN information in registries. Our results showed that the overall percentage of patients with missing SSNs was reduced from 3.4% to 1.7% after LexisNexis linkage, and SSNs that were initially incorrectly consolidated for some patients were also identified and subsequently fixed. However, the magnitude of SSN improvement varied by patient demographic characteristics. Data quality improvements often require resources, and this evaluation can assist registries with decisions related to similar efforts.