利用数据链接了解南澳大利亚州的血液制品使用模式和结果。

IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES
M. Palfy, Christopher Radbone
{"title":"利用数据链接了解南澳大利亚州的血液制品使用模式和结果。","authors":"M. Palfy, Christopher Radbone","doi":"10.23889/ijpds.v7i3.1975","DOIUrl":null,"url":null,"abstract":"ObjectivesThe purpose of this analytical activity was to ensure confidence in the technical capability for extracting, linking, and integrating public hospital inpatient data, public pathology blood transfusions records and blood tests, to optimise records linkage allowing patterns and trends to be then analysed with confidence. \nApproachThe SURE secure data platform was essential to ensure data governance and security requirements were met while integrating health data spanning 18 months (January 2018 - June 2019). Data sources came in multiple formats of varying quality. R was chosen for its data wrangling abilities and reproducibility. \nThe phases were: \n \nSource data loading and cleaning \nLinking hospital inpatient and blood transfusions records \nSummarising linked transfusion data \nLinking inpatient and blood tests data \nSummarising linked tests data \nIntegrating hospital data with summarised transfusion and summarised tests data \nDeriving additional variables based on summarised data \n \nResultsFrom 143,192 transfusion records, 55,053 (38.4%) were excluded as they did not meet the inclusion criteria (e.g., hospital or blood product out-of-scope). \nFrom 7,897,451 blood test records, 238,013 (3.0%) were excluded, mostly of poor quality (missing/invalid hospital code). \nInitially 91.4% of transfusion records were matched with hospital inpatient records. The linkage rate for state-wide blood test records was 62.3% for tests records, noting the low match rate was attributed to tests not performed on public hospital patients, as the blood test data was statewide. \nLinkage process was improved by adding additional patient codes from public pathology’s internal patient identifiers. The linkage rate improved to 95.5% for transfusion records and 64.4% for test records. \nConclusion12 different data sources, with differing file types and formats, needed coding to achieve standardised results, enabling future reproducibility. Over one hundred business rules were implemented to produce a robust solution for future data updates. End results were analysed, and it was determined that linkage and integration quality exceeded previous similar attempts in terms of match rate and accuracy.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":" ","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2022-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Understanding South Australia’s blood products usage patterns and outcomes, using data linkage.\",\"authors\":\"M. Palfy, Christopher Radbone\",\"doi\":\"10.23889/ijpds.v7i3.1975\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ObjectivesThe purpose of this analytical activity was to ensure confidence in the technical capability for extracting, linking, and integrating public hospital inpatient data, public pathology blood transfusions records and blood tests, to optimise records linkage allowing patterns and trends to be then analysed with confidence. \\nApproachThe SURE secure data platform was essential to ensure data governance and security requirements were met while integrating health data spanning 18 months (January 2018 - June 2019). Data sources came in multiple formats of varying quality. R was chosen for its data wrangling abilities and reproducibility. \\nThe phases were: \\n \\nSource data loading and cleaning \\nLinking hospital inpatient and blood transfusions records \\nSummarising linked transfusion data \\nLinking inpatient and blood tests data \\nSummarising linked tests data \\nIntegrating hospital data with summarised transfusion and summarised tests data \\nDeriving additional variables based on summarised data \\n \\nResultsFrom 143,192 transfusion records, 55,053 (38.4%) were excluded as they did not meet the inclusion criteria (e.g., hospital or blood product out-of-scope). \\nFrom 7,897,451 blood test records, 238,013 (3.0%) were excluded, mostly of poor quality (missing/invalid hospital code). \\nInitially 91.4% of transfusion records were matched with hospital inpatient records. The linkage rate for state-wide blood test records was 62.3% for tests records, noting the low match rate was attributed to tests not performed on public hospital patients, as the blood test data was statewide. \\nLinkage process was improved by adding additional patient codes from public pathology’s internal patient identifiers. The linkage rate improved to 95.5% for transfusion records and 64.4% for test records. \\nConclusion12 different data sources, with differing file types and formats, needed coding to achieve standardised results, enabling future reproducibility. Over one hundred business rules were implemented to produce a robust solution for future data updates. End results were analysed, and it was determined that linkage and integration quality exceeded previous similar attempts in terms of match rate and accuracy.\",\"PeriodicalId\":36483,\"journal\":{\"name\":\"International Journal of Population Data Science\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2022-08-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Population Data Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23889/ijpds.v7i3.1975\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Population Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23889/ijpds.v7i3.1975","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

摘要

目的该分析活动的目的是确保对提取、链接和集成公立医院住院数据、公共病理学输血记录和血液测试的技术能力的信心,以优化记录链接,从而可以自信地分析模式和趋势。方法SURE安全数据平台对于确保在整合18个月(2018年1月至2019年6月)的健康数据时满足数据治理和安全要求至关重要。数据源有多种不同质量的格式。选择R是因为它的数据处理能力和再现性。阶段为:源数据加载和清理链接医院住院患者和输血记录汇总链接的输血数据链接住院患者和血液测试数据汇总链接的测试数据将医院数据与汇总的输血和汇总的测试数据整合基于汇总的数据推导出附加变量结果从143192份输血记录中,55053人(38.4%)被排除在外,因为他们不符合纳入标准(例如,医院或血液制品超出范围)。在7897451份血液检测记录中,238013份(3.0%)被排除在外,主要是质量差(医院代码缺失/无效)。最初91.4%的输血记录与医院住院记录相匹配。全州血液检测记录与检测记录的关联率为62.3%,注意到低匹配率归因于没有对公立医院患者进行检测,因为血液检测数据是全州范围的。通过从公共病理学的内部患者标识符中添加额外的患者代码,改进了链接过程。输血记录和检测记录的关联率分别提高到95.5%和64.4%。结论12个不同的数据源,具有不同的文件类型和格式,需要编码以实现标准化结果,从而实现未来的再现性。实施了一百多条业务规则,为未来的数据更新提供了一个强大的解决方案。对最终结果进行了分析,确定链接和集成质量在匹配率和准确性方面超过了以前的类似尝试。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Understanding South Australia’s blood products usage patterns and outcomes, using data linkage.
ObjectivesThe purpose of this analytical activity was to ensure confidence in the technical capability for extracting, linking, and integrating public hospital inpatient data, public pathology blood transfusions records and blood tests, to optimise records linkage allowing patterns and trends to be then analysed with confidence. ApproachThe SURE secure data platform was essential to ensure data governance and security requirements were met while integrating health data spanning 18 months (January 2018 - June 2019). Data sources came in multiple formats of varying quality. R was chosen for its data wrangling abilities and reproducibility. The phases were: Source data loading and cleaning Linking hospital inpatient and blood transfusions records Summarising linked transfusion data Linking inpatient and blood tests data Summarising linked tests data Integrating hospital data with summarised transfusion and summarised tests data Deriving additional variables based on summarised data ResultsFrom 143,192 transfusion records, 55,053 (38.4%) were excluded as they did not meet the inclusion criteria (e.g., hospital or blood product out-of-scope). From 7,897,451 blood test records, 238,013 (3.0%) were excluded, mostly of poor quality (missing/invalid hospital code). Initially 91.4% of transfusion records were matched with hospital inpatient records. The linkage rate for state-wide blood test records was 62.3% for tests records, noting the low match rate was attributed to tests not performed on public hospital patients, as the blood test data was statewide. Linkage process was improved by adding additional patient codes from public pathology’s internal patient identifiers. The linkage rate improved to 95.5% for transfusion records and 64.4% for test records. Conclusion12 different data sources, with differing file types and formats, needed coding to achieve standardised results, enabling future reproducibility. Over one hundred business rules were implemented to produce a robust solution for future data updates. End results were analysed, and it was determined that linkage and integration quality exceeded previous similar attempts in terms of match rate and accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.50
自引率
0.00%
发文量
386
审稿时长
20 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信