提高慢性阻塞性肺病患者α -1抗胰蛋白酶缺乏症的可能性:一种使用现实世界数据的新型预测模型

IF 2.3 4区 医学 Q2 RESPIRATORY SYSTEM
Daniel N Pfeffer, Rahul Dhakne, Omnya El Massad, Pulkit Sehgal, Thomas Ardiles, Michael O Calloway, M Chris Runken, Charlie Strange
{"title":"提高慢性阻塞性肺病患者α -1抗胰蛋白酶缺乏症的可能性:一种使用现实世界数据的新型预测模型","authors":"Daniel N Pfeffer, Rahul Dhakne, Omnya El Massad, Pulkit Sehgal, Thomas Ardiles, Michael O Calloway, M Chris Runken, Charlie Strange","doi":"10.15326/jcopdf.2023.0491","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Despite guideline recommendations, most patients with chronic obstructive pulmonary disease (COPD) do not undergo alpha-1 antitrypsin deficiency (AATD) testing and approximately 90% of people with AATD in the United States remain undiagnosed. This study sought to develop a predictive model using real-world data to improve detection of AATD-positive patients in the general COPD population.</p><p><strong>Methods: </strong>A predictive model using extreme gradient boosting was developed using the EVERSANA database, including longitudinal, patient-level medical claims, prescription claims, AATD-specific testing data, and electronic health records (EHR). The model was trained and then validated to predict AATD-positive status. Patients were coded as AATD positive based on the presence of any of the following criteria: (1) ≥2 AATD diagnosis codes in claims; (2) an AATD diagnosis code in the EHR; (3) a positive laboratory test for AATD; or (4) use of AATD-related medication. Over 500 variables were used to train the predictive model and >20 models were run to optimize the predictive power.</p><p><strong>Results: </strong>A total of 13,585 AATD-positive patients and 7796 AATD-negative patients were included in the model. The inclusion of non-AATD laboratory test results was critical for defining cohorts and optimizing model prediction (e.g., respiratory comorbidities, and calcium, glucose, hemoglobin, and bilirubin levels). The final model yielded high predictive power, with an area under the receiver operating characteristic curve of 0.9.</p><p><strong>Conclusion: </strong>Predictive modeling using real-world data is a sound approach for assessing AATD risk and useful for identifying COPD patients who should be confirmed by genetic testing. External validation is warranted to further assess the generalizability of these results.</p>","PeriodicalId":51340,"journal":{"name":"Chronic Obstructive Pulmonary Diseases-Journal of the Copd Foundation","volume":" ","pages":"1-11"},"PeriodicalIF":2.3000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving the Likelihood of Identifying Alpha-1 Antitrypsin Deficiency Among Patients With COPD: A Novel Predictive Model Using Real-World Data.\",\"authors\":\"Daniel N Pfeffer, Rahul Dhakne, Omnya El Massad, Pulkit Sehgal, Thomas Ardiles, Michael O Calloway, M Chris Runken, Charlie Strange\",\"doi\":\"10.15326/jcopdf.2023.0491\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Despite guideline recommendations, most patients with chronic obstructive pulmonary disease (COPD) do not undergo alpha-1 antitrypsin deficiency (AATD) testing and approximately 90% of people with AATD in the United States remain undiagnosed. This study sought to develop a predictive model using real-world data to improve detection of AATD-positive patients in the general COPD population.</p><p><strong>Methods: </strong>A predictive model using extreme gradient boosting was developed using the EVERSANA database, including longitudinal, patient-level medical claims, prescription claims, AATD-specific testing data, and electronic health records (EHR). The model was trained and then validated to predict AATD-positive status. Patients were coded as AATD positive based on the presence of any of the following criteria: (1) ≥2 AATD diagnosis codes in claims; (2) an AATD diagnosis code in the EHR; (3) a positive laboratory test for AATD; or (4) use of AATD-related medication. Over 500 variables were used to train the predictive model and >20 models were run to optimize the predictive power.</p><p><strong>Results: </strong>A total of 13,585 AATD-positive patients and 7796 AATD-negative patients were included in the model. The inclusion of non-AATD laboratory test results was critical for defining cohorts and optimizing model prediction (e.g., respiratory comorbidities, and calcium, glucose, hemoglobin, and bilirubin levels). The final model yielded high predictive power, with an area under the receiver operating characteristic curve of 0.9.</p><p><strong>Conclusion: </strong>Predictive modeling using real-world data is a sound approach for assessing AATD risk and useful for identifying COPD patients who should be confirmed by genetic testing. External validation is warranted to further assess the generalizability of these results.</p>\",\"PeriodicalId\":51340,\"journal\":{\"name\":\"Chronic Obstructive Pulmonary Diseases-Journal of the Copd Foundation\",\"volume\":\" \",\"pages\":\"1-11\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-01-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chronic Obstructive Pulmonary Diseases-Journal of the Copd Foundation\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.15326/jcopdf.2023.0491\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"RESPIRATORY SYSTEM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chronic Obstructive Pulmonary Diseases-Journal of the Copd Foundation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.15326/jcopdf.2023.0491","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
引用次数: 0

摘要

背景:尽管指南建议,大多数COPD患者不接受α -1抗胰蛋白酶缺乏症(AATD)检测,美国约90%的AATD患者仍未确诊。本研究旨在利用真实世界数据开发一种预测模型,以提高对普通COPD人群中aatd阳性患者的检测。方法:利用EVERSANA数据库,包括纵向、患者级医疗索赔、处方索赔、aatd特定测试数据和电子健康记录(EHR),利用XGBoost建立预测模型。该模型经过训练和验证,可以预测aatd阳性状态。根据以下任何一个标准,患者被编码为AATD阳性:1)索赔中AATD诊断编码≥2个;2) EHR中的AATD诊断代码;3) AATD实验室检测呈阳性;或4)使用aatd相关药物。使用超过500个变量来训练预测模型,并运行bbb20个模型来优化预测能力。结果:aatd阳性患者13585例,aatd阴性患者7796例。纳入非aatd实验室检测结果对于确定队列和优化模型预测(例如,呼吸合并症、钙、葡萄糖、血红蛋白和胆红素水平)至关重要。最终模型具有较高的预测能力,受者工作特性曲线下面积为0.9。结论:使用真实世界数据的预测建模是评估AATD风险的一种有效方法,有助于识别需要通过基因检测确诊的COPD患者。外部验证是必要的,以进一步评估这些结果的普遍性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Improving the Likelihood of Identifying Alpha-1 Antitrypsin Deficiency Among Patients With COPD: A Novel Predictive Model Using Real-World Data.

Background: Despite guideline recommendations, most patients with chronic obstructive pulmonary disease (COPD) do not undergo alpha-1 antitrypsin deficiency (AATD) testing and approximately 90% of people with AATD in the United States remain undiagnosed. This study sought to develop a predictive model using real-world data to improve detection of AATD-positive patients in the general COPD population.

Methods: A predictive model using extreme gradient boosting was developed using the EVERSANA database, including longitudinal, patient-level medical claims, prescription claims, AATD-specific testing data, and electronic health records (EHR). The model was trained and then validated to predict AATD-positive status. Patients were coded as AATD positive based on the presence of any of the following criteria: (1) ≥2 AATD diagnosis codes in claims; (2) an AATD diagnosis code in the EHR; (3) a positive laboratory test for AATD; or (4) use of AATD-related medication. Over 500 variables were used to train the predictive model and >20 models were run to optimize the predictive power.

Results: A total of 13,585 AATD-positive patients and 7796 AATD-negative patients were included in the model. The inclusion of non-AATD laboratory test results was critical for defining cohorts and optimizing model prediction (e.g., respiratory comorbidities, and calcium, glucose, hemoglobin, and bilirubin levels). The final model yielded high predictive power, with an area under the receiver operating characteristic curve of 0.9.

Conclusion: Predictive modeling using real-world data is a sound approach for assessing AATD risk and useful for identifying COPD patients who should be confirmed by genetic testing. External validation is warranted to further assess the generalizability of these results.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.70
自引率
8.30%
发文量
45
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信