Daniel N Pfeffer, Rahul Dhakne, Omnya El Massad, Pulkit Sehgal, Thomas Ardiles, Michael O Calloway, M Chris Runken, Charlie Strange
{"title":"提高慢性阻塞性肺病患者α -1抗胰蛋白酶缺乏症的可能性:一种使用现实世界数据的新型预测模型","authors":"Daniel N Pfeffer, Rahul Dhakne, Omnya El Massad, Pulkit Sehgal, Thomas Ardiles, Michael O Calloway, M Chris Runken, Charlie Strange","doi":"10.15326/jcopdf.2023.0491","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Despite guideline recommendations, most patients with chronic obstructive pulmonary disease (COPD) do not undergo alpha-1 antitrypsin deficiency (AATD) testing and approximately 90% of people with AATD in the United States remain undiagnosed. This study sought to develop a predictive model using real-world data to improve detection of AATD-positive patients in the general COPD population.</p><p><strong>Methods: </strong>A predictive model using extreme gradient boosting was developed using the EVERSANA database, including longitudinal, patient-level medical claims, prescription claims, AATD-specific testing data, and electronic health records (EHR). The model was trained and then validated to predict AATD-positive status. Patients were coded as AATD positive based on the presence of any of the following criteria: (1) ≥2 AATD diagnosis codes in claims; (2) an AATD diagnosis code in the EHR; (3) a positive laboratory test for AATD; or (4) use of AATD-related medication. Over 500 variables were used to train the predictive model and >20 models were run to optimize the predictive power.</p><p><strong>Results: </strong>A total of 13,585 AATD-positive patients and 7796 AATD-negative patients were included in the model. The inclusion of non-AATD laboratory test results was critical for defining cohorts and optimizing model prediction (e.g., respiratory comorbidities, and calcium, glucose, hemoglobin, and bilirubin levels). The final model yielded high predictive power, with an area under the receiver operating characteristic curve of 0.9.</p><p><strong>Conclusion: </strong>Predictive modeling using real-world data is a sound approach for assessing AATD risk and useful for identifying COPD patients who should be confirmed by genetic testing. External validation is warranted to further assess the generalizability of these results.</p>","PeriodicalId":51340,"journal":{"name":"Chronic Obstructive Pulmonary Diseases-Journal of the Copd Foundation","volume":" ","pages":"1-11"},"PeriodicalIF":2.3000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving the Likelihood of Identifying Alpha-1 Antitrypsin Deficiency Among Patients With COPD: A Novel Predictive Model Using Real-World Data.\",\"authors\":\"Daniel N Pfeffer, Rahul Dhakne, Omnya El Massad, Pulkit Sehgal, Thomas Ardiles, Michael O Calloway, M Chris Runken, Charlie Strange\",\"doi\":\"10.15326/jcopdf.2023.0491\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Despite guideline recommendations, most patients with chronic obstructive pulmonary disease (COPD) do not undergo alpha-1 antitrypsin deficiency (AATD) testing and approximately 90% of people with AATD in the United States remain undiagnosed. This study sought to develop a predictive model using real-world data to improve detection of AATD-positive patients in the general COPD population.</p><p><strong>Methods: </strong>A predictive model using extreme gradient boosting was developed using the EVERSANA database, including longitudinal, patient-level medical claims, prescription claims, AATD-specific testing data, and electronic health records (EHR). The model was trained and then validated to predict AATD-positive status. Patients were coded as AATD positive based on the presence of any of the following criteria: (1) ≥2 AATD diagnosis codes in claims; (2) an AATD diagnosis code in the EHR; (3) a positive laboratory test for AATD; or (4) use of AATD-related medication. Over 500 variables were used to train the predictive model and >20 models were run to optimize the predictive power.</p><p><strong>Results: </strong>A total of 13,585 AATD-positive patients and 7796 AATD-negative patients were included in the model. The inclusion of non-AATD laboratory test results was critical for defining cohorts and optimizing model prediction (e.g., respiratory comorbidities, and calcium, glucose, hemoglobin, and bilirubin levels). The final model yielded high predictive power, with an area under the receiver operating characteristic curve of 0.9.</p><p><strong>Conclusion: </strong>Predictive modeling using real-world data is a sound approach for assessing AATD risk and useful for identifying COPD patients who should be confirmed by genetic testing. External validation is warranted to further assess the generalizability of these results.</p>\",\"PeriodicalId\":51340,\"journal\":{\"name\":\"Chronic Obstructive Pulmonary Diseases-Journal of the Copd Foundation\",\"volume\":\" \",\"pages\":\"1-11\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-01-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chronic Obstructive Pulmonary Diseases-Journal of the Copd Foundation\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.15326/jcopdf.2023.0491\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"RESPIRATORY SYSTEM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chronic Obstructive Pulmonary Diseases-Journal of the Copd Foundation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.15326/jcopdf.2023.0491","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
Improving the Likelihood of Identifying Alpha-1 Antitrypsin Deficiency Among Patients With COPD: A Novel Predictive Model Using Real-World Data.
Background: Despite guideline recommendations, most patients with chronic obstructive pulmonary disease (COPD) do not undergo alpha-1 antitrypsin deficiency (AATD) testing and approximately 90% of people with AATD in the United States remain undiagnosed. This study sought to develop a predictive model using real-world data to improve detection of AATD-positive patients in the general COPD population.
Methods: A predictive model using extreme gradient boosting was developed using the EVERSANA database, including longitudinal, patient-level medical claims, prescription claims, AATD-specific testing data, and electronic health records (EHR). The model was trained and then validated to predict AATD-positive status. Patients were coded as AATD positive based on the presence of any of the following criteria: (1) ≥2 AATD diagnosis codes in claims; (2) an AATD diagnosis code in the EHR; (3) a positive laboratory test for AATD; or (4) use of AATD-related medication. Over 500 variables were used to train the predictive model and >20 models were run to optimize the predictive power.
Results: A total of 13,585 AATD-positive patients and 7796 AATD-negative patients were included in the model. The inclusion of non-AATD laboratory test results was critical for defining cohorts and optimizing model prediction (e.g., respiratory comorbidities, and calcium, glucose, hemoglobin, and bilirubin levels). The final model yielded high predictive power, with an area under the receiver operating characteristic curve of 0.9.
Conclusion: Predictive modeling using real-world data is a sound approach for assessing AATD risk and useful for identifying COPD patients who should be confirmed by genetic testing. External validation is warranted to further assess the generalizability of these results.