Predicting epidermal growth factor receptor (EGFR) mutation status in non-small cell lung cancer (NSCLC) patients through logistic regression: a model incorporating clinical characteristics, computed tomography (CT) imaging features, and tumor marker levels.
{"title":"Predicting epidermal growth factor receptor (EGFR) mutation status in non-small cell lung cancer (NSCLC) patients through logistic regression: a model incorporating clinical characteristics, computed tomography (CT) imaging features, and tumor marker levels.","authors":"Jimin Hao, Man Liu, Zhigang Zhou, Chunling Zhao, Liping Dai, Songyun Ouyang","doi":"10.7717/peerj.18618","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Approximately 60% of Asian populations with non-small cell lung cancer (NSCLC) harbor epidermal growth factor receptor (EGFR) gene mutations, marking it as a pivotal target for genotype-directed therapies. Currently, determining EGFR mutation status relies on DNA sequencing of histological or cytological specimens. This study presents a predictive model integrating clinical parameters, computed tomography (CT) characteristics, and serum tumor markers to forecast EGFR mutation status in NSCLC patients.</p><p><strong>Methods: </strong>Retrospective data collection was conducted on NSCLC patients diagnosed between January 2018 and June 2019 at the First Affiliated Hospital of Zhengzhou University, with available molecular pathology results. Clinical information, CT imaging features, and serum tumor marker levels were compiled. Four distinct models were employed in constructing the diagnostic model. Model diagnostic efficacy was assessed through receiver operating characteristic (ROC) area under the curve (AUC) values and calibration curves. DeLong's test was administered to validate model robustness.</p><p><strong>Results: </strong>Our study encompassed 748 participants. Logistic regression modeling, trained with the aforementioned variables, demonstrated remarkable predictive capability, achieving an AUC of 0.805 (95% confidence interval (CI) [0.766-0.844]) in the primary cohort and 0.753 (95% CI [0.687-0.818]) in the validation cohort. Calibration plots suggested a favorable fit of the model to the data.</p><p><strong>Conclusions: </strong>The developed logistic regression model emerges as a promising tool for forecasting EGFR mutation status. It holds potential to aid clinicians in more precisely identifying patients likely to benefit from EGFR molecular testing and facilitating targeted therapy decision-making, particularly in scenarios where molecular testing is impractical or inaccessible.</p>","PeriodicalId":19799,"journal":{"name":"PeerJ","volume":"12 ","pages":"e18618"},"PeriodicalIF":2.3000,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623057/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7717/peerj.18618","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Approximately 60% of Asian populations with non-small cell lung cancer (NSCLC) harbor epidermal growth factor receptor (EGFR) gene mutations, marking it as a pivotal target for genotype-directed therapies. Currently, determining EGFR mutation status relies on DNA sequencing of histological or cytological specimens. This study presents a predictive model integrating clinical parameters, computed tomography (CT) characteristics, and serum tumor markers to forecast EGFR mutation status in NSCLC patients.
Methods: Retrospective data collection was conducted on NSCLC patients diagnosed between January 2018 and June 2019 at the First Affiliated Hospital of Zhengzhou University, with available molecular pathology results. Clinical information, CT imaging features, and serum tumor marker levels were compiled. Four distinct models were employed in constructing the diagnostic model. Model diagnostic efficacy was assessed through receiver operating characteristic (ROC) area under the curve (AUC) values and calibration curves. DeLong's test was administered to validate model robustness.
Results: Our study encompassed 748 participants. Logistic regression modeling, trained with the aforementioned variables, demonstrated remarkable predictive capability, achieving an AUC of 0.805 (95% confidence interval (CI) [0.766-0.844]) in the primary cohort and 0.753 (95% CI [0.687-0.818]) in the validation cohort. Calibration plots suggested a favorable fit of the model to the data.
Conclusions: The developed logistic regression model emerges as a promising tool for forecasting EGFR mutation status. It holds potential to aid clinicians in more precisely identifying patients likely to benefit from EGFR molecular testing and facilitating targeted therapy decision-making, particularly in scenarios where molecular testing is impractical or inaccessible.
期刊介绍:
PeerJ is an open access peer-reviewed scientific journal covering research in the biological and medical sciences. At PeerJ, authors take out a lifetime publication plan (for as little as $99) which allows them to publish articles in the journal for free, forever. PeerJ has 5 Nobel Prize Winners on the Board; they have won several industry and media awards; and they are widely recognized as being one of the most interesting recent developments in academic publishing.