Rafael Parra-Medina, Gabriela Guerron-Gomez, Daniel Mendivelso-González, Javier Hernan Gil-Gómez, Juan Pablo Alzate, Marcela Gomez-Suarez, Jose Fernando Polo, John Jaime Sprockel, Andres Mosquera-Zamudio
{"title":"Deep learning in histopathology images for prediction of oncogenic driver molecular alterations in lung cancer: a systematic review and meta-analysis.","authors":"Rafael Parra-Medina, Gabriela Guerron-Gomez, Daniel Mendivelso-González, Javier Hernan Gil-Gómez, Juan Pablo Alzate, Marcela Gomez-Suarez, Jose Fernando Polo, John Jaime Sprockel, Andres Mosquera-Zamudio","doi":"10.21037/tlcr-2024-1196","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Lung cancer (LC) is the second most diagnosed cancer and the leading cause of cancer mortality worldwide. Non-small cell lung cancer (NSCLC) accounts for 85% of cases, with oncogenic alterations like <i>EGFR, ALK, ROS1</i>, and <i>KRAS</i> guiding targeted therapies. Their prevalence varies by ethnicity, smoking status, and gender. Advances in artificial intelligence (AI) enable molecular biomarker prediction from hematoxylin and eosin-stained whole-slide images (H&E WSIs), offering a non-invasive approach to precision oncology. This review assesses deep learning (DL) models predicting oncogenic drivers in NSCLC from H&E WSIs and their diagnostic accuracy.</p><p><strong>Methods: </strong>A systematic review registered in PROSPERO (CRD42024573602) was conducted in Embase, LILACS, Medline, Web of Science, and Cochrane to identify studies on DL models using H&E slides for LC gene alterations. Only English and Spanish studies were included. Key metrics were extracted for meta-analysis. Studies without LC-specific data, missing essential metrics, or with inconsistent results were excluded.</p><p><strong>Results: </strong>We found evidence that convolutional neural networks (CNNs) were the most common architectures in studies. Also, in the meta-analysis, <i>ALK</i> {sensitivity of 84% [95% confidence interval (CI): 62-95%] and specificity of 85% (95% CI: 55-96%)}, <i>EGFR</i> [80% (95% CI: 72-86%) and specificity of 77% (95% CI: 69-83%)] and <i>TP53</i> [sensitivity and specificity of 70% (95% CI: 65-83%)] were the oncogenic driver molecular alterations that demonstrated the best predictive capability performance.</p><p><strong>Conclusions: </strong>Our results emphasize the potential of these models as screening tools despite H&E WSI.It is necessary to validate these predictive models among diverse populations and clinical outcomes. This approach is crucial and leaves an open door for advances in precision medicine, offering promising avenues for personalized treatment strategies.</p>","PeriodicalId":23271,"journal":{"name":"Translational lung cancer research","volume":"14 5","pages":"1756-1769"},"PeriodicalIF":3.5000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12170222/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational lung cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tlcr-2024-1196","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/21 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Lung cancer (LC) is the second most diagnosed cancer and the leading cause of cancer mortality worldwide. Non-small cell lung cancer (NSCLC) accounts for 85% of cases, with oncogenic alterations like EGFR, ALK, ROS1, and KRAS guiding targeted therapies. Their prevalence varies by ethnicity, smoking status, and gender. Advances in artificial intelligence (AI) enable molecular biomarker prediction from hematoxylin and eosin-stained whole-slide images (H&E WSIs), offering a non-invasive approach to precision oncology. This review assesses deep learning (DL) models predicting oncogenic drivers in NSCLC from H&E WSIs and their diagnostic accuracy.
Methods: A systematic review registered in PROSPERO (CRD42024573602) was conducted in Embase, LILACS, Medline, Web of Science, and Cochrane to identify studies on DL models using H&E slides for LC gene alterations. Only English and Spanish studies were included. Key metrics were extracted for meta-analysis. Studies without LC-specific data, missing essential metrics, or with inconsistent results were excluded.
Results: We found evidence that convolutional neural networks (CNNs) were the most common architectures in studies. Also, in the meta-analysis, ALK {sensitivity of 84% [95% confidence interval (CI): 62-95%] and specificity of 85% (95% CI: 55-96%)}, EGFR [80% (95% CI: 72-86%) and specificity of 77% (95% CI: 69-83%)] and TP53 [sensitivity and specificity of 70% (95% CI: 65-83%)] were the oncogenic driver molecular alterations that demonstrated the best predictive capability performance.
Conclusions: Our results emphasize the potential of these models as screening tools despite H&E WSI.It is necessary to validate these predictive models among diverse populations and clinical outcomes. This approach is crucial and leaves an open door for advances in precision medicine, offering promising avenues for personalized treatment strategies.
期刊介绍:
Translational Lung Cancer Research(TLCR, Transl Lung Cancer Res, Print ISSN 2218-6751; Online ISSN 2226-4477) is an international, peer-reviewed, open-access journal, which was founded in March 2012. TLCR is indexed by PubMed/PubMed Central and the Chemical Abstracts Service (CAS) Databases. It is published quarterly the first year, and published bimonthly since February 2013. It provides practical up-to-date information on prevention, early detection, diagnosis, and treatment of lung cancer. Specific areas of its interest include, but not limited to, multimodality therapy, markers, imaging, tumor biology, pathology, chemoprevention, and technical advances related to lung cancer.