Ullas Batra , Shrinidhi Nathany , Swarsat Kaushik Nath , Joslia T. Jose , Trapti Sharma , Preeti P , Sunil Pasricha , Mansi Sharma , Nevidita Arambam , Vrinda Khanna , Abhishek Bansal , Anurag Mehta , Kamal Rawal
{"title":"AI-based pipeline for early screening of lung cancer: integrating radiology, clinical, and genomics data","authors":"Ullas Batra , Shrinidhi Nathany , Swarsat Kaushik Nath , Joslia T. Jose , Trapti Sharma , Preeti P , Sunil Pasricha , Mansi Sharma , Nevidita Arambam , Vrinda Khanna , Abhishek Bansal , Anurag Mehta , Kamal Rawal","doi":"10.1016/j.lansea.2024.100352","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>The prognosis of lung carcinoma has changed since the discovery of molecular targets and their specific drugs. Somatic Epidermal Growth Factor Receptor (<em>EGFR</em>) mutations have been reported in lung carcinoma, and these mutant proteins act as substrates for targeted therapies. However, in a resource-constrained country like India, panel-based next-generation sequencing cannot be made available to the population at large. Additional challenges such as adequacy of tissue in case of lung core biopsies and locating suitable tumour tissues as a result of innate intratumoral heterogeneity indicate the necessity of an AI-based end-to-end pipeline capable of automatically detecting and learning more effective lung nodule features from CT images and predicting the probability of the <em>EGFR</em>-mutant. This will help the oncologists and patients in resource-limited settings to achieve near-optimal care and appropriate therapy.</p></div><div><h3>Methods</h3><p>The <em>EGFR</em> gene sequencing and CT imaging data of 2277 patients with lung carcinoma were included from three cohorts in India and a White population cohort collected from TCIA. Another cohort LIDC-IDRI was used to train the AIPS-Nodule (AIPS-N) model for automatic detection and characterisation of lung nodules. We explored the value of combining the results of the AIPS-N with the clinical factors in the AIPS-Mutation (AIPS-M) model for predicting <em>EGFR</em> genotype, and it was evaluated by area under the curve (AUC).</p></div><div><h3>Findings</h3><p>AIPS-N achieved an average AP50 of 70.19% in detecting the location of nodules within the lung region of interest during validation and predicted the score of five lung nodule properties. The AIPS-M machine learning (ML) and deep learning (DL) models achieved AUCs ranging from 0.587 to 0.910.</p></div><div><h3>Interpretation</h3><p>The AIPS suggests that CT imaging combined with a fully automated lung-nodule analysis AI system can predict <em>EGFR</em> genotype and identify patients with an <em>EGFR</em> mutation in a cost-effective and non-invasive manner.</p></div><div><h3>Funding</h3><p>This work was supported by a grant provided by <span>Conquer Cancer Foundation of ASCO</span> [<span>2021IIG-5555960128</span>] and <span>Pfizer Products India Pvt. Ltd</span>.</p></div>","PeriodicalId":75136,"journal":{"name":"The Lancet regional health. Southeast Asia","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772368224000027/pdfft?md5=e281698cbcb3d165c45528f425e086c3&pid=1-s2.0-S2772368224000027-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Lancet regional health. Southeast Asia","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772368224000027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Background
The prognosis of lung carcinoma has changed since the discovery of molecular targets and their specific drugs. Somatic Epidermal Growth Factor Receptor (EGFR) mutations have been reported in lung carcinoma, and these mutant proteins act as substrates for targeted therapies. However, in a resource-constrained country like India, panel-based next-generation sequencing cannot be made available to the population at large. Additional challenges such as adequacy of tissue in case of lung core biopsies and locating suitable tumour tissues as a result of innate intratumoral heterogeneity indicate the necessity of an AI-based end-to-end pipeline capable of automatically detecting and learning more effective lung nodule features from CT images and predicting the probability of the EGFR-mutant. This will help the oncologists and patients in resource-limited settings to achieve near-optimal care and appropriate therapy.
Methods
The EGFR gene sequencing and CT imaging data of 2277 patients with lung carcinoma were included from three cohorts in India and a White population cohort collected from TCIA. Another cohort LIDC-IDRI was used to train the AIPS-Nodule (AIPS-N) model for automatic detection and characterisation of lung nodules. We explored the value of combining the results of the AIPS-N with the clinical factors in the AIPS-Mutation (AIPS-M) model for predicting EGFR genotype, and it was evaluated by area under the curve (AUC).
Findings
AIPS-N achieved an average AP50 of 70.19% in detecting the location of nodules within the lung region of interest during validation and predicted the score of five lung nodule properties. The AIPS-M machine learning (ML) and deep learning (DL) models achieved AUCs ranging from 0.587 to 0.910.
Interpretation
The AIPS suggests that CT imaging combined with a fully automated lung-nodule analysis AI system can predict EGFR genotype and identify patients with an EGFR mutation in a cost-effective and non-invasive manner.
Funding
This work was supported by a grant provided by Conquer Cancer Foundation of ASCO [2021IIG-5555960128] and Pfizer Products India Pvt. Ltd.