{"title":"基于pet的多中心肺癌研究放射学分析及特征域协调的影响。","authors":"Pooja Dwivedi, Sagar Barage, Rajshri Singh, Ashish Jha, Sayak Choudhury, Archi Agrawal, Venkatesh Rangarajan","doi":"10.1007/s13246-025-01625-y","DOIUrl":null,"url":null,"abstract":"<p><p>Radiomic biomarkers have demonstrated significant potential in non-invasively assessing tumor biology and providing essential insights for precision medicine. However, the clinical translation is often hindered by challenges in multicenter studies, primarily due to a lack of standardization, such as variations in scanner models, acquisition protocols, reconstruction techniques, etc. This study aims to assess the impact of various harmonization methods in multicenter, 18 F-FDG PET-based radiomics for the classification of lung cancer histological subtypes using a machine learning model. Retrospective data included 178 lung cancer cohorts, comprising 117 adenocarcinomas and 61 squamous cell carcinomas from three different centers. PET DICOM image data was preprocessed with 3D ROI segmentation of the lung tumor and healthy liver, followed by the extraction of 111 radiomic features. Subsequently, Z-Score, Quantile, and ComBat were applied to generate three different harmonized datasets. Feature distribution was analyzed, and the top ten features were selected using recursive feature elimination. An eXtreme gradient boosting model was built on each dataset, and performance was assessed using accuracy, precision, sensitivity, specificity, and AUC with a 95% confidence interval. Variations in radiomic feature distribution and feature selection were observed after applying different harmonization methods. During validation of the trained model, AUC improved from 0.556 [95% CI 0.551-0.563] in the unharmonized data to 0.719 [95% CI 0.710-0.720], 0.952 [95% CI 0.951-0.954], and 0.996 [95% CI 0.995-0.996] in Z-Score, Quantile, and ComBat harmonized data, respectively, for classifying adenocarcinoma and squamous cell carcinoma subtypes. The study indicates that feature selection was affected by the different harmonization methods. The ComBat method was shown to significantly enhance the performance of AI-assisted PET radiomics.</p>","PeriodicalId":48490,"journal":{"name":"Physical and Engineering Sciences in Medicine","volume":" ","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PET-based radiomic analysis in multicentre lung cancer study and impact of feature domain harmonization.\",\"authors\":\"Pooja Dwivedi, Sagar Barage, Rajshri Singh, Ashish Jha, Sayak Choudhury, Archi Agrawal, Venkatesh Rangarajan\",\"doi\":\"10.1007/s13246-025-01625-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Radiomic biomarkers have demonstrated significant potential in non-invasively assessing tumor biology and providing essential insights for precision medicine. However, the clinical translation is often hindered by challenges in multicenter studies, primarily due to a lack of standardization, such as variations in scanner models, acquisition protocols, reconstruction techniques, etc. This study aims to assess the impact of various harmonization methods in multicenter, 18 F-FDG PET-based radiomics for the classification of lung cancer histological subtypes using a machine learning model. Retrospective data included 178 lung cancer cohorts, comprising 117 adenocarcinomas and 61 squamous cell carcinomas from three different centers. PET DICOM image data was preprocessed with 3D ROI segmentation of the lung tumor and healthy liver, followed by the extraction of 111 radiomic features. Subsequently, Z-Score, Quantile, and ComBat were applied to generate three different harmonized datasets. Feature distribution was analyzed, and the top ten features were selected using recursive feature elimination. An eXtreme gradient boosting model was built on each dataset, and performance was assessed using accuracy, precision, sensitivity, specificity, and AUC with a 95% confidence interval. Variations in radiomic feature distribution and feature selection were observed after applying different harmonization methods. During validation of the trained model, AUC improved from 0.556 [95% CI 0.551-0.563] in the unharmonized data to 0.719 [95% CI 0.710-0.720], 0.952 [95% CI 0.951-0.954], and 0.996 [95% CI 0.995-0.996] in Z-Score, Quantile, and ComBat harmonized data, respectively, for classifying adenocarcinoma and squamous cell carcinoma subtypes. The study indicates that feature selection was affected by the different harmonization methods. The ComBat method was shown to significantly enhance the performance of AI-assisted PET radiomics.</p>\",\"PeriodicalId\":48490,\"journal\":{\"name\":\"Physical and Engineering Sciences in Medicine\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physical and Engineering Sciences in Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s13246-025-01625-y\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical and Engineering Sciences in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13246-025-01625-y","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
摘要
放射组学生物标志物在非侵入性评估肿瘤生物学和为精准医学提供重要见解方面显示出巨大的潜力。然而,临床翻译常常受到多中心研究挑战的阻碍,主要是由于缺乏标准化,例如扫描仪模型、获取协议、重建技术等方面的差异。本研究旨在利用机器学习模型评估各种协调方法在多中心、18 F-FDG pet为基础的放射组学中对肺癌组织学亚型分类的影响。回顾性数据包括178个肺癌队列,包括来自三个不同中心的117个腺癌和61个鳞状细胞癌。对PET DICOM图像数据进行预处理,对肺肿瘤和健康肝脏进行三维ROI分割,提取111个放射学特征。随后,Z-Score, Quantile和ComBat被应用于生成三个不同的协调数据集。对特征分布进行分析,采用递归特征消去法筛选出最优的10个特征。在每个数据集上建立一个eXtreme梯度增强模型,并使用准确度、精度、灵敏度、特异性和AUC(95%置信区间)对性能进行评估。采用不同的调和方法,观察到放射学特征分布和特征选择的变化。在训练模型的验证过程中,用于分类腺癌和鳞状细胞癌亚型的Z-Score、Quantile和ComBat协调数据的AUC分别从未协调数据中的0.556 [95% CI 0.551-0.563]提高到0.719 [95% CI 0.710-0.720]、0.952 [95% CI 0.951-0.954]和0.996 [95% CI 0.995-0.996]。研究表明,不同的协调方法会影响特征选择。战斗方法被证明可以显著提高人工智能辅助PET放射组学的性能。
PET-based radiomic analysis in multicentre lung cancer study and impact of feature domain harmonization.
Radiomic biomarkers have demonstrated significant potential in non-invasively assessing tumor biology and providing essential insights for precision medicine. However, the clinical translation is often hindered by challenges in multicenter studies, primarily due to a lack of standardization, such as variations in scanner models, acquisition protocols, reconstruction techniques, etc. This study aims to assess the impact of various harmonization methods in multicenter, 18 F-FDG PET-based radiomics for the classification of lung cancer histological subtypes using a machine learning model. Retrospective data included 178 lung cancer cohorts, comprising 117 adenocarcinomas and 61 squamous cell carcinomas from three different centers. PET DICOM image data was preprocessed with 3D ROI segmentation of the lung tumor and healthy liver, followed by the extraction of 111 radiomic features. Subsequently, Z-Score, Quantile, and ComBat were applied to generate three different harmonized datasets. Feature distribution was analyzed, and the top ten features were selected using recursive feature elimination. An eXtreme gradient boosting model was built on each dataset, and performance was assessed using accuracy, precision, sensitivity, specificity, and AUC with a 95% confidence interval. Variations in radiomic feature distribution and feature selection were observed after applying different harmonization methods. During validation of the trained model, AUC improved from 0.556 [95% CI 0.551-0.563] in the unharmonized data to 0.719 [95% CI 0.710-0.720], 0.952 [95% CI 0.951-0.954], and 0.996 [95% CI 0.995-0.996] in Z-Score, Quantile, and ComBat harmonized data, respectively, for classifying adenocarcinoma and squamous cell carcinoma subtypes. The study indicates that feature selection was affected by the different harmonization methods. The ComBat method was shown to significantly enhance the performance of AI-assisted PET radiomics.