A multi-task deep learning pipeline integrating vessel segmentation and radiomics for multiclass retinal disease classification

Feng Yan, Yanxia Liu, Qingsong Zhao, Guangguo He

Photodiagnosis and Photodynamic Therapy, Volume 56, Article 105209. Published 2025-09-04. DOI: 10.1016/j.pdpdt.2025.105209

Citations: 0
Abstract
Objective
This study aims to develop a robust, multi-task deep learning framework that integrates vessel segmentation and radiomic analysis for the automated classification of four retinal conditions from fundus images: diabetic retinopathy (DR), hypertensive retinopathy (HR), papilledema, and normal fundus.
Materials and Methods
A total of 2165 patients from eight medical centers were enrolled. Fundus images underwent standardized preprocessing, including histogram equalization, normalization, resizing, and augmentation. Whole-vessel and artery-vein segmentations were performed using five deep learning models: U-Net, Attention U-Net, DeepLabV3+, HRNet, and Swin-Unet. From the segmented vascular maps, 220 radiomic features were extracted using the PyRadiomics and Mahotas toolkits. The arteriovenous ratio (AVR) was also computed from the artery and vein masks. Intraclass correlation coefficient (ICC) analysis was used to assess reproducibility across centers, and features with ICC < 0.75 were excluded. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO), Recursive Feature Elimination (RFE), and Mutual Information (MI). The combined AVR and radiomic features were input into four classifiers: Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), Random Forest (RF), and an Ensemble. Models were trained and validated on stratified splits and externally tested on an independent cohort of 769 patients. Evaluation metrics included accuracy, area under the curve (AUC), recall, and receiver operating characteristic (ROC) analysis.
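The two-stage feature-filtering step described above (an ICC reproducibility cutoff followed by LASSO shrinkage) can be sketched as follows. This is a minimal illustration using scikit-learn, not the authors' code: the variable names, the synthetic data, and the use of a plain `LassoCV` regressor against integer class labels are all simplifying assumptions for demonstration.

```python
import numpy as np
from sklearn.linear_model import LassoCV
from sklearn.preprocessing import StandardScaler

# Hypothetical inputs (illustrative, not from the paper):
# X   - radiomic feature matrix, one row per patient
# y   - class labels encoded as integers 0..3
# icc - per-feature ICC values computed across centers
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 50))
y = rng.integers(0, 4, size=200).astype(float)
icc = rng.uniform(0.5, 1.0, size=50)

# Stage 1: drop features with poor cross-center reproducibility (ICC < 0.75).
keep = icc >= 0.75
X_rep = X[:, keep]

# Stage 2: LASSO shrinks uninformative coefficients to exactly zero;
# the surviving features would feed the downstream classifiers.
X_std = StandardScaler().fit_transform(X_rep)
lasso = LassoCV(cv=5, random_state=0).fit(X_std, y)
selected = np.flatnonzero(lasso.coef_)
print(f"{keep.sum()} reproducible features, {selected.size} retained by LASSO")
```

With random noise as input, LASSO will typically retain few or no features; on real radiomic data the retained subset carries the discriminative signal.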
Results
Swin-Unet outperformed all other models, achieving an external Dice Similarity Coefficient (DSC) of 92.4% for whole-vessel segmentation and 89.8% for artery-vein segmentation. Classification using the LASSO-Ensemble combination achieved a test accuracy of 93.7%, an external test accuracy of 92.3%, and an AUC of 95.2%. AVR estimates were consistent with clinical expectations and contributed significantly to class discrimination.
Conclusion
This multi-task pipeline demonstrates the potential of combining transformer-based segmentation with radiomics for accurate, interpretable retinal disease classification, showing strong generalizability for future clinical applications.
Journal overview:
Photodiagnosis and Photodynamic Therapy is an international journal for the dissemination of scientific knowledge and clinical developments of Photodiagnosis and Photodynamic Therapy in all medical specialties. The journal publishes original articles, review articles, case presentations, "how-to-do-it" articles, Letters to the Editor, short communications and relevant images with short descriptions. All submitted material is subject to a strict peer-review process.