Semi-supervised pre-training based multi-task network for thyroid-associated ophthalmopathy classification

IF 3.7 2区工程技术 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Displays Pub Date : 2025-02-01 DOI:10.1016/j.displa.2025.102974

MingFei Yang , TianFeng Zhang , XueFei Song , YuZhong Zhang , Lei Zhou

{"title":"Semi-supervised pre-training based multi-task network for thyroid-associated ophthalmopathy classification","authors":"MingFei Yang , TianFeng Zhang , XueFei Song , YuZhong Zhang , Lei Zhou","doi":"10.1016/j.displa.2025.102974","DOIUrl":null,"url":null,"abstract":"<div><div>Thyroid-associated ophthalmopathy (TAO) is a blinding autoimmune disorder, and early diagnosis is crucial in preventing vision loss. Orbital CT imaging has emerged as a valuable tool for diagnosing and screening TAO. Radiomic is currently the most dominant technique for TAO diagnosis, however it is costly due to the need for manual image labeling by medical professionals. Convolutional Neural Network (CNN) is another promising technique for TAO diagnosis. However, the performance of CNN based classification may degrade due to the limited size of collected data or the complexity of designed model. Utilizing pretraining model is a crucial technique for boosting the performance of CNN based TAO classification. Therefore, a novel semi-supervised pretraining based multi-task network for TAO classification is proposed in this paper. Firstly, a multi-task network is designed, which consists of an encoder, a classification branch and two segmentation decoder. Then, the multi-task network is pretrained by minimizing the prediction difference between two segmentation decoders through a semi-supervised way. In this way, the pseudo voxel-level supervision can be generated for the unlabeled images. Finally, the encoder and one light-weighted decoder can be initialized by the pretrained weights, and then they are jointly optimized for TAO classification with the classification branch through multi-task learning. Our proposed network model was comprehensively evaluated on a private dataset which consists of 982 orbital CT scans for TAO diagnosis. We also tested the classification generalization performance using an external dataset. The experimental results demonstrate that our model significantly improves the classification performance when compared with current SOTA methods. The source code is publically available at <span><span>https://github.com/VLAD-KONATA/TAO_CT</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"87 ","pages":"Article 102974"},"PeriodicalIF":3.7000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225000113","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

Abstract

Thyroid-associated ophthalmopathy (TAO) is a blinding autoimmune disorder, and early diagnosis is crucial in preventing vision loss. Orbital CT imaging has emerged as a valuable tool for diagnosing and screening TAO. Radiomic is currently the most dominant technique for TAO diagnosis, however it is costly due to the need for manual image labeling by medical professionals. Convolutional Neural Network (CNN) is another promising technique for TAO diagnosis. However, the performance of CNN based classification may degrade due to the limited size of collected data or the complexity of designed model. Utilizing pretraining model is a crucial technique for boosting the performance of CNN based TAO classification. Therefore, a novel semi-supervised pretraining based multi-task network for TAO classification is proposed in this paper. Firstly, a multi-task network is designed, which consists of an encoder, a classification branch and two segmentation decoder. Then, the multi-task network is pretrained by minimizing the prediction difference between two segmentation decoders through a semi-supervised way. In this way, the pseudo voxel-level supervision can be generated for the unlabeled images. Finally, the encoder and one light-weighted decoder can be initialized by the pretrained weights, and then they are jointly optimized for TAO classification with the classification branch through multi-task learning. Our proposed network model was comprehensively evaluated on a private dataset which consists of 982 orbital CT scans for TAO diagnosis. We also tested the classification generalization performance using an external dataset. The experimental results demonstrate that our model significantly improves the classification performance when compared with current SOTA methods. The source code is publically available at https://github.com/VLAD-KONATA/TAO_CT.

查看原文本刊更多论文

基于半监督预训练的甲状腺相关眼病分类多任务网络

甲状腺相关性眼病（TAO）是一种致盲性自身免疫性疾病，早期诊断对预防视力丧失至关重要。眼眶CT成像已成为诊断和筛查TAO的重要工具。放射组学是目前最主要的TAO诊断技术，但由于需要医疗专业人员手动标记图像，因此成本高昂。卷积神经网络（CNN）是另一种很有前途的TAO诊断技术。然而，由于收集数据的规模有限或设计模型的复杂性，基于CNN的分类性能可能会下降。利用预训练模型是提高基于CNN的TAO分类性能的关键技术。为此，本文提出了一种新的基于半监督预训练的多任务TAO分类网络。首先，设计了一个多任务网络，该网络由一个编码器、一个分类分支和两个分段解码器组成。然后，通过半监督的方式最小化两个分割解码器之间的预测差，对多任务网络进行预训练。通过这种方法，可以对未标记的图像生成伪体素级监督。最后，利用预训练的权值对编码器和一个轻量级解码器进行初始化，然后通过多任务学习，与分类分支一起对编码器和一个轻量级解码器进行TAO分类优化。我们提出的网络模型在一个私人数据集上进行了全面评估，该数据集由982个眼眶CT扫描组成，用于TAO诊断。我们还使用外部数据集测试了分类泛化性能。实验结果表明，与现有的SOTA方法相比，我们的模型显著提高了分类性能。源代码可在https://github.com/VLAD-KONATA/TAO_CT上公开获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Displays 工程技术-工程：电子与电气

CiteScore

4.60

自引率

25.60%

发文量

138

审稿时长

92 days

期刊介绍： Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface. Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.