基于变压器的跨模态多对比网络眼科疾病诊断

IF 6.6 2区医学 Q1 ENGINEERING, BIOMEDICAL

Biocybernetics and Biomedical Engineering Pub Date : 2023-07-01 DOI:10.1016/j.bbe.2023.06.001

Yang Yu, Hongqing Zhu

{"title":"基于变压器的跨模态多对比网络眼科疾病诊断","authors":"Yang Yu, Hongqing Zhu","doi":"10.1016/j.bbe.2023.06.001","DOIUrl":null,"url":null,"abstract":"<div><p><span><span>Automatic diagnosis of various ophthalmic diseases from ocular medical images is vital to support clinical decisions. Most current methods employ a single </span>imaging modality<span>, especially 2D fundus images. Considering that the diagnosis of ophthalmic diseases can greatly benefit from multiple imaging modalities, this paper further improves the accuracy of diagnosis by effectively utilizing cross-modal data. In this paper, we propose Transformer-based cross-modal multi-contrast network for efficiently fusing color fundus photograph (CFP) and optical coherence tomography (OCT) modality to diagnose ophthalmic diseases. We design multi-contrast learning strategy to extract discriminate features from cross-modal data for diagnosis. Then channel fusion head captures the semantically shared information across different modalities and the similarity features between patients of the same category. Meanwhile, we use a class-balanced training strategy to cope with the situation that medical datasets are usually class-imbalanced. Our method is evaluated on public benchmark datasets for cross-modal ophthalmic disease diagnosis. The experimental results demonstrate that our method outperforms other approaches. The codes and models are available at </span></span><span>https://github.com/ecustyy/tcmn</span><svg><path></path></svg>.</p></div>","PeriodicalId":55381,"journal":{"name":"Biocybernetics and Biomedical Engineering","volume":"43 3","pages":"Pages 507-527"},"PeriodicalIF":6.6000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transformer-based cross-modal multi-contrast network for ophthalmic diseases diagnosis\",\"authors\":\"Yang Yu, Hongqing Zhu\",\"doi\":\"10.1016/j.bbe.2023.06.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p><span><span>Automatic diagnosis of various ophthalmic diseases from ocular medical images is vital to support clinical decisions. Most current methods employ a single </span>imaging modality<span>, especially 2D fundus images. Considering that the diagnosis of ophthalmic diseases can greatly benefit from multiple imaging modalities, this paper further improves the accuracy of diagnosis by effectively utilizing cross-modal data. In this paper, we propose Transformer-based cross-modal multi-contrast network for efficiently fusing color fundus photograph (CFP) and optical coherence tomography (OCT) modality to diagnose ophthalmic diseases. We design multi-contrast learning strategy to extract discriminate features from cross-modal data for diagnosis. Then channel fusion head captures the semantically shared information across different modalities and the similarity features between patients of the same category. Meanwhile, we use a class-balanced training strategy to cope with the situation that medical datasets are usually class-imbalanced. Our method is evaluated on public benchmark datasets for cross-modal ophthalmic disease diagnosis. The experimental results demonstrate that our method outperforms other approaches. The codes and models are available at </span></span><span>https://github.com/ecustyy/tcmn</span><svg><path></path></svg>.</p></div>\",\"PeriodicalId\":55381,\"journal\":{\"name\":\"Biocybernetics and Biomedical Engineering\",\"volume\":\"43 3\",\"pages\":\"Pages 507-527\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biocybernetics and Biomedical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0208521623000359\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biocybernetics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0208521623000359","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

从眼部医学图像中自动诊断各种眼部疾病对支持临床决策至关重要。目前大多数方法采用单一成像方式，特别是二维眼底图像。考虑到多种成像模式对眼科疾病的诊断大有裨益，本文通过有效利用跨模式数据进一步提高了诊断的准确性。本文提出了一种基于transformer的跨模态多对比网络，用于有效融合彩色眼底照片(CFP)和光学相干断层扫描(OCT)模式来诊断眼科疾病。我们设计了多对比学习策略，从跨模态数据中提取判别特征进行诊断。然后，通道融合头捕获不同模式之间的语义共享信息和同一类别患者之间的相似特征。同时，我们采用类平衡训练策略来应对医疗数据集通常是类不平衡的情况。我们的方法在跨模态眼病诊断的公共基准数据集上进行了评估。实验结果表明，该方法优于其他方法。代码和模型可在https://github.com/ecustyy/tcmn上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Transformer-based cross-modal multi-contrast network for ophthalmic diseases diagnosis

Automatic diagnosis of various ophthalmic diseases from ocular medical images is vital to support clinical decisions. Most current methods employ a single imaging modality, especially 2D fundus images. Considering that the diagnosis of ophthalmic diseases can greatly benefit from multiple imaging modalities, this paper further improves the accuracy of diagnosis by effectively utilizing cross-modal data. In this paper, we propose Transformer-based cross-modal multi-contrast network for efficiently fusing color fundus photograph (CFP) and optical coherence tomography (OCT) modality to diagnose ophthalmic diseases. We design multi-contrast learning strategy to extract discriminate features from cross-modal data for diagnosis. Then channel fusion head captures the semantically shared information across different modalities and the similarity features between patients of the same category. Meanwhile, we use a class-balanced training strategy to cope with the situation that medical datasets are usually class-imbalanced. Our method is evaluated on public benchmark datasets for cross-modal ophthalmic disease diagnosis. The experimental results demonstrate that our method outperforms other approaches. The codes and models are available at https://github.com/ecustyy/tcmn.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Biocybernetics and Biomedical Engineering ENGINEERING, BIOMEDICAL-

CiteScore

16.50

自引率

6.20%

发文量

审稿时长

38 days

期刊介绍： Biocybernetics and Biomedical Engineering is a quarterly journal, founded in 1981, devoted to publishing the results of original, innovative and creative research investigations in the field of Biocybernetics and biomedical engineering, which bridges mathematical, physical, chemical and engineering methods and technology to analyse physiological processes in living organisms as well as to develop methods, devices and systems used in biology and medicine, mainly in medical diagnosis, monitoring systems and therapy. The Journal''s mission is to advance scientific discovery into new or improved standards of care, and promotion a wide-ranging exchange between science and its application to humans.