{"title":"基于卷积神经网络的视网膜疾病多模态分类。","authors":"Hongyi Pan, Jingpeng Miao, Jie Yu, Jingmin Li, Xiaobing Wang, Jihong Feng","doi":"10.1088/2057-1976/adeb92","DOIUrl":null,"url":null,"abstract":"<p><p>Retinal diseases such as age-related macular degeneration and diabetic retinopathy will lead to irreversible blindness without timely diagnosis and treatment. Optical coherence tomography (OCT) and optical coherence tomography angiography (OCTA) images provide complementary views of the retina, and the integration of the two imaging modalities can improve the accuracy of retinal disease classification. We propose a multi-modal classification model consisting of two branches to automatically diagnose retinal diseases, in which OCT and OCTA images are efficiently integrated to improve both the accuracy and efficiency of disease diagnosis. A bright line cropping is used to remove the useless black edge region while preserving the lesion features and reducing the calculation load. To solve the insufficient data issue, data enhancement and loose matching methods are adopted to increase the data amount. A two-step training method is used to train our proposed model, alleviating the limited training images. Our model is tested on an external test set instead of a training set, making the classification results more rigorous. The intermediate fusion and two-step training methods are adopted in our multiple classification model, achieving 0.9667, 0.9418, 0.8569, 0.9422, and 0.8921 in average accuracy, precision, recall, specificity, and F1-Score, respectively.
Our multi-modal model outperforms the single-modal model, the early, and late fusion multi-modal model in accuracy. Our model offers doctors less human error, lower cost, more uniform, and effective mass screening, thus providing a solution to improve deep learning performance in terms of a relatively fewer number of training data and even more imbalanced classes.
.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-modal Classification of Retinal Disease Based On Convolutional Neural Network.\",\"authors\":\"Hongyi Pan, Jingpeng Miao, Jie Yu, Jingmin Li, Xiaobing Wang, Jihong Feng\",\"doi\":\"10.1088/2057-1976/adeb92\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Retinal diseases such as age-related macular degeneration and diabetic retinopathy will lead to irreversible blindness without timely diagnosis and treatment. Optical coherence tomography (OCT) and optical coherence tomography angiography (OCTA) images provide complementary views of the retina, and the integration of the two imaging modalities can improve the accuracy of retinal disease classification. We propose a multi-modal classification model consisting of two branches to automatically diagnose retinal diseases, in which OCT and OCTA images are efficiently integrated to improve both the accuracy and efficiency of disease diagnosis. A bright line cropping is used to remove the useless black edge region while preserving the lesion features and reducing the calculation load. To solve the insufficient data issue, data enhancement and loose matching methods are adopted to increase the data amount. A two-step training method is used to train our proposed model, alleviating the limited training images. Our model is tested on an external test set instead of a training set, making the classification results more rigorous. The intermediate fusion and two-step training methods are adopted in our multiple classification model, achieving 0.9667, 0.9418, 0.8569, 0.9422, and 0.8921 in average accuracy, precision, recall, specificity, and F1-Score, respectively.
Our multi-modal model outperforms the single-modal model, the early, and late fusion multi-modal model in accuracy. Our model offers doctors less human error, lower cost, more uniform, and effective mass screening, thus providing a solution to improve deep learning performance in terms of a relatively fewer number of training data and even more imbalanced classes.
.</p>\",\"PeriodicalId\":8896,\"journal\":{\"name\":\"Biomedical Physics & Engineering Express\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Physics & Engineering Express\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1088/2057-1976/adeb92\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Physics & Engineering Express","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2057-1976/adeb92","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Multi-modal Classification of Retinal Disease Based On Convolutional Neural Network.
Retinal diseases such as age-related macular degeneration and diabetic retinopathy can lead to irreversible blindness without timely diagnosis and treatment. Optical coherence tomography (OCT) and optical coherence tomography angiography (OCTA) images provide complementary views of the retina, and integrating the two imaging modalities can improve the accuracy of retinal disease classification. We propose a two-branch multi-modal classification model that automatically diagnoses retinal diseases by efficiently integrating OCT and OCTA images, improving both the accuracy and the efficiency of diagnosis. A bright-line cropping step removes the uninformative black edge regions while preserving lesion features and reducing the computational load. To address the shortage of training data, data augmentation and loose matching are adopted to increase the amount of data. A two-step training strategy is used to train the proposed model, mitigating the effect of the limited number of training images. The model is evaluated on an external test set rather than on the training set, making the classification results more rigorous. Using intermediate fusion and two-step training, our multi-class model achieves an average accuracy, precision, recall, specificity, and F1-score of 0.9667, 0.9418, 0.8569, 0.9422, and 0.8921, respectively.
Our multi-modal model outperforms the single-modal model and the early- and late-fusion multi-modal models in accuracy. It offers clinicians mass screening with less human error, lower cost, and more uniform and effective results, and thus provides a way to improve deep-learning performance when the training data are relatively scarce and the classes are imbalanced.
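To make the fusion strategy concrete, the sketch below illustrates the kind of two-branch, intermediate-fusion design the abstract describes: each modality (OCT, OCTA) is encoded by its own convolutional branch, the two feature vectors are concatenated at an intermediate level, and a shared head produces the class prediction. The layer sizes, input resolution, and the `num_classes` value are assumptions for illustration only; the abstract does not specify the authors' actual architecture.

```python
# Minimal sketch of a two-branch CNN with intermediate (feature-level) fusion
# of OCT and OCTA images. Layer sizes and class count are assumptions, not
# the architecture reported in the paper.
import torch
import torch.nn as nn


def make_branch() -> nn.Sequential:
    """A small convolutional encoder applied to one imaging modality."""
    return nn.Sequential(
        nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
        nn.MaxPool2d(2),
        nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
        nn.MaxPool2d(2),
        nn.AdaptiveAvgPool2d(1),   # -> (N, 32, 1, 1)
        nn.Flatten(),              # -> (N, 32)
    )


class IntermediateFusionNet(nn.Module):
    """Two modality-specific branches whose features are concatenated
    (intermediate fusion) and passed to a shared classification head."""

    def __init__(self, num_classes: int = 4):  # num_classes is hypothetical
        super().__init__()
        self.oct_branch = make_branch()
        self.octa_branch = make_branch()
        self.classifier = nn.Sequential(
            nn.Linear(32 + 32, 64), nn.ReLU(),
            nn.Linear(64, num_classes),
        )

    def forward(self, oct_img: torch.Tensor, octa_img: torch.Tensor) -> torch.Tensor:
        f_oct = self.oct_branch(oct_img)
        f_octa = self.octa_branch(octa_img)
        fused = torch.cat([f_oct, f_octa], dim=1)  # feature-level fusion
        return self.classifier(fused)


if __name__ == "__main__":
    model = IntermediateFusionNet()
    oct_img = torch.randn(2, 1, 224, 224)    # dummy OCT batch
    octa_img = torch.randn(2, 1, 224, 224)   # dummy OCTA batch
    logits = model(oct_img, octa_img)
    print(logits.shape)  # torch.Size([2, 4])
```

One common way to realize the two-step training mentioned in the abstract (assumed here, not detailed in the source) is to first train each branch on its own modality and then fine-tune the fused network end to end.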
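The averaged accuracy, precision, recall, specificity, and F1-score quoted above are standard multi-class metrics. The sketch below shows one way to macro-average them from a confusion matrix (one-vs-rest per class); the exact averaging scheme used by the authors is not stated in the abstract, so this is an illustrative assumption rather than their evaluation code.

```python
# Macro-averaged accuracy, precision, recall, specificity and F1 from a
# multi-class confusion matrix (one-vs-rest per class). The averaging scheme
# is an assumption; the abstract does not state how the averages were formed.
import numpy as np


def macro_metrics(cm: np.ndarray) -> dict:
    """cm[i, j] = number of samples with true class i predicted as class j."""
    cm = cm.astype(float)
    total = cm.sum()
    acc, prec, rec, spec, f1 = [], [], [], [], []
    for k in range(cm.shape[0]):
        tp = cm[k, k]
        fn = cm[k, :].sum() - tp
        fp = cm[:, k].sum() - tp
        tn = total - tp - fn - fp
        p = tp / (tp + fp) if tp + fp else 0.0
        r = tp / (tp + fn) if tp + fn else 0.0
        acc.append((tp + tn) / total)
        prec.append(p)
        rec.append(r)
        spec.append(tn / (tn + fp) if tn + fp else 0.0)
        f1.append(2 * p * r / (p + r) if p + r else 0.0)
    return {"accuracy": np.mean(acc), "precision": np.mean(prec),
            "recall": np.mean(rec), "specificity": np.mean(spec),
            "f1": np.mean(f1)}


if __name__ == "__main__":
    # Hypothetical 3-class confusion matrix, not data from the paper.
    cm = np.array([[50, 3, 2],
                   [4, 45, 6],
                   [1, 5, 40]])
    print(macro_metrics(cm))
```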
Journal introduction:
BPEX is an inclusive, international, multidisciplinary journal devoted to publishing new research on any application of physics and/or engineering in medicine and/or biology. The journal is characterized by broad geographical coverage and a fast-track peer-review process; relevant topics include all aspects of biophysics, medical physics and biomedical engineering. Papers that are almost entirely clinical or biological in their focus are not suitable. The journal has an emphasis on publishing interdisciplinary work and bringing research fields together, encompassing experimental, theoretical and computational work.