Qiankun Zuo, Zhengkun Shi, Bo Liu, Na Ping, Jiangtao Wang, Xi Cheng, Kexin Zhang, Jia Guo, Yixian Wu, Jin Hong
{"title":"具有多方向选择机制的多分辨率视觉曼巴,用于视网膜疾病检测。","authors":"Qiankun Zuo, Zhengkun Shi, Bo Liu, Na Ping, Jiangtao Wang, Xi Cheng, Kexin Zhang, Jia Guo, Yixian Wu, Jin Hong","doi":"10.3389/fcell.2024.1484880","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Retinal diseases significantly impact patients' quality of life and increase social medical costs. Optical coherence tomography (OCT) offers high-resolution imaging for precise detection and monitoring of these conditions. While deep learning techniques have been employed to extract features from OCT images for classification, convolutional neural networks (CNNs) often fail to capture global context due to their focus on local receptive fields. Transformer-based methods, on the other hand, suffer from quadratic complexity when handling long-range dependencies.</p><p><strong>Methods: </strong>To overcome these limitations, we introduce the Multi-Resolution Visual Mamba (MRVM) model, which addresses long-range dependencies with linear computational complexity for OCT image classification. The MRVM model initially employs convolution to extract local features and subsequently utilizes the retinal Mamba to capture global dependencies. By integrating multi-scale global features, the MRVM enhances classification accuracy and overall performance. Additionally, the multi-directional selection mechanism (MSM) within the retinal Mamba improves feature extraction by concentrating on various directions, thereby better capturing complex, orientation-specific retinal patterns.</p><p><strong>Results: </strong>Experimental results demonstrate that the MRVM model excels in differentiating retinal images with various lesions, achieving superior detection accuracy compared to traditional methods, with overall accuracies of 98.98\\% and 96.21\\% on two public datasets, respectively.</p><p><strong>Discussion: </strong>This approach offers a novel perspective for accurately identifying retinal diseases and could contribute to the development of more robust artificial intelligence algorithms and recognition systems for medical image-assisted diagnosis.</p>","PeriodicalId":12448,"journal":{"name":"Frontiers in Cell and Developmental Biology","volume":"12 ","pages":"1484880"},"PeriodicalIF":4.6000,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512455/pdf/","citationCount":"0","resultStr":"{\"title\":\"Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection.\",\"authors\":\"Qiankun Zuo, Zhengkun Shi, Bo Liu, Na Ping, Jiangtao Wang, Xi Cheng, Kexin Zhang, Jia Guo, Yixian Wu, Jin Hong\",\"doi\":\"10.3389/fcell.2024.1484880\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Retinal diseases significantly impact patients' quality of life and increase social medical costs. Optical coherence tomography (OCT) offers high-resolution imaging for precise detection and monitoring of these conditions. While deep learning techniques have been employed to extract features from OCT images for classification, convolutional neural networks (CNNs) often fail to capture global context due to their focus on local receptive fields. Transformer-based methods, on the other hand, suffer from quadratic complexity when handling long-range dependencies.</p><p><strong>Methods: </strong>To overcome these limitations, we introduce the Multi-Resolution Visual Mamba (MRVM) model, which addresses long-range dependencies with linear computational complexity for OCT image classification. The MRVM model initially employs convolution to extract local features and subsequently utilizes the retinal Mamba to capture global dependencies. By integrating multi-scale global features, the MRVM enhances classification accuracy and overall performance. Additionally, the multi-directional selection mechanism (MSM) within the retinal Mamba improves feature extraction by concentrating on various directions, thereby better capturing complex, orientation-specific retinal patterns.</p><p><strong>Results: </strong>Experimental results demonstrate that the MRVM model excels in differentiating retinal images with various lesions, achieving superior detection accuracy compared to traditional methods, with overall accuracies of 98.98\\\\% and 96.21\\\\% on two public datasets, respectively.</p><p><strong>Discussion: </strong>This approach offers a novel perspective for accurately identifying retinal diseases and could contribute to the development of more robust artificial intelligence algorithms and recognition systems for medical image-assisted diagnosis.</p>\",\"PeriodicalId\":12448,\"journal\":{\"name\":\"Frontiers in Cell and Developmental Biology\",\"volume\":\"12 \",\"pages\":\"1484880\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512455/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Cell and Developmental Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.3389/fcell.2024.1484880\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"CELL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cell and Developmental Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fcell.2024.1484880","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
导言:视网膜疾病严重影响患者的生活质量,并增加社会医疗成本。光学相干断层扫描(OCT)可提供高分辨率成像,用于精确检测和监测这些疾病。虽然深度学习技术已被用于从 OCT 图像中提取特征进行分类,但卷积神经网络(CNN)由于侧重于局部感受野,往往无法捕捉全局背景。另一方面,基于变换器的方法在处理长程依赖关系时,会受到二次复杂性的影响:为了克服这些局限性,我们引入了多分辨率视觉曼巴(MRVM)模型,该模型以线性计算复杂度解决了 OCT 图像分类中的长程依赖性问题。MRVM 模型最初采用卷积法提取局部特征,随后利用视网膜 Mamba 捕捉全局相关性。通过整合多尺度全局特征,MRVM 提高了分类精度和整体性能。此外,视网膜 Mamba 中的多方向选择机制(MSM)通过集中于不同方向来改进特征提取,从而更好地捕捉复杂、特定方向的视网膜模式:实验结果表明,MRVM 模型在区分各种病变的视网膜图像方面表现出色,与传统方法相比,检测准确率更高,在两个公共数据集上的总体准确率分别为 98.98% 和 96.21%:该方法为准确识别视网膜疾病提供了一个新的视角,有助于为医学影像辅助诊断开发更强大的人工智能算法和识别系统。
Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection.
Introduction: Retinal diseases significantly impact patients' quality of life and increase social medical costs. Optical coherence tomography (OCT) offers high-resolution imaging for precise detection and monitoring of these conditions. While deep learning techniques have been employed to extract features from OCT images for classification, convolutional neural networks (CNNs) often fail to capture global context due to their focus on local receptive fields. Transformer-based methods, on the other hand, suffer from quadratic complexity when handling long-range dependencies.
Methods: To overcome these limitations, we introduce the Multi-Resolution Visual Mamba (MRVM) model, which addresses long-range dependencies with linear computational complexity for OCT image classification. The MRVM model initially employs convolution to extract local features and subsequently utilizes the retinal Mamba to capture global dependencies. By integrating multi-scale global features, the MRVM enhances classification accuracy and overall performance. Additionally, the multi-directional selection mechanism (MSM) within the retinal Mamba improves feature extraction by concentrating on various directions, thereby better capturing complex, orientation-specific retinal patterns.
Results: Experimental results demonstrate that the MRVM model excels in differentiating retinal images with various lesions, achieving superior detection accuracy compared to traditional methods, with overall accuracies of 98.98\% and 96.21\% on two public datasets, respectively.
Discussion: This approach offers a novel perspective for accurately identifying retinal diseases and could contribute to the development of more robust artificial intelligence algorithms and recognition systems for medical image-assisted diagnosis.
期刊介绍:
Frontiers in Cell and Developmental Biology is a broad-scope, interdisciplinary open-access journal, focusing on the fundamental processes of life, led by Prof Amanda Fisher and supported by a geographically diverse, high-quality editorial board.
The journal welcomes submissions on a wide spectrum of cell and developmental biology, covering intracellular and extracellular dynamics, with sections focusing on signaling, adhesion, migration, cell death and survival and membrane trafficking. Additionally, the journal offers sections dedicated to the cutting edge of fundamental and translational research in molecular medicine and stem cell biology.
With a collaborative, rigorous and transparent peer-review, the journal produces the highest scientific quality in both fundamental and applied research, and advanced article level metrics measure the real-time impact and influence of each publication.