{"title":"多尺度注意力网络在彩色图像视网膜静脉阻塞分类中的应用","authors":"Xiaochen Wang, Yanhui Ding, Yuanjie Zheng","doi":"10.1002/ima.22917","DOIUrl":null,"url":null,"abstract":"Recently, automatic diagnostic approaches widely use various retinal images to classify ocular diseases. And retinal vein occlusion (RVO) is the second most common retinal vascular disease after diabetic retinopathy. In clinical practice, ophthalmologists are usually accustomed to resorting to images of one modality. But single‐modality images often ignore other modality‐specific information. To solve this problem, this paper uses a novel retinal imaging, the multicolor (MC) imaging, for RVO recognition. It can obtain four multiple modal images with different wavelengths to provide much richer information about retinal features. Since the MC images contain local and global pathologies at multiple scales, a multiscale attention structure is proposed to recognize RVO. In simple terms, this structure uses Resnet as the backbone network for feature extraction, with simultaneous input of images in four modalities. Then, the feature maps at different scales are fed into an attention module to fuse the global and local features, which combines two attention mechanisms, the channel attention mechanism and the spatial attention mechanism. The extensive experimental results demonstrate that our proposed framework achieves quite promising classification performance on the fundus diseases and normal images.","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"33 6","pages":"2012-2022"},"PeriodicalIF":3.0000,"publicationDate":"2023-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiscale attention network for retinal vein occlusion classification with multicolor image\",\"authors\":\"Xiaochen Wang, Yanhui Ding, Yuanjie Zheng\",\"doi\":\"10.1002/ima.22917\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, automatic diagnostic approaches widely use various retinal images to classify ocular diseases. And retinal vein occlusion (RVO) is the second most common retinal vascular disease after diabetic retinopathy. In clinical practice, ophthalmologists are usually accustomed to resorting to images of one modality. But single‐modality images often ignore other modality‐specific information. To solve this problem, this paper uses a novel retinal imaging, the multicolor (MC) imaging, for RVO recognition. It can obtain four multiple modal images with different wavelengths to provide much richer information about retinal features. Since the MC images contain local and global pathologies at multiple scales, a multiscale attention structure is proposed to recognize RVO. In simple terms, this structure uses Resnet as the backbone network for feature extraction, with simultaneous input of images in four modalities. Then, the feature maps at different scales are fed into an attention module to fuse the global and local features, which combines two attention mechanisms, the channel attention mechanism and the spatial attention mechanism. The extensive experimental results demonstrate that our proposed framework achieves quite promising classification performance on the fundus diseases and normal images.\",\"PeriodicalId\":14027,\"journal\":{\"name\":\"International Journal of Imaging Systems and Technology\",\"volume\":\"33 6\",\"pages\":\"2012-2022\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2023-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Imaging Systems and Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/ima.22917\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.22917","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Multiscale attention network for retinal vein occlusion classification with multicolor image
Recently, automatic diagnostic approaches widely use various retinal images to classify ocular diseases. And retinal vein occlusion (RVO) is the second most common retinal vascular disease after diabetic retinopathy. In clinical practice, ophthalmologists are usually accustomed to resorting to images of one modality. But single‐modality images often ignore other modality‐specific information. To solve this problem, this paper uses a novel retinal imaging, the multicolor (MC) imaging, for RVO recognition. It can obtain four multiple modal images with different wavelengths to provide much richer information about retinal features. Since the MC images contain local and global pathologies at multiple scales, a multiscale attention structure is proposed to recognize RVO. In simple terms, this structure uses Resnet as the backbone network for feature extraction, with simultaneous input of images in four modalities. Then, the feature maps at different scales are fed into an attention module to fuse the global and local features, which combines two attention mechanisms, the channel attention mechanism and the spatial attention mechanism. The extensive experimental results demonstrate that our proposed framework achieves quite promising classification performance on the fundus diseases and normal images.
期刊介绍:
The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals.
IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging.
The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered.
The scope of the journal includes, but is not limited to, the following in the context of biomedical research:
Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.;
Neuromodulation and brain stimulation techniques such as TMS and tDCS;
Software and hardware for imaging, especially related to human and animal health;
Image segmentation in normal and clinical populations;
Pattern analysis and classification using machine learning techniques;
Computational modeling and analysis;
Brain connectivity and connectomics;
Systems-level characterization of brain function;
Neural networks and neurorobotics;
Computer vision, based on human/animal physiology;
Brain-computer interface (BCI) technology;
Big data, databasing and data mining.