{"title":"一种用于光学放大医学图像超分辨率的多级渐进多注意转换器","authors":"Mingxuan Li , Jinbao Li , Yingchun Cui","doi":"10.1016/j.bspc.2025.108066","DOIUrl":null,"url":null,"abstract":"<div><div>Optical magnification medical images are essential for doctors diagnosing diseases by analyzing patterns, textures, and regional structures. However, optical magnification medical images are often affected by various factors such as device limitations, changes in lighting, and the condition of the patient’s lesion area, resulting in low resolution and partial blurring of the obtained image results, which in turn affects subsequent diagnosis and treatment. Although super-resolution is an effective way to improve the clarity of medical images, most super-resolution models still face a series of challenges when applied to optical magnification medical images: on the one hand, most models are single-stage architecture design that cannot fully utilize multi-scale information, and on the other hand, whether the state-of-the-art methods can maintain superior performance and applicability on different medical image datasets. To address these challenges, we propose a Multi-Stage Progressive Multi-Attention Transformer (MPMAT) framework, which captures pixel information of small-, medium-, and large-scale images through a three-stage design and utilizes feature information of different scales to generate higher-quality medical images. Then, we propose a Multi-Attention Group (MAG) method, combining various attention mechanisms with a Multi-Convolution Feature Fusion Block (MCFFB). This approach can enhances the integration of local and global features and addresses the spatial and channel feature extraction strategy problem. Finally, the experimental results demonstrate that MPMAT outperforms the state-of-the-art methods by a significant margin, achieving an improvement of 0.3 dB to 1.2 dB on multiple optical magnification image datasets.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"110 ","pages":"Article 108066"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A multi-stage progressive multi-attention transformer for optical magnification medical images super-resolution\",\"authors\":\"Mingxuan Li , Jinbao Li , Yingchun Cui\",\"doi\":\"10.1016/j.bspc.2025.108066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Optical magnification medical images are essential for doctors diagnosing diseases by analyzing patterns, textures, and regional structures. However, optical magnification medical images are often affected by various factors such as device limitations, changes in lighting, and the condition of the patient’s lesion area, resulting in low resolution and partial blurring of the obtained image results, which in turn affects subsequent diagnosis and treatment. Although super-resolution is an effective way to improve the clarity of medical images, most super-resolution models still face a series of challenges when applied to optical magnification medical images: on the one hand, most models are single-stage architecture design that cannot fully utilize multi-scale information, and on the other hand, whether the state-of-the-art methods can maintain superior performance and applicability on different medical image datasets. To address these challenges, we propose a Multi-Stage Progressive Multi-Attention Transformer (MPMAT) framework, which captures pixel information of small-, medium-, and large-scale images through a three-stage design and utilizes feature information of different scales to generate higher-quality medical images. Then, we propose a Multi-Attention Group (MAG) method, combining various attention mechanisms with a Multi-Convolution Feature Fusion Block (MCFFB). This approach can enhances the integration of local and global features and addresses the spatial and channel feature extraction strategy problem. Finally, the experimental results demonstrate that MPMAT outperforms the state-of-the-art methods by a significant margin, achieving an improvement of 0.3 dB to 1.2 dB on multiple optical magnification image datasets.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"110 \",\"pages\":\"Article 108066\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425005774\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425005774","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
A multi-stage progressive multi-attention transformer for optical magnification medical images super-resolution
Optical magnification medical images are essential for doctors diagnosing diseases by analyzing patterns, textures, and regional structures. However, optical magnification medical images are often affected by various factors such as device limitations, changes in lighting, and the condition of the patient’s lesion area, resulting in low resolution and partial blurring of the obtained image results, which in turn affects subsequent diagnosis and treatment. Although super-resolution is an effective way to improve the clarity of medical images, most super-resolution models still face a series of challenges when applied to optical magnification medical images: on the one hand, most models are single-stage architecture design that cannot fully utilize multi-scale information, and on the other hand, whether the state-of-the-art methods can maintain superior performance and applicability on different medical image datasets. To address these challenges, we propose a Multi-Stage Progressive Multi-Attention Transformer (MPMAT) framework, which captures pixel information of small-, medium-, and large-scale images through a three-stage design and utilizes feature information of different scales to generate higher-quality medical images. Then, we propose a Multi-Attention Group (MAG) method, combining various attention mechanisms with a Multi-Convolution Feature Fusion Block (MCFFB). This approach can enhances the integration of local and global features and addresses the spatial and channel feature extraction strategy problem. Finally, the experimental results demonstrate that MPMAT outperforms the state-of-the-art methods by a significant margin, achieving an improvement of 0.3 dB to 1.2 dB on multiple optical magnification image datasets.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.