Tianyong Liu , Zhiqing Zhang , Guojia Fan , Nan Li , Chengwu Xu , Bin Li , Gang Zhao , Shoujun Zhou
{"title":"FocusMorph: A novel multi-scale fusion network for 3D brain MR image registration","authors":"Tianyong Liu , Zhiqing Zhang , Guojia Fan , Nan Li , Chengwu Xu , Bin Li , Gang Zhao , Shoujun Zhou","doi":"10.1016/j.patcog.2025.111761","DOIUrl":null,"url":null,"abstract":"<div><div>In the field of medical image processing, registration algorithms are crucial tools, especially in assisting physicians with aligning medical images acquired at different time points or through different modalities. These techniques are particularly important for medical applications such as disease diagnosis, lesion detection, surgical planning, and treatment monitoring. However, although most deep learning-based methods are capable of extracting multiscale features, they may fail to produce outputs that are directly related to the fnal deformation feld. Additionally, many methods based on the U-Net structure overly rely on the last layer of high-resolution images, which represents a significant drawback. To address these issues, we propose a novel unsupervised deformable registration method named FocusMorph. This method centers on the FLatten Transformer block and employs a focused linear attention mechanism to enhance attentional expressivity while maintaining low complexity. We have also designed a layer-by-layer output fusion mechanism and a motion image encoder specifically for medical image registration, which aids in continuously tracking positional differences between motion images and effectively fusing them. Experimental results indicate that the FocusMorph method surpasses current leading medical image registration techniques on two distinct brain image datasets. It achieves improvements in the Dice coefficient by 2.6% and 1.5%, respectively, confirming its superior performance and significant potential in image registration. These findings not only highlight FocusMorph’s robust registration capabilities but also underscore its promising prospects in medical image processing.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"167 ","pages":"Article 111761"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325004212","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In the field of medical image processing, registration algorithms are crucial tools, especially in assisting physicians with aligning medical images acquired at different time points or through different modalities. These techniques are particularly important for medical applications such as disease diagnosis, lesion detection, surgical planning, and treatment monitoring. However, although most deep learning-based methods are capable of extracting multiscale features, they may fail to produce outputs that are directly related to the fnal deformation feld. Additionally, many methods based on the U-Net structure overly rely on the last layer of high-resolution images, which represents a significant drawback. To address these issues, we propose a novel unsupervised deformable registration method named FocusMorph. This method centers on the FLatten Transformer block and employs a focused linear attention mechanism to enhance attentional expressivity while maintaining low complexity. We have also designed a layer-by-layer output fusion mechanism and a motion image encoder specifically for medical image registration, which aids in continuously tracking positional differences between motion images and effectively fusing them. Experimental results indicate that the FocusMorph method surpasses current leading medical image registration techniques on two distinct brain image datasets. It achieves improvements in the Dice coefficient by 2.6% and 1.5%, respectively, confirming its superior performance and significant potential in image registration. These findings not only highlight FocusMorph’s robust registration capabilities but also underscore its promising prospects in medical image processing.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.