Hao Zhai , Nannan Luo , You Yang , Zhendong Xu , Bo Lin
{"title":"Multi-focus image fusion via multi-scale attention and Siamese networks","authors":"Hao Zhai , Nannan Luo , You Yang , Zhendong Xu , Bo Lin","doi":"10.1016/j.dsp.2025.105493","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-focus Image Fusion (MFIF) technology aims to generate a full-focus image with an extended focus range by combining multiple images with different focal depths. This has significant implications in fields such as image restoration and medical imaging. This paper proposes a new MFIF method based on deep learning, which utilizes multi-scale attention and a Siamese network structure to efficiently extract local depth features from images and enhance the fusion effect. The design of the Siamese network structure allows the model to process paired multi-focus images and share the feature extraction process in the deeper layers of the network. This not only enhances the expressive capability but also improves the model's ability to recognize images with different focal depths. Consequently, the network can effectively capture local depth features, which provides rich information for subsequent fusion. By incorporating a multi-scale dilated convolution attention module, which dynamically adapts the receptive field size to encompass a larger number of pixels, the process of information aggregation is facilitated across a wider area, thereby enhancing the optimization of the feature reconstruction process. Furthermore, binary segmentation and small-area filtering methods are employed to enhance the consistency of the fused image. Experimental results show that the proposed method surpasses existing multi-focus image fusion methods in terms of both subjective visual effects and objective evaluation metrics.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"168 ","pages":"Article 105493"},"PeriodicalIF":3.0000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425005159","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-focus Image Fusion (MFIF) technology aims to generate a full-focus image with an extended focus range by combining multiple images with different focal depths. This has significant implications in fields such as image restoration and medical imaging. This paper proposes a new MFIF method based on deep learning, which utilizes multi-scale attention and a Siamese network structure to efficiently extract local depth features from images and enhance the fusion effect. The design of the Siamese network structure allows the model to process paired multi-focus images and share the feature extraction process in the deeper layers of the network. This not only enhances the expressive capability but also improves the model's ability to recognize images with different focal depths. Consequently, the network can effectively capture local depth features, which provides rich information for subsequent fusion. By incorporating a multi-scale dilated convolution attention module, which dynamically adapts the receptive field size to encompass a larger number of pixels, the process of information aggregation is facilitated across a wider area, thereby enhancing the optimization of the feature reconstruction process. Furthermore, binary segmentation and small-area filtering methods are employed to enhance the consistency of the fused image. Experimental results show that the proposed method surpasses existing multi-focus image fusion methods in terms of both subjective visual effects and objective evaluation metrics.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,