DGTF: A framework utilizing textual semantics for detail-aware and global fusion with a multi-degradation scenarios dataset

IF 4.6 | Zone 2 (Physics and Astrophysics) | Q1 OPTICS
Mingxin Yu , Zhenyang Liang , Ning Li , Mingwei Lin
Journal: Optics and Laser Technology, Volume 191, Article 113319
Publication type: Journal Article
Publication date: 2025-06-12
DOI: 10.1016/j.optlastec.2025.113319
Full text: https://www.sciencedirect.com/science/article/pii/S0030399225009107
Citation count: 0

Abstract

Infrared and visible light fusion aims to integrate information from both modalities to generate high-quality fused images. However, existing methods do not perform well in multiple degraded scenarios. In these scenarios, the source images face quality degradation and information loss. Fusion combined with degradation processing often damages the detailed target information in the images. To address this limitation, this paper proposes a novel Degradation-Text Fusion framework, named DGTF, which leverages cascaded degradation text, object text, and target masks for detail-aware degradation regulation. The framework adjusts the details based on the object text specified in the given input, ensuring that global degradation processing does not compromise the quality of detail fusion. This approach overcomes the limitations of previous methods constrained by global degradation processing. To train and evaluate DGTF, we constructed a new infrared and visible light dataset, Multi-degraded scene text target infrared and visible datasets (MTS), which encompasses seven extreme scenarios, including rain, snow, fog, low light, exposure, infrared noise, and low contrast. Extensive experimental results demonstrate that our method significantly outperforms existing techniques in fusion performance, even without text guidance. Furthermore, tests conducted on the MTS dataset reveal that the detail-regulated fusion results achieved by DGTF far surpass traditional degradation-based fusion methods, effectively enhancing the performance of advanced vision tasks. These findings validate the effectiveness of the proposed detail regulation framework. Our code is available at https://github.com/linshenj/DGTF.
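
The abstract's central idea, that global degradation processing should not overwrite target detail, can be illustrated with a simple mask-weighted blend. The sketch below is not the DGTF architecture, which the abstract does not specify; it only shows how a target mask can keep a detail-preserving result inside target regions while the globally processed result is used elsewhere. The function name `mask_guided_blend` and its three inputs are hypothetical and chosen for illustration.

```python
import numpy as np


def mask_guided_blend(global_fused, detail_fused, target_mask):
    """Blend two fused images with a soft target mask (hypothetical helper).

    global_fused: fused image after global degradation processing.
    detail_fused: fused image that preserves target detail.
    target_mask:  values in [0, 1]; 1 marks target pixels.
    All three arrays are assumed to share one shape.
    """
    g = np.asarray(global_fused, dtype=np.float64)
    d = np.asarray(detail_fused, dtype=np.float64)
    # Clip so that out-of-range mask values cannot extrapolate the blend.
    m = np.clip(np.asarray(target_mask, dtype=np.float64), 0.0, 1.0)
    if not (g.shape == d.shape == m.shape):
        raise ValueError("all inputs must share the same shape")
    # Inside the mask take the detail result, outside take the global one.
    return m * d + (1.0 - m) * g


# Toy 2x2 example with made-up pixel values.
global_fused = np.full((2, 2), 0.2)
detail_fused = np.full((2, 2), 0.8)
target_mask = np.array([[1.0, 0.0], [0.5, 0.0]])
blended = mask_guided_blend(global_fused, detail_fused, target_mask)
```

In this toy case the fully masked pixel takes the detail value, unmasked pixels take the global value, and the half-masked pixel is an even mix of the two. A learned framework would presumably predict or condition on such masks from the object text rather than receive them as fixed arrays; that part is outside what this sketch covers.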
Source journal: Optics and Laser Technology
CiteScore: 8.50
Self-citation rate: 10.00%
Articles published: 1060
Review time: 3.4 months
About the journal: Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication. The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas:
• development in all types of lasers
• developments in optoelectronic devices and photonics
• developments in new photonics and optical concepts
• developments in conventional optics, optical instruments and components
• techniques of optical metrology, including interferometry and optical fibre sensors
• LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow
• applications of lasers to materials processing, optical NDT display (including holography) and optical communication
• research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume)
• developments in optical computing and optical information processing
• developments in new optical materials
• developments in new optical characterization methods and techniques
• developments in quantum optics
• developments in light assisted micro and nanofabrication methods and techniques
• developments in nanophotonics and biophotonics
• developments in imaging processing and systems