轻量级图像超分辨率标记化动态嵌入网络

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Xiangyuan Zhu , Xuchong Liu , Zheng Wu
{"title":"轻量级图像超分辨率标记化动态嵌入网络","authors":"Xiangyuan Zhu ,&nbsp;Xuchong Liu ,&nbsp;Zheng Wu","doi":"10.1016/j.knosys.2025.114640","DOIUrl":null,"url":null,"abstract":"<div><div>Image super-resolution is a crucial task in computer vision, aiming to reconstruct high-resolution images from low-resolution counterparts. Despite the remarkable progress of deep learning-based methods, existing approaches often face challenges in balancing reconstruction quality, computational efficiency, and model compactness. In this paper, we propose a novel tokenized dynamic embedding network, which integrates adaptive feature tokenization and dynamic embedding mechanisms to enhance super-resolution performance while maintaining efficiency. Specifically, we employ an adaptive feature tokenization strategy to selectively extract essential tokens, reducing computational complexity while preserving key image details. Additionally, we introduce a dynamic context embedding attention module for efficient long-range dependency modeling and a dual-perspective feature integration module for integrating spatial and contextual information, ensuring both fine-grained textures and global consistency. Extensive experiments on benchmark datasets demonstrate that our method outperforms state-of-the-art lightweight models in terms of objective metrics and perceptual quality, while maintaining a compact and efficient design suitable for real-world applications. The source code is available at <span><span>https://github.com/zxycs/TDEN</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114640"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lightweight image super-resolution with tokenized dynamic embedding network\",\"authors\":\"Xiangyuan Zhu ,&nbsp;Xuchong Liu ,&nbsp;Zheng Wu\",\"doi\":\"10.1016/j.knosys.2025.114640\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Image super-resolution is a crucial task in computer vision, aiming to reconstruct high-resolution images from low-resolution counterparts. Despite the remarkable progress of deep learning-based methods, existing approaches often face challenges in balancing reconstruction quality, computational efficiency, and model compactness. In this paper, we propose a novel tokenized dynamic embedding network, which integrates adaptive feature tokenization and dynamic embedding mechanisms to enhance super-resolution performance while maintaining efficiency. Specifically, we employ an adaptive feature tokenization strategy to selectively extract essential tokens, reducing computational complexity while preserving key image details. Additionally, we introduce a dynamic context embedding attention module for efficient long-range dependency modeling and a dual-perspective feature integration module for integrating spatial and contextual information, ensuring both fine-grained textures and global consistency. Extensive experiments on benchmark datasets demonstrate that our method outperforms state-of-the-art lightweight models in terms of objective metrics and perceptual quality, while maintaining a compact and efficient design suitable for real-world applications. The source code is available at <span><span>https://github.com/zxycs/TDEN</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"330 \",\"pages\":\"Article 114640\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S095070512501679X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095070512501679X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

图像超分辨率是计算机视觉中的一项重要任务,旨在从低分辨率图像中重建高分辨率图像。尽管基于深度学习的方法取得了显著进展,但现有方法在平衡重建质量、计算效率和模型紧凑性方面经常面临挑战。在本文中,我们提出了一种新的标记化动态嵌入网络,该网络将自适应特征标记化和动态嵌入机制相结合,在保持效率的同时提高了超分辨率性能。具体来说,我们采用自适应特征标记化策略来选择性地提取基本标记,在保留关键图像细节的同时降低计算复杂度。此外,我们还引入了一个动态上下文嵌入关注模块,用于高效的远程依赖建模;引入了一个双视角特征集成模块,用于集成空间和上下文信息,以确保细粒度纹理和全局一致性。在基准数据集上进行的大量实验表明,我们的方法在客观指标和感知质量方面优于最先进的轻量级模型,同时保持了适合实际应用的紧凑高效的设计。源代码可从https://github.com/zxycs/TDEN获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Lightweight image super-resolution with tokenized dynamic embedding network
Image super-resolution is a crucial task in computer vision, aiming to reconstruct high-resolution images from low-resolution counterparts. Despite the remarkable progress of deep learning-based methods, existing approaches often face challenges in balancing reconstruction quality, computational efficiency, and model compactness. In this paper, we propose a novel tokenized dynamic embedding network, which integrates adaptive feature tokenization and dynamic embedding mechanisms to enhance super-resolution performance while maintaining efficiency. Specifically, we employ an adaptive feature tokenization strategy to selectively extract essential tokens, reducing computational complexity while preserving key image details. Additionally, we introduce a dynamic context embedding attention module for efficient long-range dependency modeling and a dual-perspective feature integration module for integrating spatial and contextual information, ensuring both fine-grained textures and global consistency. Extensive experiments on benchmark datasets demonstrate that our method outperforms state-of-the-art lightweight models in terms of objective metrics and perceptual quality, while maintaining a compact and efficient design suitable for real-world applications. The source code is available at https://github.com/zxycs/TDEN.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Knowledge-Based Systems
Knowledge-Based Systems 工程技术-计算机:人工智能
CiteScore
14.80
自引率
12.50%
发文量
1245
审稿时长
7.8 months
期刊介绍: Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信