SASU-Net:基于光谱自适应聚合加权和尺度更新的高光谱视频跟踪器

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Dong Zhao , Haorui Zhang , Kunpeng Huang , Xuguang Zhu , Pattathal V. Arun , Wenhao Jiang , Shiyu Li , Xiaofang Pei , Huixin Zhou
{"title":"SASU-Net:基于光谱自适应聚合加权和尺度更新的高光谱视频跟踪器","authors":"Dong Zhao ,&nbsp;Haorui Zhang ,&nbsp;Kunpeng Huang ,&nbsp;Xuguang Zhu ,&nbsp;Pattathal V. Arun ,&nbsp;Wenhao Jiang ,&nbsp;Shiyu Li ,&nbsp;Xiaofang Pei ,&nbsp;Huixin Zhou","doi":"10.1016/j.eswa.2025.126721","DOIUrl":null,"url":null,"abstract":"<div><div>In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at <span><span>https://github.com/CodeMANz11/SASUNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"272 ","pages":"Article 126721"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SASU-Net: Hyperspectral video tracker based on spectral adaptive aggregation weighting and scale updating\",\"authors\":\"Dong Zhao ,&nbsp;Haorui Zhang ,&nbsp;Kunpeng Huang ,&nbsp;Xuguang Zhu ,&nbsp;Pattathal V. Arun ,&nbsp;Wenhao Jiang ,&nbsp;Shiyu Li ,&nbsp;Xiaofang Pei ,&nbsp;Huixin Zhou\",\"doi\":\"10.1016/j.eswa.2025.126721\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at <span><span>https://github.com/CodeMANz11/SASUNet</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"272 \",\"pages\":\"Article 126721\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-02-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425003434\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425003434","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

为了解决高光谱视频跟踪中尺度变化的问题,提出了一种基于光谱自适应聚合加权和尺度更新的高光谱视频跟踪方法。首先,利用波段自适应聚集加权对高光谱图像进行降维,生成光谱先验掩模;然后将降维后的图像输入ResNet50网络进行特征提取。目标区域和搜索区域的特征分别指向编码器和解码器。同时,将频谱先验掩码输入解码器进行先验校正。解码器的输出,融合矢量,经过无锚点预测以精确确定目标位置。SASU-Net集成了规模感知更新模块和基于规模评估的分段模板更新策略。最后对目标分类得分和量表得分进行评估,决定是否更新模板。通过对频谱自适应聚合的新颖利用、基于先验掩码的解码以及与尺度感知和更新策略的融合,SASU-Net在应对多种挑战,特别是尺度变化方面显示出显著的优势。实验结果表明,与基准高光谱视频跟踪器相比,该框架具有优越的性能,特别是在有效解决尺度变化挑战方面。该工作的源代码以及模型权重可在https://github.com/CodeMANz11/SASUNet上公开获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

SASU-Net: Hyperspectral video tracker based on spectral adaptive aggregation weighting and scale updating

SASU-Net: Hyperspectral video tracker based on spectral adaptive aggregation weighting and scale updating
In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at https://github.com/CodeMANz11/SASUNet.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信