SASU-Net：基于光谱自适应聚合加权和尺度更新的高光谱视频跟踪器

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2025-02-06 DOI:10.1016/j.eswa.2025.126721

Dong Zhao , Haorui Zhang , Kunpeng Huang , Xuguang Zhu , Pattathal V. Arun , Wenhao Jiang , Shiyu Li , Xiaofang Pei , Huixin Zhou

{"title":"SASU-Net：基于光谱自适应聚合加权和尺度更新的高光谱视频跟踪器","authors":"Dong Zhao , Haorui Zhang , Kunpeng Huang , Xuguang Zhu , Pattathal V. Arun , Wenhao Jiang , Shiyu Li , Xiaofang Pei , Huixin Zhou","doi":"10.1016/j.eswa.2025.126721","DOIUrl":null,"url":null,"abstract":"<div><div>In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at <span><span>https://github.com/CodeMANz11/SASUNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"272 ","pages":"Article 126721"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SASU-Net: Hyperspectral video tracker based on spectral adaptive aggregation weighting and scale updating\",\"authors\":\"Dong Zhao , Haorui Zhang , Kunpeng Huang , Xuguang Zhu , Pattathal V. Arun , Wenhao Jiang , Shiyu Li , Xiaofang Pei , Huixin Zhou\",\"doi\":\"10.1016/j.eswa.2025.126721\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at <span><span>https://github.com/CodeMANz11/SASUNet</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"272 \",\"pages\":\"Article 126721\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-02-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425003434\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425003434","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

为了解决高光谱视频跟踪中尺度变化的问题，提出了一种基于光谱自适应聚合加权和尺度更新的高光谱视频跟踪方法。首先，利用波段自适应聚集加权对高光谱图像进行降维，生成光谱先验掩模；然后将降维后的图像输入ResNet50网络进行特征提取。目标区域和搜索区域的特征分别指向编码器和解码器。同时，将频谱先验掩码输入解码器进行先验校正。解码器的输出，融合矢量，经过无锚点预测以精确确定目标位置。SASU-Net集成了规模感知更新模块和基于规模评估的分段模板更新策略。最后对目标分类得分和量表得分进行评估，决定是否更新模板。通过对频谱自适应聚合的新颖利用、基于先验掩码的解码以及与尺度感知和更新策略的融合，SASU-Net在应对多种挑战，特别是尺度变化方面显示出显著的优势。实验结果表明，与基准高光谱视频跟踪器相比，该框架具有优越的性能，特别是在有效解决尺度变化挑战方面。该工作的源代码以及模型权重可在https://github.com/CodeMANz11/SASUNet上公开获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

SASU-Net: Hyperspectral video tracker based on spectral adaptive aggregation weighting and scale updating

查看原文本刊更多论文

SASU-Net: Hyperspectral video tracker based on spectral adaptive aggregation weighting and scale updating

In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at https://github.com/CodeMANz11/SASUNet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.