{"title":"SASU-Net:基于光谱自适应聚合加权和尺度更新的高光谱视频跟踪器","authors":"Dong Zhao , Haorui Zhang , Kunpeng Huang , Xuguang Zhu , Pattathal V. Arun , Wenhao Jiang , Shiyu Li , Xiaofang Pei , Huixin Zhou","doi":"10.1016/j.eswa.2025.126721","DOIUrl":null,"url":null,"abstract":"<div><div>In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at <span><span>https://github.com/CodeMANz11/SASUNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"272 ","pages":"Article 126721"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SASU-Net: Hyperspectral video tracker based on spectral adaptive aggregation weighting and scale updating\",\"authors\":\"Dong Zhao , Haorui Zhang , Kunpeng Huang , Xuguang Zhu , Pattathal V. Arun , Wenhao Jiang , Shiyu Li , Xiaofang Pei , Huixin Zhou\",\"doi\":\"10.1016/j.eswa.2025.126721\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at <span><span>https://github.com/CodeMANz11/SASUNet</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"272 \",\"pages\":\"Article 126721\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-02-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425003434\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425003434","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
SASU-Net: Hyperspectral video tracker based on spectral adaptive aggregation weighting and scale updating
In order to address the challenge of scale variation in hyperspectral video tracking, we propose a novel tracker based on spectral adaptive aggregation weighting and scale updating. First, band adaptive aggregation weighting is used to reduce the dimensionality of the hyperspectral image and generate a spectral prior mask. The dimensionality-reduced image is then fed into the ResNet50 network for feature extraction. The features of the target and search regions are respectively directed to the encoder and decoder. Simultaneously, the spectral prior mask is input into the decoder for prior correction. The output from the decoder, fused vector, is subjected to anchor-free prediction for precisely determining the target position. SASU-Net incorporates a scale aware update module and a segmented template update strategy, which is founded on scale evaluation. At last, the target classification score and scale score are evaluated to determine whether to update the template. Through the novelty utilization of spectral adaptive aggregation, decoding based on prior masks, and the fusion with scale sensing and update strategy, SASU-Net shows a significant advantage in tackling multiple challenges, especially scale variation. Experimental results illustrate the superior performance of the proposed framework in comparison with the benchmark hyperspectral video trackers specially in effectively addressing scale variation challenges. The source code along with model weights of this work is publicly available at https://github.com/CodeMANz11/SASUNet.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.