{"title":"SiamSCT:用于实时空中跟踪的空间通道显著性和时间融合网络","authors":"Bo Wang;Chenglong Liu;Qiqi Chen;Jinghong Liu","doi":"10.1109/LGRS.2025.3596119","DOIUrl":null,"url":null,"abstract":"Visual tracking on uncrewed aerial vehicle (UAV) platforms is a fundamental and crucial visual task. Compared to conventional tracking tasks, aerial tracking faces specific challenging scenarios due to its unique perspective. Although existing aerial trackers have demonstrated promising performance, they remain limited in capturing spatial-channel saliency across branches and effectively utilizing historical information. To address these issues, this letter proposes a spatial-channel saliency and temporal fusion network (SiamSCT), which aims to enhance feature representation for efficient and accurate aerial tracking. Specifically, SiamSCT introduces a weight-shared spatial saliency block (SSB) to strengthen the spatial feature representation across the tracking network’s branches. In addition, a light channel aware module (CAM) is designed to facilitate deep feature interaction across branches at the channel level, further enhancing feature discriminability. Finally, using a historical similarity response fusion strategy, SiamSCT achieves more stable and reliable tracking responses, effectively tackling complex aerial scenarios. Extensive experiments on several authoritative aerial tracking datasets demonstrate that SiamSCT outperforms state-of-the-art (SOTA) trackers. Furthermore, SiamSCT achieves a tracking speed of 133 frames/s on NVIDIA RTX 3060Ti, proving its excellent performance in real time.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SiamSCT: Spatial-Channel Saliency and Temporal Fusion Network for Real-Time Aerial Tracking\",\"authors\":\"Bo Wang;Chenglong Liu;Qiqi Chen;Jinghong Liu\",\"doi\":\"10.1109/LGRS.2025.3596119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Visual tracking on uncrewed aerial vehicle (UAV) platforms is a fundamental and crucial visual task. Compared to conventional tracking tasks, aerial tracking faces specific challenging scenarios due to its unique perspective. Although existing aerial trackers have demonstrated promising performance, they remain limited in capturing spatial-channel saliency across branches and effectively utilizing historical information. To address these issues, this letter proposes a spatial-channel saliency and temporal fusion network (SiamSCT), which aims to enhance feature representation for efficient and accurate aerial tracking. Specifically, SiamSCT introduces a weight-shared spatial saliency block (SSB) to strengthen the spatial feature representation across the tracking network’s branches. In addition, a light channel aware module (CAM) is designed to facilitate deep feature interaction across branches at the channel level, further enhancing feature discriminability. Finally, using a historical similarity response fusion strategy, SiamSCT achieves more stable and reliable tracking responses, effectively tackling complex aerial scenarios. Extensive experiments on several authoritative aerial tracking datasets demonstrate that SiamSCT outperforms state-of-the-art (SOTA) trackers. Furthermore, SiamSCT achieves a tracking speed of 133 frames/s on NVIDIA RTX 3060Ti, proving its excellent performance in real time.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11113247/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11113247/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SiamSCT: Spatial-Channel Saliency and Temporal Fusion Network for Real-Time Aerial Tracking
Visual tracking on uncrewed aerial vehicle (UAV) platforms is a fundamental and crucial visual task. Compared to conventional tracking tasks, aerial tracking faces specific challenging scenarios due to its unique perspective. Although existing aerial trackers have demonstrated promising performance, they remain limited in capturing spatial-channel saliency across branches and effectively utilizing historical information. To address these issues, this letter proposes a spatial-channel saliency and temporal fusion network (SiamSCT), which aims to enhance feature representation for efficient and accurate aerial tracking. Specifically, SiamSCT introduces a weight-shared spatial saliency block (SSB) to strengthen the spatial feature representation across the tracking network’s branches. In addition, a light channel aware module (CAM) is designed to facilitate deep feature interaction across branches at the channel level, further enhancing feature discriminability. Finally, using a historical similarity response fusion strategy, SiamSCT achieves more stable and reliable tracking responses, effectively tackling complex aerial scenarios. Extensive experiments on several authoritative aerial tracking datasets demonstrate that SiamSCT outperforms state-of-the-art (SOTA) trackers. Furthermore, SiamSCT achieves a tracking speed of 133 frames/s on NVIDIA RTX 3060Ti, proving its excellent performance in real time.