Longyi Li , Liyan Dong , Hao Zhang , Jun Qin , Zhengtai Zhang , Minghui Sun
{"title":"TFS-Net: Temporal first simulation network for video saliency prediction","authors":"Longyi Li , Liyan Dong , Hao Zhang , Jun Qin , Zhengtai Zhang , Minghui Sun","doi":"10.1016/j.eswa.2025.127652","DOIUrl":null,"url":null,"abstract":"<div><div>Video saliency prediction (VSP) plays a critical role in modern video processing systems by optimizing computational resource allocation and enhancing overall system performance. However, existing VSP methods either lack effective temporal modeling or incur high computational costs, particularly struggling with the initialization of video sequences. This paper presents TFS-Net, a novel temporal-first simulation network for VSP that integrates both static and dynamic modeling via parallel-optimized self-attention mechanisms. Specifically, TFS-Net addresses the challenge of initial frame processing with the innovative F31 algorithm and improves multi-scale spatiotemporal feature integration through a Hierarchical Decoder with Multi-dimensional Attention (HDMA). Drawing inspiration from primate saccadic behavior, the F31 algorithm optimizes processing efficiency during both training and inference phases, demonstrating particular effectiveness in unmanned aerial vehicle (UAV) real-time applications. Extensive evaluations on public datasets demonstrate that TFS-Net achieves significant improvements over state-of-the-art methods, with gains of 14.6%, 12.0%, and 11.2% in AUC-J, CC, and SIM metrics, respectively. Further experiments on UAV video analysis validate the model’s robustness and practicality in real-world scenarios.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"282 ","pages":"Article 127652"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425012746","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Video saliency prediction (VSP) plays a critical role in modern video processing systems by optimizing computational resource allocation and enhancing overall system performance. However, existing VSP methods either lack effective temporal modeling or incur high computational costs, particularly struggling with the initialization of video sequences. This paper presents TFS-Net, a novel temporal-first simulation network for VSP that integrates both static and dynamic modeling via parallel-optimized self-attention mechanisms. Specifically, TFS-Net addresses the challenge of initial frame processing with the innovative F31 algorithm and improves multi-scale spatiotemporal feature integration through a Hierarchical Decoder with Multi-dimensional Attention (HDMA). Drawing inspiration from primate saccadic behavior, the F31 algorithm optimizes processing efficiency during both training and inference phases, demonstrating particular effectiveness in unmanned aerial vehicle (UAV) real-time applications. Extensive evaluations on public datasets demonstrate that TFS-Net achieves significant improvements over state-of-the-art methods, with gains of 14.6%, 12.0%, and 11.2% in AUC-J, CC, and SIM metrics, respectively. Further experiments on UAV video analysis validate the model’s robustness and practicality in real-world scenarios.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.