Information Fusion最新文献

筛选
英文 中文
Rethinking multi-level information fusion in temporal graphs: Pre-training then distilling for better embedding
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-29 DOI: 10.1016/j.inffus.2025.103127
Meng Liu , Yong Liu , Qianqian Ren , Meng Han
{"title":"Rethinking multi-level information fusion in temporal graphs: Pre-training then distilling for better embedding","authors":"Meng Liu ,&nbsp;Yong Liu ,&nbsp;Qianqian Ren ,&nbsp;Meng Han","doi":"10.1016/j.inffus.2025.103127","DOIUrl":"10.1016/j.inffus.2025.103127","url":null,"abstract":"<div><div>Temporal graphs occupy an important place in graph data, which store node interactions in sequences, thus enabling a more microscopic view of each node’s dynamics. However, many temporal graph methods primarily concentrate on shallow-level temporal or neighborhood information, while acquiring deep-level community or global graph information necessitates increased computational costs, thereby significantly impacting model efficiency. Inspired by this, we rethink how this information is acquired: if it is difficult to acquire it during model training, why not obtain it before training? Consequently, we propose ReMIT, a novel method for temporal graph learning, which incorporates the concepts of feature pre-training and knowledge distillation to Rethink the embedding of Multi-level Information fusion in Temporal graphs. ReMIT facilitates the “remitting” of prior knowledge to model, wherein hard-to-access information is captured and distilled to the train module by introducing a pre-train module. Experimental results on multiple real-world datasets validate the validity and feasibility of our proposed framework. Our method improves performance by up to 10.2% while reducing almost 30% training time.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103127"},"PeriodicalIF":14.7,"publicationDate":"2025-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143768933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Urban region function classification via fusing optical imagery and social media data: A spatio-temporal Transformer interaction approach
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-28 DOI: 10.1016/j.inffus.2025.103140
Ruiyang Sun , Xin Su , Qiangqiang Yuan , Hongzan Jiao , Jiang He , Li Zheng
{"title":"Urban region function classification via fusing optical imagery and social media data: A spatio-temporal Transformer interaction approach","authors":"Ruiyang Sun ,&nbsp;Xin Su ,&nbsp;Qiangqiang Yuan ,&nbsp;Hongzan Jiao ,&nbsp;Jiang He ,&nbsp;Li Zheng","doi":"10.1016/j.inffus.2025.103140","DOIUrl":"10.1016/j.inffus.2025.103140","url":null,"abstract":"<div><div>Urban management and planning can benefit from the classification of urban functional areas. Existing researches have demonstrated that remote sensing data can provide essential urban surface spatial information for the identification of urban functional areas, and human activities may characterize the dynamic aspects connected to social and economic temporal information in various urban functional regions. However, current methods lack explicit consideration of the mutual properties of spatial and temporal modality, resulting in suboptimal interaction performance. To address the issue, we propose a novel fusion method of spatio-temporal Transformer of remote sensing and social media data for urban region function classification. We design the Multi-scale Vision Transformer (MultiViT) to extract the multi-scale features of optical image data and Convolutional Transformer (ConvTransformer) to obtain the multi-scale temporal scale features of social media time series data. For multi-modal fusion, we create a crucial spatio-temporal fusion path based on self-attention, using the different modalities semantic information regarded as useful priori information. By the supervised and distilled loss function, the merging of the two sub-networks and the main network is taken into account during training. Extensive experiments on public datasets have demonstrated the favorable performance of our spatio-temporal Transformer interaction approach in merging remote sensing and social media data for urban region function classification. The code will be available at <span><span>https://github.com/Ruiyang-Sun/Spatio-temporal-Transformer-for-urfc</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103140"},"PeriodicalIF":14.7,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143783551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incomplete multi-view classification via graph neural network on heterogeneous graph
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-28 DOI: 10.1016/j.inffus.2025.103137
Guoqing Chao , Jingnan Qi , Jingxuan Li , Bumshik Lee , Dianhui Chu
{"title":"Incomplete multi-view classification via graph neural network on heterogeneous graph","authors":"Guoqing Chao ,&nbsp;Jingnan Qi ,&nbsp;Jingxuan Li ,&nbsp;Bumshik Lee ,&nbsp;Dianhui Chu","doi":"10.1016/j.inffus.2025.103137","DOIUrl":"10.1016/j.inffus.2025.103137","url":null,"abstract":"<div><div>Incomplete multi-view classification aims to classify the multi-view data with missing views. Several works have been proposed to impute the missing views and then conduct the existing multi-view classification, or conduct these two tasks simultaneously. However, the final classification performance of these works depends heavily on the missing view imputation. Unlike these existing works, in this paper, we propose a novel Incomplete Multi-view Classification method with Graph neural network on Heterogeneous graph (IMCGH). We transform the multi-view data into a heterogeneous graph by mapping each sample in each view to a node of a different type. Missing views can be regarded as learning using a subgraph of the heterogeneous graph, allowing our method to conduct incomplete multi-view classification naturally. We also design the loss functions based on mutual information to exploit the consistency and complementarity of information within multi-view data. Experimental results on several benchmark datasets illustrate the effectiveness and superiority of the proposed method compared with its state-of-the-art competitors in the transductive and inductive learning tasks.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103137"},"PeriodicalIF":14.7,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143768936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Poisoning attacks resilient privacy-preserving federated learning scheme based on lightweight homomorphic encryption
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-27 DOI: 10.1016/j.inffus.2025.103131
Chong Zhang , Xiaojun Zhang , Xingchun Yang , Bingyun Liu , Yuan Zhang , Rang Zhou
{"title":"Poisoning attacks resilient privacy-preserving federated learning scheme based on lightweight homomorphic encryption","authors":"Chong Zhang ,&nbsp;Xiaojun Zhang ,&nbsp;Xingchun Yang ,&nbsp;Bingyun Liu ,&nbsp;Yuan Zhang ,&nbsp;Rang Zhou","doi":"10.1016/j.inffus.2025.103131","DOIUrl":"10.1016/j.inffus.2025.103131","url":null,"abstract":"<div><div>Federated learning ensures that multiple participants train the same model without leaking the local raw data. Each participant uploads the local gradient model instead of the original data, however, the uploaded local gradient model may contain certain sensitive information, which can be exploited by an adversary to break privacy protection. Meanwhile, some adversaries can make the model training results contrary to the expected results by tampering with the uploaded local gradient model or mixing malicious data into the local dataset, thereby inducing the model to produce wrong results for specific data. To this end, we devise a privacy-preserving federated learning scheme based on lightweight homomorphic encryption, which simultaneously reduces the weight of malicious data in gradient aggregation and supports anomaly detection of data, achieves the effect of resistance to poisoning attacks. Through theoretical analysis and experimental simulation, the proposed scheme has lightweight computation advantages compared with existing federated learning schemes.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103131"},"PeriodicalIF":14.7,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143740001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PGA-DRL: Progressive graph attention-based deep reinforcement learning for recommender systems
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-27 DOI: 10.1016/j.inffus.2025.103167
Jawad Tanveer , Sang-Woong Lee , Amir Masoud Rahmani , Khursheed Aurangzeb , Mahfooz Alam , Gholamreza Zare , Pegah Malekpour Alamdari , Mehdi Hosseinzadeh
{"title":"PGA-DRL: Progressive graph attention-based deep reinforcement learning for recommender systems","authors":"Jawad Tanveer ,&nbsp;Sang-Woong Lee ,&nbsp;Amir Masoud Rahmani ,&nbsp;Khursheed Aurangzeb ,&nbsp;Mahfooz Alam ,&nbsp;Gholamreza Zare ,&nbsp;Pegah Malekpour Alamdari ,&nbsp;Mehdi Hosseinzadeh","doi":"10.1016/j.inffus.2025.103167","DOIUrl":"10.1016/j.inffus.2025.103167","url":null,"abstract":"<div><div>Advanced graph models, including Graph Convolutional Networks (GCNs) and Graph Attention Networks (GATs), have demonstrated their effectiveness in capturing intricate user-item interactions. However, their integration into Deep Reinforcement Learning (DRL)-based Recommender Systems (RSs) remains relatively underexplored. To address this gap, we propose PGA-DRL, a Progressive Graph Attention-Based DRL model that incrementally fuses GCN and GAT representations via concatenation, effectively combining their complementary strengths to enhance feature representation within an Actor-Critic (AC) framework. This progressive integration refines both global and localized user-item interaction patterns, Specifically, global patterns capture broader user preferences across the entire graph, and localized patterns focus on specific, detailed interactions between closely connected nodes, enabling a more comprehensive understanding of the recommendation environment. We evaluate our approach using extensive experiments on multiple benchmark datasets, including ML-100K, ML-1M, Amazon Subscription Boxes, Amazon Magazine Subscriptions, and ModCloth, employing standard ranking metrics such as Precision@10, Recall@10, NDCG@10, MRR@10, and Hit@10. The experimental results reveal that PGA-DRL outperforms state-of-the-art baselines, such as BPR, NeuMF, and SimGCL, achieving improvements in NDCG@10 and Recall@10. Our core contributions lie in bridging graph-based learning with reinforcement learning through a novel, efficient, and scalable fusion mechanism that enhances recommendation accuracy and ultimately improves user satisfaction. The source code for PGA-DRL is publicly available at <span><span>https://github.com/RS-Research/PGA-DRL</span><svg><path></path></svg></span> to enhance transparency and facilitate future research.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103167"},"PeriodicalIF":14.7,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143759323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Divide and conquer? A combination of judgments method for comparing DSSs. Pairwise comparison vs. holistic paradigms
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-26 DOI: 10.1016/j.inffus.2025.103157
Carlos Sáenz-Royo , Francisco Chiclana
{"title":"Divide and conquer? A combination of judgments method for comparing DSSs. Pairwise comparison vs. holistic paradigms","authors":"Carlos Sáenz-Royo ,&nbsp;Francisco Chiclana","doi":"10.1016/j.inffus.2025.103157","DOIUrl":"10.1016/j.inffus.2025.103157","url":null,"abstract":"<div><div>Despite the prevalence of Decision Support Systems (DSSs) in the field of decision-making, there is a paucity of research dedicated to the evaluation and comparison of these systems. This paper put forward a novel approach to symbolically encoding a DSS, which enables the generalization of comparisons between DSSs for any distribution of performances of the alternatives. The only hypothesis required in the proposed methodology is that the probability of choosing each alternative is proportional to its latent performance. The approach developed is demonstrated with its application to compare two paradigms commonly employed in DSS: holistic versus pairwise. Using a set of three alternatives, the present study provides mathematical proof that a DSS based on the pairwise comparison paradigm achieves higher expected performance than a DSS based on the holistic evaluation paradigm. This result challenges the emerging preference for holistic evaluation of alternatives and suggests that this result may apply to any number of alternatives.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103157"},"PeriodicalIF":14.7,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143768935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring EEG and eye movement fusion for multi-class target RSVP-BCI
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-26 DOI: 10.1016/j.inffus.2025.103135
Xujin Li , Wei Wei , Kun Zhao , Jiayu Mao , Yizhuo Lu , Shuang Qiu , Huiguang He
{"title":"Exploring EEG and eye movement fusion for multi-class target RSVP-BCI","authors":"Xujin Li ,&nbsp;Wei Wei ,&nbsp;Kun Zhao ,&nbsp;Jiayu Mao ,&nbsp;Yizhuo Lu ,&nbsp;Shuang Qiu ,&nbsp;Huiguang He","doi":"10.1016/j.inffus.2025.103135","DOIUrl":"10.1016/j.inffus.2025.103135","url":null,"abstract":"<div><div>Rapid Serial Visual Presentation (RSVP)-based Brain-Computer Interfaces (BCIs) enable high-throughput target image detection by identifying event-related potentials (ERPs) in electroencephalography (EEG) signals. Traditional RSVP-BCI systems detect only single-class targets within image streams, limiting their ability to handle more complex tasks requiring multi-class target identification. Multi-class target RSVP-BCI systems are designed to detect multi-class targets in real-world scenarios. However, distinguishing between different target categories remains challenging due to the high similarity across ERPs evoked by different target categories. In this work, we incorporate the eye movement (EM) modality into traditional EEG-based RSVP decoding and develop an open-source multi-modal dataset comprising EM and EEG signals from 43 subjects in three multi-class target RSVP tasks. We further propose the <strong>M</strong>ulti-class <strong>T</strong>arget <strong>R</strong>SVP <strong>E</strong>EG and <strong>E</strong>M fusion <strong>Net</strong>work (MTREE-Net) to enhance multi-class RSVP decoding. Specifically, a dual-complementary module is designed to strengthen the differentiation of uni-modal features across categories. To achieve more effective multi-modal fusion, we adopt a dynamic reweighting fusion strategy guided by theoretically derived modality contribution ratios for optimization. Furthermore, we propose a hierarchical self-distillation module to reduce the misclassification of non-target samples through knowledge transfer between two hierarchical classifiers. Extensive experiments demonstrate that MTREE-Net achieves significant performance improvements, including over 5.4% and 3.32% increases in balanced accuracy compared to existing EEG decoding and EEG-EM fusion methods, respectively. Our research offers a promising framework that can simultaneously detect target existence and identify their specific categories, enabling more robust and efficient applications in scenarios such as multi-class target detection.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103135"},"PeriodicalIF":14.7,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143735106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multilevel feature encoder for transfer learning-based fault detection on acoustic signal
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-26 DOI: 10.1016/j.inffus.2025.103128
Dezheng Wang , Congyan Chen
{"title":"Multilevel feature encoder for transfer learning-based fault detection on acoustic signal","authors":"Dezheng Wang ,&nbsp;Congyan Chen","doi":"10.1016/j.inffus.2025.103128","DOIUrl":"10.1016/j.inffus.2025.103128","url":null,"abstract":"<div><div>The intelligent diagnosis of faults in industrial assets is crucial for preventing unexpected disruptions to critical services. Although numerous deep learning methods based on acoustic data have been developed to enhance fault detection accuracy, these methods often prove suboptimal in transfer learning due to two key challenges: (1) insufficient generalization capability that causes overfitting to source device characteristics, and (2) failure to capture domain-invariant patterns essential for cross-device fault detection. This work seeks to alleviate these limitations by proposing a multilevel features encoder (MLFE) for transfer learning-based fault detection on acoustic signal. The acoustic data are initially preprocessed with a frequency mask to filter out high-frequency noise. Subsequently, feature engineering techniques are employed to extract several statistical features, such as the mean, standard deviation, and median absolute deviation, etc. with an emphasis on frequency characteristics. Moreover, unsupervised method is then applied to extract additional essential features. These multilevel features are then combined and fed into MLFE to differentiate between faulty and non-faulty signals. After being trained on several source devices, the pre-trained MLFE is transferred to a new target device to evaluate its transfer learning capability. MLFE is evaluated using the pump and fan datasets in MIMII, where it outperforms existing methods and offers a novel solution for transfer learning-based fault detection using acoustic signals.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103128"},"PeriodicalIF":14.7,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143739987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STATIC-LIO: A sliding window and terrain-assisted dynamic points removal LiDAR Inertial Odometry
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-26 DOI: 10.1016/j.inffus.2025.103132
Xuzhe Duan , Qingwu Hu , Mingyao Ai , Pengcheng Zhao , Meng Wu , Jiayuan Li , Chao Xiong
{"title":"STATIC-LIO: A sliding window and terrain-assisted dynamic points removal LiDAR Inertial Odometry","authors":"Xuzhe Duan ,&nbsp;Qingwu Hu ,&nbsp;Mingyao Ai ,&nbsp;Pengcheng Zhao ,&nbsp;Meng Wu ,&nbsp;Jiayuan Li ,&nbsp;Chao Xiong","doi":"10.1016/j.inffus.2025.103132","DOIUrl":"10.1016/j.inffus.2025.103132","url":null,"abstract":"<div><div>With the development of diverse Light Detection and Ranging (LiDAR) sensors, LiDAR-based localization and mapping has become an essential issue in the fields of robotics and autonomous driving. However, the moving object in dynamic environments often introduces errors in LiDAR localization and leaves undesirable traces in the point cloud map. In this work, we propose a novel LiDAR inertial odometry (LIO) framework named STATIC-LIO, Sliding window and Terrain AssisTed dynamIC points removal LiDAR Inertial Odometry, which fuses the geometric, terrestrial, and motion information to enhance the localization and mapping performance. The terrestrial information is extracted through a fast progressive ground segmentation module designed to be compatible with various LiDARs. With the assistance of the terrestrial information, an online dynamic point voting mechanism is proposed to determine the motion information and remove the dynamic points in a point-wise manner. The ground segmentation and dynamic points removal modules are coupled within the sliding window-based STATIC-LIO framework to estimate odometry by leveraging geometric correspondences from ground and static points. We extensively evaluate the proposed framework on both public and real-world datasets encompassing a variety of LiDAR types. The experimental results demonstrate the effectiveness of STATIC-LIO across various datasets and applications, showcasing its superior accuracy by reducing localization errors by up to 92.4% compared to the state-of-the-art LIO framework.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103132"},"PeriodicalIF":14.7,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143715390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AMF-VSN: Adaptive multi-process fusion video steganography based on invertible neural networks
IF 14.7 1区 计算机科学
Information Fusion Pub Date : 2025-03-25 DOI: 10.1016/j.inffus.2025.103130
Yangwen Zhang , Yuling Chen , Hui Dou , Yan Meng , Haojin Zhu
{"title":"AMF-VSN: Adaptive multi-process fusion video steganography based on invertible neural networks","authors":"Yangwen Zhang ,&nbsp;Yuling Chen ,&nbsp;Hui Dou ,&nbsp;Yan Meng ,&nbsp;Haojin Zhu","doi":"10.1016/j.inffus.2025.103130","DOIUrl":"10.1016/j.inffus.2025.103130","url":null,"abstract":"<div><div>The security challenges in information transmission have attracted considerable research focus, particularly in the field of video steganography. Although deep learning advancements have created new research opportunities for video steganography, current models encounter deployment difficulties on mobile devices due to their substantial parameter requirements, which restrict their adaptability to mobile platform constraints. To address these challenges, we propose an adaptive multi-process fusion video steganography based on invertible neural networks. This model incorporates flexible balance adjustment factor and Multi Cross Stage Partial Dense (MCSPDense) Block, which adjusts the parameter count of the MCSPDense Block and the quality of the stego video through the flexible balance adjustment factor. Additionally, the Simple Redundancy Prediction Module (SRPM) has been designed to further minimize model parameters while enhancing the quality of video steganography and restoration. Furthermore, we develop two operational modes to accommodate varying mobile device requirements: Secure Communication (SC) mode for enhanced transmission security and High Quality Recovery (HQR) mode for superior video restoration. Experimental results confirm that compared to existing solutions, our AMF-VSN framework has improved steganography and recovery performance by 3.474 dB and 2.521 dB respectively, reduced parameters by 66.08%, and maintained strong security in mobile deployment scenarios.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"121 ","pages":"Article 103130"},"PeriodicalIF":14.7,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143739986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信