基于分布式交换机与视频监控融合的周界安防多模态识别方法研究

IF 4.3 2区综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Journal Pub Date : 2025-07-02 DOI:10.1109/JSEN.2025.3582973

Wei Zhao;Shaodong Jiang;Yang Zhao;Faxiang Zhang

{"title":"基于分布式交换机与视频监控融合的周界安防多模态识别方法研究","authors":"Wei Zhao;Shaodong Jiang;Yang Zhao;Faxiang Zhang","doi":"10.1109/JSEN.2025.3582973","DOIUrl":null,"url":null,"abstract":"Perimeter security systems widely employ distributed fiber-optic sensing technology and video surveillance as sensing means. However, significant limitations remain in practical applications. Distributed fiber-optic sensing technology is susceptible to interference from environmental noise coupling, resulting in a high false alarm rate. Meanwhile, video surveillance technology faces issues such as increased image noise and blurred target outlines in complex environments. These problems are compounded by the complexity of the background, which makes it difficult to accurately identify subtle behavioral differences. To address these challenges, this article proposes a multimodal fusion classification model, HMFusionNet, which leverages the complementary information from distributed vibration sensing (DVS) and video surveillance to improve classification accuracy. First, we introduce the CGANet module to extract features from 1-D fiber vibration signals and capture the periodic characteristics of the fiber time series. Second, we design the PoseMobiNet module to extract 2-D image features based on human keypoint data and RGB image information, addressing the complexities of the perimeter security background and the subtleties of behavioral differences among intruders. During the feature fusion stage, we propose a probabilistic weighting-based late fusion strategy to integrate decision-level information from both modalities. Finally, using a multimodal dataset constructed based on a real-world perimeter security scenario, the HMFusionNet achieves a detection accuracy of 97.7% with a recognition time of less than 0.1 s.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 15","pages":"29213-29220"},"PeriodicalIF":4.3000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on Multimodal Recognition Methods for Perimeter Security Based on the Fusion of DVS and Video Surveillance\",\"authors\":\"Wei Zhao;Shaodong Jiang;Yang Zhao;Faxiang Zhang\",\"doi\":\"10.1109/JSEN.2025.3582973\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Perimeter security systems widely employ distributed fiber-optic sensing technology and video surveillance as sensing means. However, significant limitations remain in practical applications. Distributed fiber-optic sensing technology is susceptible to interference from environmental noise coupling, resulting in a high false alarm rate. Meanwhile, video surveillance technology faces issues such as increased image noise and blurred target outlines in complex environments. These problems are compounded by the complexity of the background, which makes it difficult to accurately identify subtle behavioral differences. To address these challenges, this article proposes a multimodal fusion classification model, HMFusionNet, which leverages the complementary information from distributed vibration sensing (DVS) and video surveillance to improve classification accuracy. First, we introduce the CGANet module to extract features from 1-D fiber vibration signals and capture the periodic characteristics of the fiber time series. Second, we design the PoseMobiNet module to extract 2-D image features based on human keypoint data and RGB image information, addressing the complexities of the perimeter security background and the subtleties of behavioral differences among intruders. During the feature fusion stage, we propose a probabilistic weighting-based late fusion strategy to integrate decision-level information from both modalities. Finally, using a multimodal dataset constructed based on a real-world perimeter security scenario, the HMFusionNet achieves a detection accuracy of 97.7% with a recognition time of less than 0.1 s.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 15\",\"pages\":\"29213-29220\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11062474/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11062474/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

周界安防系统广泛采用分布式光纤传感技术和视频监控作为传感手段。然而，在实际应用中仍有很大的局限性。分布式光纤传感技术容易受到环境噪声耦合的干扰，虚警率高。同时，视频监控技术也面临着复杂环境下图像噪声增大、目标轮廓模糊等问题。这些问题由于背景的复杂性而变得更加复杂，这使得准确识别细微的行为差异变得困难。为了解决这些挑战，本文提出了一种多模态融合分类模型HMFusionNet，该模型利用分布式振动传感（DVS）和视频监控的互补信息来提高分类精度。首先，我们引入CGANet模块从一维光纤振动信号中提取特征，并捕获光纤时间序列的周期特征。其次，我们设计了PoseMobiNet模块，基于人体关键点数据和RGB图像信息提取二维图像特征，解决了周界安全背景的复杂性和入侵者行为差异的微妙性。在特征融合阶段，我们提出了一种基于概率加权的后期融合策略来整合两种模式的决策级信息。最后，使用基于真实周界安全场景构建的多模态数据集，HMFusionNet实现了97.7%的检测准确率，识别时间小于0.1 s。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Research on Multimodal Recognition Methods for Perimeter Security Based on the Fusion of DVS and Video Surveillance

Perimeter security systems widely employ distributed fiber-optic sensing technology and video surveillance as sensing means. However, significant limitations remain in practical applications. Distributed fiber-optic sensing technology is susceptible to interference from environmental noise coupling, resulting in a high false alarm rate. Meanwhile, video surveillance technology faces issues such as increased image noise and blurred target outlines in complex environments. These problems are compounded by the complexity of the background, which makes it difficult to accurately identify subtle behavioral differences. To address these challenges, this article proposes a multimodal fusion classification model, HMFusionNet, which leverages the complementary information from distributed vibration sensing (DVS) and video surveillance to improve classification accuracy. First, we introduce the CGANet module to extract features from 1-D fiber vibration signals and capture the periodic characteristics of the fiber time series. Second, we design the PoseMobiNet module to extract 2-D image features based on human keypoint data and RGB image information, addressing the complexities of the perimeter security background and the subtleties of behavioral differences among intruders. During the feature fusion stage, we propose a probabilistic weighting-based late fusion strategy to integrate decision-level information from both modalities. Finally, using a multimodal dataset constructed based on a real-world perimeter security scenario, the HMFusionNet achieves a detection accuracy of 97.7% with a recognition time of less than 0.1 s.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Sensors Journal 工程技术-工程：电子与电气

CiteScore

7.70

自引率

14.00%

发文量

2058

审稿时长

5.2 months

期刊介绍： The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice