{"title":"基于分布式交换机与视频监控融合的周界安防多模态识别方法研究","authors":"Wei Zhao;Shaodong Jiang;Yang Zhao;Faxiang Zhang","doi":"10.1109/JSEN.2025.3582973","DOIUrl":null,"url":null,"abstract":"Perimeter security systems widely employ distributed fiber-optic sensing technology and video surveillance as sensing means. However, significant limitations remain in practical applications. Distributed fiber-optic sensing technology is susceptible to interference from environmental noise coupling, resulting in a high false alarm rate. Meanwhile, video surveillance technology faces issues such as increased image noise and blurred target outlines in complex environments. These problems are compounded by the complexity of the background, which makes it difficult to accurately identify subtle behavioral differences. To address these challenges, this article proposes a multimodal fusion classification model, HMFusionNet, which leverages the complementary information from distributed vibration sensing (DVS) and video surveillance to improve classification accuracy. First, we introduce the CGANet module to extract features from 1-D fiber vibration signals and capture the periodic characteristics of the fiber time series. Second, we design the PoseMobiNet module to extract 2-D image features based on human keypoint data and RGB image information, addressing the complexities of the perimeter security background and the subtleties of behavioral differences among intruders. During the feature fusion stage, we propose a probabilistic weighting-based late fusion strategy to integrate decision-level information from both modalities. Finally, using a multimodal dataset constructed based on a real-world perimeter security scenario, the HMFusionNet achieves a detection accuracy of 97.7% with a recognition time of less than 0.1 s.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 15","pages":"29213-29220"},"PeriodicalIF":4.3000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on Multimodal Recognition Methods for Perimeter Security Based on the Fusion of DVS and Video Surveillance\",\"authors\":\"Wei Zhao;Shaodong Jiang;Yang Zhao;Faxiang Zhang\",\"doi\":\"10.1109/JSEN.2025.3582973\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Perimeter security systems widely employ distributed fiber-optic sensing technology and video surveillance as sensing means. However, significant limitations remain in practical applications. Distributed fiber-optic sensing technology is susceptible to interference from environmental noise coupling, resulting in a high false alarm rate. Meanwhile, video surveillance technology faces issues such as increased image noise and blurred target outlines in complex environments. These problems are compounded by the complexity of the background, which makes it difficult to accurately identify subtle behavioral differences. To address these challenges, this article proposes a multimodal fusion classification model, HMFusionNet, which leverages the complementary information from distributed vibration sensing (DVS) and video surveillance to improve classification accuracy. First, we introduce the CGANet module to extract features from 1-D fiber vibration signals and capture the periodic characteristics of the fiber time series. Second, we design the PoseMobiNet module to extract 2-D image features based on human keypoint data and RGB image information, addressing the complexities of the perimeter security background and the subtleties of behavioral differences among intruders. During the feature fusion stage, we propose a probabilistic weighting-based late fusion strategy to integrate decision-level information from both modalities. Finally, using a multimodal dataset constructed based on a real-world perimeter security scenario, the HMFusionNet achieves a detection accuracy of 97.7% with a recognition time of less than 0.1 s.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 15\",\"pages\":\"29213-29220\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11062474/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11062474/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Research on Multimodal Recognition Methods for Perimeter Security Based on the Fusion of DVS and Video Surveillance
Perimeter security systems widely employ distributed fiber-optic sensing technology and video surveillance as sensing means. However, significant limitations remain in practical applications. Distributed fiber-optic sensing technology is susceptible to interference from environmental noise coupling, resulting in a high false alarm rate. Meanwhile, video surveillance technology faces issues such as increased image noise and blurred target outlines in complex environments. These problems are compounded by the complexity of the background, which makes it difficult to accurately identify subtle behavioral differences. To address these challenges, this article proposes a multimodal fusion classification model, HMFusionNet, which leverages the complementary information from distributed vibration sensing (DVS) and video surveillance to improve classification accuracy. First, we introduce the CGANet module to extract features from 1-D fiber vibration signals and capture the periodic characteristics of the fiber time series. Second, we design the PoseMobiNet module to extract 2-D image features based on human keypoint data and RGB image information, addressing the complexities of the perimeter security background and the subtleties of behavioral differences among intruders. During the feature fusion stage, we propose a probabilistic weighting-based late fusion strategy to integrate decision-level information from both modalities. Finally, using a multimodal dataset constructed based on a real-world perimeter security scenario, the HMFusionNet achieves a detection accuracy of 97.7% with a recognition time of less than 0.1 s.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice