一种多模态音频、可见和红外监视系统(MAVISS)

A. Mittal, P. Kumar
{"title":"一种多模态音频、可见和红外监视系统(MAVISS)","authors":"A. Mittal, P. Kumar","doi":"10.1109/ICISIP.2005.1619428","DOIUrl":null,"url":null,"abstract":"This paper presents a low cost surveillance system employing multimodal information (visible, infrared and audio signals) for monitoring small area and detecting alarming events. To ensure efficient and robust operation, the system captures different aspects of the environment using audio and video information. Infrared imagery is usedfor night and other low level lighting situations. The visual processing module of the system uses a motion based approach for detecting objects, and employs Kalman filter model for tracking its motion. Environmental sound is recognized by processing audio signals to extract features in the form of Mel-Frequency Cepstral coefficients (MFCC), which are then used for classification by Dynamic Time Warping (DTW) technique. Semantic rules are proposed to identify alarming events by using information from audio and video module. Experimental results are shown on some typical sequences and publicly available dataset.","PeriodicalId":261916,"journal":{"name":"2005 3rd International Conference on Intelligent Sensing and Information Processing","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"A Multimodal Audio Visible and Infrared Surveillance System (MAVISS)\",\"authors\":\"A. Mittal, P. Kumar\",\"doi\":\"10.1109/ICISIP.2005.1619428\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a low cost surveillance system employing multimodal information (visible, infrared and audio signals) for monitoring small area and detecting alarming events. To ensure efficient and robust operation, the system captures different aspects of the environment using audio and video information. Infrared imagery is usedfor night and other low level lighting situations. The visual processing module of the system uses a motion based approach for detecting objects, and employs Kalman filter model for tracking its motion. Environmental sound is recognized by processing audio signals to extract features in the form of Mel-Frequency Cepstral coefficients (MFCC), which are then used for classification by Dynamic Time Warping (DTW) technique. Semantic rules are proposed to identify alarming events by using information from audio and video module. Experimental results are shown on some typical sequences and publicly available dataset.\",\"PeriodicalId\":261916,\"journal\":{\"name\":\"2005 3rd International Conference on Intelligent Sensing and Information Processing\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-12-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2005 3rd International Conference on Intelligent Sensing and Information Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICISIP.2005.1619428\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 3rd International Conference on Intelligent Sensing and Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISIP.2005.1619428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

摘要

本文介绍了一种利用多模态信息(可见光、红外和音频信号)实现小范围监控和报警事件检测的低成本监控系统。为了确保高效和稳健的操作,该系统使用音频和视频信息捕获环境的不同方面。红外图像用于夜间和其他低水平照明情况。系统的视觉处理模块采用基于运动的方法检测目标,并采用卡尔曼滤波模型跟踪目标的运动。环境声音的识别是通过对音频信号进行处理,提取Mel-Frequency倒谱系数(MFCC)形式的特征,然后通过动态时间扭曲(DTW)技术将其用于分类。利用音频和视频模块的信息,提出了识别报警事件的语义规则。在一些典型序列和公开数据集上给出了实验结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Multimodal Audio Visible and Infrared Surveillance System (MAVISS)
This paper presents a low cost surveillance system employing multimodal information (visible, infrared and audio signals) for monitoring small area and detecting alarming events. To ensure efficient and robust operation, the system captures different aspects of the environment using audio and video information. Infrared imagery is usedfor night and other low level lighting situations. The visual processing module of the system uses a motion based approach for detecting objects, and employs Kalman filter model for tracking its motion. Environmental sound is recognized by processing audio signals to extract features in the form of Mel-Frequency Cepstral coefficients (MFCC), which are then used for classification by Dynamic Time Warping (DTW) technique. Semantic rules are proposed to identify alarming events by using information from audio and video module. Experimental results are shown on some typical sequences and publicly available dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信