Anomaly detection and classification of audio signals with artificial intelligence techniques

Michael Neri
{"title":"Anomaly detection and classification of audio signals with artificial intelligence techniques","authors":"Michael Neri","doi":"10.1016/j.sctalk.2024.100351","DOIUrl":null,"url":null,"abstract":"<div><p>In recent years, video data has been extensively used for surveillance purposes. Anyway, if a fight can be recognized by everyone, abnormal sounds could pass unnoticed. Moreover, if a dangerous event is not in our line of sight, the only cue that can be exploited is the sound produced by the threat. With the adoption of Artificial Intelligence-based techniques, it is possible to detect these anomalies by inspecting the videos acquired by cameras located inside the bus. However, this scenario is complex for several reasons since video analysis is computationally expensive, requiring costly hardware equipment for processing and storage. Moreover, videos suffer from occlusions and luminance variations, making the system not suitable in all situations. To this aim, the objective of my Ph.D. is to propose a data-driven framework that can detect if an audio recording is anomalous and, if this is the case, to identify which and where dangerous events are occurring. The architecture I propose, denoted as Coarse-to-Fine, is composed of two elements. The first is responsible for modeling the normal background of a target environment in an unsupervised fashion. If an anomalous audio is detected, a second element focuses on what, when, and where the anomaly occurs.</p></div>","PeriodicalId":101148,"journal":{"name":"Science Talks","volume":"10 ","pages":"Article 100351"},"PeriodicalIF":0.0000,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772569324000598/pdfft?md5=91bbbcfe2274a0363f48302e81847301&pid=1-s2.0-S2772569324000598-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Talks","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772569324000598","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, video data has been extensively used for surveillance purposes. Anyway, if a fight can be recognized by everyone, abnormal sounds could pass unnoticed. Moreover, if a dangerous event is not in our line of sight, the only cue that can be exploited is the sound produced by the threat. With the adoption of Artificial Intelligence-based techniques, it is possible to detect these anomalies by inspecting the videos acquired by cameras located inside the bus. However, this scenario is complex for several reasons since video analysis is computationally expensive, requiring costly hardware equipment for processing and storage. Moreover, videos suffer from occlusions and luminance variations, making the system not suitable in all situations. To this aim, the objective of my Ph.D. is to propose a data-driven framework that can detect if an audio recording is anomalous and, if this is the case, to identify which and where dangerous events are occurring. The architecture I propose, denoted as Coarse-to-Fine, is composed of two elements. The first is responsible for modeling the normal background of a target environment in an unsupervised fashion. If an anomalous audio is detected, a second element focuses on what, when, and where the anomaly occurs.

利用人工智能技术对音频信号进行异常检测和分类
近年来,视频数据被广泛用于监控目的。无论如何,如果大家都能认出打斗声,那么异常声音就可能被忽视。此外,如果危险事件不在我们的视线范围内,唯一可以利用的线索就是威胁发出的声音。采用人工智能技术后,就有可能通过检查巴士内摄像头获取的视频来检测这些异常情况。然而,由于视频分析的计算成本很高,需要昂贵的硬件设备进行处理和存储,因此这种情况非常复杂。此外,视频还会受到遮挡和亮度变化的影响,因此该系统并不适用于所有情况。为此,我的博士论文的目标是提出一个数据驱动的框架,该框架可以检测音频记录是否异常,如果异常,则可以识别哪些危险事件以及在哪里发生。我提出的架构被称为 "从粗到细"(Coarse-to-Fine),由两个要素组成。第一个元素负责对目标环境的正常背景进行无监督建模。如果检测到异常音频,第二个元素将重点关注异常发生的时间和地点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信