An auditory-based scene change detection in audio data

T. Maka
{"title":"An auditory-based scene change detection in audio data","authors":"T. Maka","doi":"10.1109/ICSES.2014.6948723","DOIUrl":null,"url":null,"abstract":"The problem of auditory scene segmentation plays important role in many audio analysis and processing tasks. The accuracy and robustness of segmentation step have influence on the remaining stages in audio processing chain. In the work a dedicated system for segmentation of the audio stream is presented. The segmentation scheme uses Delta-BIC and metric-based techniques to determine the change-point in audio data. For this purpose a dedicated auditory feature has been proposed, which is based on the gammatone filter bank. The proposed feature (GTEAD) has been designed using inter-channel analysis of the auditory filter bank outputs. For each channel, the temporal envelope and its periodic self-similarities have been calculated. Then, the distances between obtained signals from the neighbouring channels have been computed resulting in the final feature vector. The performance of the GTEAD feature has been compared to the popular MFCC feature using database of audio streams with defined single change-point in each example. The obtained results show that GTEAD feature outperforms MFCC feature in terms of accuracy and the number of detected points.","PeriodicalId":115109,"journal":{"name":"2014 International Conference on Signals and Electronic Systems (ICSES)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Signals and Electronic Systems (ICSES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSES.2014.6948723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The problem of auditory scene segmentation plays important role in many audio analysis and processing tasks. The accuracy and robustness of segmentation step have influence on the remaining stages in audio processing chain. In the work a dedicated system for segmentation of the audio stream is presented. The segmentation scheme uses Delta-BIC and metric-based techniques to determine the change-point in audio data. For this purpose a dedicated auditory feature has been proposed, which is based on the gammatone filter bank. The proposed feature (GTEAD) has been designed using inter-channel analysis of the auditory filter bank outputs. For each channel, the temporal envelope and its periodic self-similarities have been calculated. Then, the distances between obtained signals from the neighbouring channels have been computed resulting in the final feature vector. The performance of the GTEAD feature has been compared to the popular MFCC feature using database of audio streams with defined single change-point in each example. The obtained results show that GTEAD feature outperforms MFCC feature in terms of accuracy and the number of detected points.
音频数据中基于听觉的场景变化检测
听觉场景分割问题在许多音频分析和处理任务中起着重要的作用。分割步骤的准确性和鲁棒性直接影响到音频处理链的其余步骤。本文提出了一种音频流分割专用系统。该分割方案使用Delta-BIC和基于度量的技术来确定音频数据中的变化点。为此,提出了一种基于伽玛酮滤波器组的专用听觉特征。所提出的特征(GTEAD)是利用听觉滤波器组输出的通道间分析来设计的。对于每个信道,计算了时域包络及其周期自相似度。然后,计算从相邻通道获得的信号之间的距离,从而得到最终的特征向量。在每个示例中,使用定义单个更改点的音频流数据库,将GTEAD特性的性能与流行的MFCC特性进行了比较。结果表明,GTEAD特征在精度和检测点数量上都优于MFCC特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信