Estimation of the Perceived Time of Presence of Sources in Urban Acoustic Environments Using Deep Learning Techniques

Q1 Arts and Humanities
Félix Gontier, C. Lavandier, P. Aumond, M. Lagrange, J. Petiot
{"title":"Estimation of the Perceived Time of Presence of Sources in Urban Acoustic Environments Using Deep Learning Techniques","authors":"Félix Gontier, C. Lavandier, P. Aumond, M. Lagrange, J. Petiot","doi":"10.3813/aaa.919384","DOIUrl":null,"url":null,"abstract":"The impact of urban sound on human beings has often been studied from a negative point of view (noise pollution). In the two last decades, the interest of studying its positive impact has been revealed with the soundscape approach (resourcing spaces). The literature shows that the recognition\n of sources plays a great role in the way humans are affected by sound environments. There is thus a need for characterizing urban acoustic environments not only with sound pressure measurements but also with source-specific attributes such as their perceived time of presence, dominance or\n volume. This paper demonstrates, on a controlled dataset, that machine learning techniques based on state of the art neural architectures can predict the perceived time of presence of several sound sources at a sufficient accuracy. To validate this assertion, a corpus of simulated sound\n scenes is first designed. Perceptual attributes corresponding to those stimuli are gathered through a listening experiment. From the contributions of the individual sound sources available for the simulated corpus, a physical indicator approximating the perceived time of presence of sources\n is computed and used to train and evaluate a multi-label source detection model. This model predicts the presence of simultaneously active sources from fast third octave spectra, allowing the estimation of perceptual attributes such as pleasantness in urban sound environments at a sufficient\n degree of precision.","PeriodicalId":35085,"journal":{"name":"Acta Acustica united with Acustica","volume":"46 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Acustica united with Acustica","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3813/aaa.919384","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Arts and Humanities","Score":null,"Total":0}
引用次数: 11

Abstract

The impact of urban sound on human beings has often been studied from a negative point of view (noise pollution). In the two last decades, the interest of studying its positive impact has been revealed with the soundscape approach (resourcing spaces). The literature shows that the recognition of sources plays a great role in the way humans are affected by sound environments. There is thus a need for characterizing urban acoustic environments not only with sound pressure measurements but also with source-specific attributes such as their perceived time of presence, dominance or volume. This paper demonstrates, on a controlled dataset, that machine learning techniques based on state of the art neural architectures can predict the perceived time of presence of several sound sources at a sufficient accuracy. To validate this assertion, a corpus of simulated sound scenes is first designed. Perceptual attributes corresponding to those stimuli are gathered through a listening experiment. From the contributions of the individual sound sources available for the simulated corpus, a physical indicator approximating the perceived time of presence of sources is computed and used to train and evaluate a multi-label source detection model. This model predicts the presence of simultaneously active sources from fast third octave spectra, allowing the estimation of perceptual attributes such as pleasantness in urban sound environments at a sufficient degree of precision.
利用深度学习技术估计城市声环境中声源存在的感知时间
城市声音对人类的影响通常是从消极的角度(噪音污染)来研究的。在过去的二十年里,研究其积极影响的兴趣已经随着声景方法(资源空间)而显露出来。文献表明,对声源的识别在人类受声音环境影响的方式中起着重要作用。因此,不仅需要用声压测量来描述城市声学环境,而且需要用特定于声源的属性来描述城市声学环境,例如它们感知到的存在时间、主导地位或音量。本文在一个受控数据集上证明,基于最先进的神经结构的机器学习技术可以以足够的精度预测几个声源存在的感知时间。为了验证这一断言,首先设计了一个模拟声音场景的语料库。与这些刺激相对应的知觉属性是通过听觉实验收集的。根据模拟语料库中可用的单个声源的贡献,计算近似声源存在的感知时间的物理指标,并用于训练和评估多标签声源检测模型。该模型预测了来自快速三倍频度频谱的同时活跃源的存在,允许以足够的精度估计感知属性,例如城市声音环境中的愉悦度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.60
自引率
0.00%
发文量
0
审稿时长
6.8 months
期刊介绍: Cessation. Acta Acustica united with Acustica (Acta Acust united Ac), was published together with the European Acoustics Association (EAA). It was an international, peer-reviewed journal on acoustics. It published original articles on all subjects in the field of acoustics, such as • General Linear Acoustics, • Nonlinear Acoustics, Macrosonics, • Aeroacoustics, • Atmospheric Sound, • Underwater Sound, • Ultrasonics, • Physical Acoustics, • Structural Acoustics, • Noise Control, • Active Control, • Environmental Noise, • Building Acoustics, • Room Acoustics, • Acoustic Materials and Metamaterials, • Audio Signal Processing and Transducers, • Computational and Numerical Acoustics, • Hearing, Audiology and Psychoacoustics, • Speech, • Musical Acoustics, • Virtual Acoustics, • Auditory Quality of Systems, • Animal Bioacoustics, • History of Acoustics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信