基于感知的机器环境声音异构信息显著性特征融合方法

Jingyu Wang, Ke Zhang, K. Madani, C. Sabourin
{"title":"基于感知的机器环境声音异构信息显著性特征融合方法","authors":"Jingyu Wang, Ke Zhang, K. Madani, C. Sabourin","doi":"10.1109/ICAWST.2013.6765433","DOIUrl":null,"url":null,"abstract":"Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of \"sense\" or \"awareness\", which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine's awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine's environment sounds based awareness.","PeriodicalId":68697,"journal":{"name":"炎黄地理","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Heterogeneous information saliency features' fusion approach for machine's environment sounds based awareness\",\"authors\":\"Jingyu Wang, Ke Zhang, K. Madani, C. Sabourin\",\"doi\":\"10.1109/ICAWST.2013.6765433\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of \\\"sense\\\" or \\\"awareness\\\", which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine's awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine's environment sounds based awareness.\",\"PeriodicalId\":68697,\"journal\":{\"name\":\"炎黄地理\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"炎黄地理\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAWST.2013.6765433\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"炎黄地理","FirstCategoryId":"1089","ListUrlMain":"https://doi.org/10.1109/ICAWST.2013.6765433","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

人类在处理日常生活中发生的声音时比机器人或其他无人驾驶地面车辆更聪明,因为人类具有“感觉”或“意识”的本能,即区分周围环境中最显著的声音、物体或事件的能力。本文以人类听觉系统的生物声感知和人类视觉的视觉显著性天赋为灵感,提出了一种模拟人类对环境声音的感知以实现机器感知的异构信息显著性特征融合(HISFF)方法。利用短时傅里叶变换(STFT)算法对声音信号进行可视化处理,将声音显著性转化为视觉显著性,并利用Mel-Frequency倒频谱系数(MFCC)表征人的声意识。利用室内和室外环境的真实环境声数据对所提出的HISFF方法进行了测试。结果表明,该方法能够成功且清晰地从长期和短期声音信号中提取显著性信号,为机器基于环境声音的感知提供了非常容易区分的特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Heterogeneous information saliency features' fusion approach for machine's environment sounds based awareness
Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of "sense" or "awareness", which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine's awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine's environment sounds based awareness.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
784
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信