Visualizing Directional Soundscapes of Bird Vocalizations Using Robot Audition Techniques

Reiji Suzuki, Hao Zhao, Shinji Sumitani, Shiho Matsubayashi, Takaya Arita, K. Nakadai, H. Okuno
DOI: 10.1109/IEEECONF49454.2021.9382639
Published in: 2021 IEEE/SICE International Symposium on System Integration (SII)
Publication date: 2021-01-11

Abstract

Visualizing soundscape dynamics is an important topic in ecoacoustics. However, existing approaches have mainly focused on the soundscape in the frequency domain, although the soundscape in the directional or spatial domain is also essential for better understanding animal vocalizations. This paper proposes and discusses novel applications of robot audition techniques to visualize soundscape dynamics in the directional or spatial domain, using the directional information of sound sources obtained from the robot audition software HARK (Honda Research Institute Japan Audition for Robots with Kyoto University) and the birdsong localization software HARKBird. First, we create a false-color spectrogram that visualizes directional soundscapes, in which the color of the spectrogram reflects the direction of arrival of separated sounds. We also visualize the distribution of directional soundscapes by combining the entropy of the likelihood of sound existence (MUSIC spectrum) with a latent space embedding method (UMAP). We applied these techniques to a 5 min recording of 6 Zebra Finch individuals to show that the extracted visual information can reflect acoustic structures among the group of birds in the directional domain.
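
The two visual quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and does not use HARK or HARKBird. The function names `direction_to_rgb` and `directional_entropy` are hypothetical, and the sketch assumes that a direction of arrival is given as an azimuth in degrees and that the MUSIC spectrum for one time frame is available as non-negative, linear-scale power values, one per candidate direction. The UMAP embedding step is not shown.

```python
import colorsys
import math


def direction_to_rgb(azimuth_deg, power=1.0):
    """Map a direction of arrival to an RGB colour (hypothetical helper).

    Hue encodes the azimuth (0-360 degrees wraps around the colour wheel);
    brightness encodes a normalized power value clipped to [0, 1].
    """
    hue = (azimuth_deg % 360.0) / 360.0
    value = min(max(power, 0.0), 1.0)
    return colorsys.hsv_to_rgb(hue, 1.0, value)


def directional_entropy(spectrum):
    """Shannon entropy in bits of one frame of a directional spectrum.

    Assumes `spectrum` holds non-negative linear power values, one per
    candidate direction. The values are normalized to sum to 1 first.
    """
    total = sum(spectrum)
    if total <= 0:
        raise ValueError("spectrum must contain positive power")
    probs = [p / total for p in spectrum]
    return sum(-p * math.log2(p) for p in probs if p > 0)


if __name__ == "__main__":
    # A sound arriving from azimuth 0 at full power maps to one pure hue.
    print(direction_to_rgb(0.0))
    # Power spread evenly over directions gives high entropy;
    # power concentrated in one direction gives low entropy.
    print(directional_entropy([1.0, 1.0, 1.0, 1.0]))
    print(directional_entropy([0.0, 0.0, 5.0, 0.0]))
```

Under these assumptions, a frame dominated by a single source direction yields an entropy near zero, while a frame with sound arriving from many directions yields a value near the logarithm of the number of directions. Per-frame values of this kind are the sort of feature that could then be passed to an embedding method.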