基于无人机的灾难受害者搜索中人声重点成像的创新颜色图

Tomokichi Furusawa, C. Premachandra
{"title":"基于无人机的灾难受害者搜索中人声重点成像的创新颜色图","authors":"Tomokichi Furusawa, C. Premachandra","doi":"10.1109/TENSYMP55890.2023.10223627","DOIUrl":null,"url":null,"abstract":"Unmanned aerial vehicles (UAVs) are being utilized for damage assessment in natural disasters and for search and rescue operations. Currently, the search for victims primarily relies on analyzing images captured by cameras mounted on UAVs. However, this approach has limitations when it comes to locating victims who are not within the camera's field of view. As a result, sound-based search methods are being considered. In this method, a voice message is transmitted to the disaster area through a speaker mounted on a UAV, and the presence of victims is confirmed by detecting their response using the onboard microphone of the UAV. Nevertheless, the UAV's microphone captures both the sound of the victim and the propeller rotation, posing a significant challenge in extracting the victim's voice from this combined audio. To address this issue, we propose a solution that involves generating spectrogram images of the sound mixture and the propeller sound, and extracting the human sound by subtracting them. We found that the conventional colormap was ineffective in emphasizing the human sound in the spectrogram images. To overcome this limitation, this paper introduces a new colormap based on the normal distribution. This colormap enhances human voices while attenuating propeller sounds by adjusting the mean and variance. Through the results of our experiments, we confirm that the proposed colormap effectively reduces propeller sound interference in the sound mixing and simultaneously emphasizes the voice of a disaster victim. By utilizing the proposed colormap, it becomes possible to visualize the victim's voice from the audio mixture acquired by the UAV's onboard microphone, enabling the identification of the victim.","PeriodicalId":314726,"journal":{"name":"2023 IEEE Region 10 Symposium (TENSYMP)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Innovative Colormap for Emphatic Imaging of Human Voice for UAV-Based Disaster Victim Search\",\"authors\":\"Tomokichi Furusawa, C. Premachandra\",\"doi\":\"10.1109/TENSYMP55890.2023.10223627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Unmanned aerial vehicles (UAVs) are being utilized for damage assessment in natural disasters and for search and rescue operations. Currently, the search for victims primarily relies on analyzing images captured by cameras mounted on UAVs. However, this approach has limitations when it comes to locating victims who are not within the camera's field of view. As a result, sound-based search methods are being considered. In this method, a voice message is transmitted to the disaster area through a speaker mounted on a UAV, and the presence of victims is confirmed by detecting their response using the onboard microphone of the UAV. Nevertheless, the UAV's microphone captures both the sound of the victim and the propeller rotation, posing a significant challenge in extracting the victim's voice from this combined audio. To address this issue, we propose a solution that involves generating spectrogram images of the sound mixture and the propeller sound, and extracting the human sound by subtracting them. We found that the conventional colormap was ineffective in emphasizing the human sound in the spectrogram images. To overcome this limitation, this paper introduces a new colormap based on the normal distribution. This colormap enhances human voices while attenuating propeller sounds by adjusting the mean and variance. Through the results of our experiments, we confirm that the proposed colormap effectively reduces propeller sound interference in the sound mixing and simultaneously emphasizes the voice of a disaster victim. By utilizing the proposed colormap, it becomes possible to visualize the victim's voice from the audio mixture acquired by the UAV's onboard microphone, enabling the identification of the victim.\",\"PeriodicalId\":314726,\"journal\":{\"name\":\"2023 IEEE Region 10 Symposium (TENSYMP)\",\"volume\":\"81 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE Region 10 Symposium (TENSYMP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENSYMP55890.2023.10223627\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Region 10 Symposium (TENSYMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENSYMP55890.2023.10223627","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

无人驾驶飞行器(uav)正在被用于自然灾害的损害评估和搜索和救援行动。目前,搜寻遇难者主要依靠分析安装在无人机上的摄像头拍摄的图像。然而,这种方法在定位不在相机视野范围内的受害者时存在局限性。因此,基于声音的搜索方法正在考虑之中。在这种方法中,通过安装在无人机上的扬声器将语音信息传输到灾区,并通过无人机上的机载麦克风检测受害者的反应来确认受害者的存在。然而,无人机的麦克风捕捉到受害者的声音和螺旋桨旋转的声音,这对从这个组合音频中提取受害者的声音提出了重大挑战。为了解决这个问题,我们提出了一个解决方案,包括生成声音混合和螺旋桨声音的频谱图图像,并通过减去它们来提取人的声音。我们发现传统的色图在声谱图图像中对人声的强调是无效的。为了克服这一限制,本文引入了一种新的基于正态分布的颜色图。这种颜色图通过调整平均值和方差来增强人声,同时减弱螺旋桨的声音。通过实验结果,我们证实了所提出的颜色图可以有效地减少混音中的螺旋桨声干扰,同时强调灾民的声音。通过利用所提出的颜色图,从无人机机载麦克风获得的音频混合中可视化受害者的声音成为可能,从而能够识别受害者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Innovative Colormap for Emphatic Imaging of Human Voice for UAV-Based Disaster Victim Search
Unmanned aerial vehicles (UAVs) are being utilized for damage assessment in natural disasters and for search and rescue operations. Currently, the search for victims primarily relies on analyzing images captured by cameras mounted on UAVs. However, this approach has limitations when it comes to locating victims who are not within the camera's field of view. As a result, sound-based search methods are being considered. In this method, a voice message is transmitted to the disaster area through a speaker mounted on a UAV, and the presence of victims is confirmed by detecting their response using the onboard microphone of the UAV. Nevertheless, the UAV's microphone captures both the sound of the victim and the propeller rotation, posing a significant challenge in extracting the victim's voice from this combined audio. To address this issue, we propose a solution that involves generating spectrogram images of the sound mixture and the propeller sound, and extracting the human sound by subtracting them. We found that the conventional colormap was ineffective in emphasizing the human sound in the spectrogram images. To overcome this limitation, this paper introduces a new colormap based on the normal distribution. This colormap enhances human voices while attenuating propeller sounds by adjusting the mean and variance. Through the results of our experiments, we confirm that the proposed colormap effectively reduces propeller sound interference in the sound mixing and simultaneously emphasizes the voice of a disaster victim. By utilizing the proposed colormap, it becomes possible to visualize the victim's voice from the audio mixture acquired by the UAV's onboard microphone, enabling the identification of the victim.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信