Development of distant multi-channel speech and noise databases for speech recognition by in-door conversational robots

Youngjoo Suh, Younggwan Kim, Hyungjun Lim, Jahyun Goo, Youngmoon Jung, Yeon-Ji Choi, Hoirin Kim, Dae-Lim Choi, Yong-Ju Lee
DOI: 10.1109/ICSDA.2017.8384419 (https://doi.org/10.1109/ICSDA.2017.8384419)
Published in: 2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA)
Publication date: 2017-11-01
Citations: 5

Abstract

Development of distant multi-channel speech and noise databases for speech recognition by in-door conversational robots
In this paper, we present the method and procedure for collecting Korean distant multi-channel speech and noise databases, designed for developing a highly accurate distant speech recognition system for indoor conversational robot applications. The speech database was collected at four different distant positions in an indoor room furnished to acoustically simulate a living room. Collection used a playback-and-recording method: an artificial mouth plays the clean source speech data, and three kinds of multi-channel microphone arrays record the distant speech. The speech database consists of a read speech dataset and two conversational speech datasets. The noise database consists of 12 types of indoor noise, collected at a single distant position with the same approach. These speech and noise databases can be used to create simulated noisy speech data reflecting various indoor acoustic conditions corrupted by room reverberation and additive noise.
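The abstract does not say how the simulated noisy speech is produced. One common approach, shown here only as a minimal sketch and not as the authors' procedure, is to add a recorded noise segment to a recorded distant (already reverberant) speech segment after scaling the noise to a chosen signal-to-noise ratio. The function names `mean_power`, `mix_at_snr`, and `measured_snr_db`, the 16 kHz sample rate, and the synthetic sine and Gaussian signals are all assumptions made for illustration; they are not part of the databases.

```python
import math
import random


def mean_power(signal):
    """Mean power of a signal (average of squared samples)."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech, scaling the noise so the mix has the target SNR in dB.

    Both inputs are single-channel sample sequences of equal length.
    """
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    speech_power = mean_power(speech)
    noise_power = mean_power(noise)
    if speech_power == 0.0 or noise_power == 0.0:
        raise ValueError("speech and noise must both be non-silent")
    # Target noise power follows from SNR_dB = 10 * log10(P_speech / P_noise).
    target_noise_power = speech_power / (10.0 ** (snr_db / 10.0))
    gain = math.sqrt(target_noise_power / noise_power)
    return [s + gain * n for s, n in zip(speech, noise)]


def measured_snr_db(speech, noisy):
    """SNR of a mix, recovered from the speech reference and the noisy result."""
    residual = [y - s for s, y in zip(speech, noisy)]
    return 10.0 * math.log10(mean_power(speech) / mean_power(residual))


# Synthetic stand-ins (assumed) for one channel of a distant speech recording
# and one channel of a noise recording, one second each at 16 kHz.
rng = random.Random(0)
speech = [math.sin(2.0 * math.pi * 220.0 * t / 16000.0) for t in range(16000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(16000)]

noisy = mix_at_snr(speech, noise, snr_db=10.0)
print(round(measured_snr_db(speech, noisy), 2))
```

For multi-channel recordings, one reasonable choice is to compute the gain once on a reference channel and apply the same gain to every noise channel, so that the level differences between microphones are preserved.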