Portable Keyword Spotting and Sound Source Detection System Design on Mobile Robot with Mini Microphone Array
Authors: Muhammad Bagus Andra, T. Usagawa
Venue: 2020 6th International Conference on Control, Automation and Robotics (ICCAR)
Publication date: 2020-04-01
DOI: https://doi.org/10.1109/ICCAR49639.2020.9108086
The ability to locate a sound source and detect speech is an essential feature of robot audition, enabling a robot to interact with humans and perform sophisticated tasks. Recent developments in mini microphone arrays have made robot audition much more accessible and offer a flexible platform for building a robot hearing system. This research proposes a keyword spotting and sound source detection system built on the ReSpeaker Core v2.0 microphone array. We use the Frequency Domain Binaural Model (FDBM) to detect the Direction of Arrival (DOA) of the sound source, and we estimate the distance to the source by exploiting the robot's mobility to perform triangulation. A separate model based on a Long Short-Term Memory (LSTM) network performs the keyword spotting task. We evaluate the average accuracy of the DOA and distance estimates and the Word Error Rate (WER) of keyword spotting. We also compare the system's performance in an open-field, non-reverberant setting with an ideal simulation.
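
The abstract does not spell out the triangulation geometry, so the following is only a minimal sketch of one common way to do it, not the authors' implementation. It assumes a 2D plane, two robot positions known from odometry, and DOA bearings already converted to a shared world frame (degrees, counterclockwise from the x-axis). The function names `triangulate` and `distance_to_source` are hypothetical.

```python
import math


def triangulate(p1, theta1_deg, p2, theta2_deg):
    """Intersect two bearing lines taken from positions p1 and p2.

    Assumption: bearings are in a shared world frame, not the robot frame.
    Returns the (x, y) intersection, or None if the bearings are nearly
    parallel. This sketch does not check whether the intersection lies
    behind the robot.
    """
    t1 = math.radians(theta1_deg)
    t2 = math.radians(theta2_deg)
    d1 = (math.cos(t1), math.sin(t1))  # unit direction of first bearing
    d2 = (math.cos(t2), math.sin(t2))  # unit direction of second bearing

    # Solve p1 + a*d1 = p2 + b*d2 for a using Cramer's rule.
    denom = d1[0] * d2[1] - d1[1] * d2[0]
    if abs(denom) < 1e-9:
        return None  # parallel bearings give no unique intersection

    dx = p2[0] - p1[0]
    dy = p2[1] - p1[1]
    a = (dx * d2[1] - dy * d2[0]) / denom
    return (p1[0] + a * d1[0], p1[1] + a * d1[1])


def distance_to_source(robot_pos, source_pos):
    """Euclidean distance from the robot to the estimated source."""
    return math.hypot(source_pos[0] - robot_pos[0], source_pos[1] - robot_pos[1])


if __name__ == "__main__":
    # Hypothetical example: the robot measures 45 degrees at the origin,
    # drives 1 m along the x-axis, then measures 90 degrees.
    source = triangulate((0.0, 0.0), 45.0, (1.0, 0.0), 90.0)
    print(source, distance_to_source((1.0, 0.0), source))
```

In a sketch like this, the distance estimate becomes very sensitive to DOA error when the two bearings are close to parallel, so the baseline the robot travels between measurements matters.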