Yi-Wen Liu, Hang-Ming Liang, Shung-You Lao, Che-Wei Wu, Hung-Kuang Hao, Fan-Jie Kung, Yu-Tse Ho, Pei-Yi Lee, S. Kang
{"title":"Developing “voice care”: Real-time methods for event recognition and localization based on acoustic cues","authors":"Yi-Wen Liu, Hang-Ming Liang, Shung-You Lao, Che-Wei Wu, Hung-Kuang Hao, Fan-Jie Kung, Yu-Tse Ho, Pei-Yi Lee, S. Kang","doi":"10.1109/ICMEW.2014.6890676","DOIUrl":null,"url":null,"abstract":"This paper presents methods for sound recognition in a living space and ways to track the location of the sound sources. Algorithms were developed so sound recognition and localization can both be performed in real time. The sound recognition method is based on Gaussian mixture modeling with outlier rejection. The sound source localization method is based on multiple signal classification (MUSIC) and it borrows the idea of particle filtering to confine the estimation error. Estimates of the sound source location can be successively refined by Kalman filtering. The recognition method was tested with real recordings and achieved > 90% of accuracy in distinguishing 8 classes of sounds while keeping both the false-acceptance and the false-rejection rates below 20%. The localization method was tested in real time and demonstrated the capabilities to track a sound source moving at about 0.3 m/s. These results indicate that the methods, when integrated, can be deployed to the home for acoustic event detection purposes.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW.2014.6890676","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
This paper presents methods for sound recognition in a living space and ways to track the location of the sound sources. Algorithms were developed so sound recognition and localization can both be performed in real time. The sound recognition method is based on Gaussian mixture modeling with outlier rejection. The sound source localization method is based on multiple signal classification (MUSIC) and it borrows the idea of particle filtering to confine the estimation error. Estimates of the sound source location can be successively refined by Kalman filtering. The recognition method was tested with real recordings and achieved > 90% of accuracy in distinguishing 8 classes of sounds while keeping both the false-acceptance and the false-rejection rates below 20%. The localization method was tested in real time and demonstrated the capabilities to track a sound source moving at about 0.3 m/s. These results indicate that the methods, when integrated, can be deployed to the home for acoustic event detection purposes.