{"title":"Distance metric learning for posteriorgram based keyword search","authors":"Batuhan Gündogdu, M. Saraçlar","doi":"10.1109/ICASSP.2017.7953240","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a neural network based distance metric learning method for a better discrimination in the sequence-matching based keyword search (KWS). In this technique, we conduct a version of Dynamic Time Warping (DTW) based similarity search on the speaker independent posteriorgram space. With this, we aim to compensate for the scarcity of the resources and overcome the out-of-vocabulary (OOV) term problem, which is one of the main issues for KWS on low-resource languages. This distance measure is then used in the DTW-based similarity search, as an alternative and in comparison to the widely and generally used distance metrics. The experiments ran on IARPA Babel Program's Turkish search data show that, the proposed system outperforms the baseline by 6.3% and when combined with the baseline system, the improvement reaches 44.9%.","PeriodicalId":118243,"journal":{"name":"2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"175 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.2017.7953240","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
In this paper, we propose a neural network based distance metric learning method for a better discrimination in the sequence-matching based keyword search (KWS). In this technique, we conduct a version of Dynamic Time Warping (DTW) based similarity search on the speaker independent posteriorgram space. With this, we aim to compensate for the scarcity of the resources and overcome the out-of-vocabulary (OOV) term problem, which is one of the main issues for KWS on low-resource languages. This distance measure is then used in the DTW-based similarity search, as an alternative and in comparison to the widely and generally used distance metrics. The experiments ran on IARPA Babel Program's Turkish search data show that, the proposed system outperforms the baseline by 6.3% and when combined with the baseline system, the improvement reaches 44.9%.