邻近索引在孤立词识别中的应用

2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC) Pub Date : 2013-11-01 DOI:10.1109/ROPEC.2013.6702720

Jose Martin Ruiz Perez, A. Camarena-Ibarrola

{"title":"邻近索引在孤立词识别中的应用","authors":"Jose Martin Ruiz Perez, A. Camarena-Ibarrola","doi":"10.1109/ROPEC.2013.6702720","DOIUrl":null,"url":null,"abstract":"An isolated word recognition system with a small dictionary may perform sequencial search, that is, compare the query utterance with each and every single one in the dictionary. However, when dealing with dictionaries made out of thousands of words, sequencial search is no longer a valid strategy since we would end up with such a slow recognition system that would surely be of no practical use. One approach to solve this problem is the use of proximity indexes so that the most similar word can be found quickly. The question that inmediately arises is: What proximity index is the ideal one for our purpose?. In this paper we compare several proximity indexes in their ability to find the most similar word to the query utterance among those included in the dictionary using less time for that purpose. According to our experiments a permutant-based index requires less distance evaluations at searching to find the nearest neighbor so it is faster than the pivot-based indexes included in our experiments such as the Fixed Query Array (FQA), the Burkhard-Keller Tree (BKT), the Fixed Query Tree (FQT) and the Fixed Height Fixed Query Tree (FHFQT).","PeriodicalId":307120,"journal":{"name":"2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On the use of proximity indexes for isolated word recognition\",\"authors\":\"Jose Martin Ruiz Perez, A. Camarena-Ibarrola\",\"doi\":\"10.1109/ROPEC.2013.6702720\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An isolated word recognition system with a small dictionary may perform sequencial search, that is, compare the query utterance with each and every single one in the dictionary. However, when dealing with dictionaries made out of thousands of words, sequencial search is no longer a valid strategy since we would end up with such a slow recognition system that would surely be of no practical use. One approach to solve this problem is the use of proximity indexes so that the most similar word can be found quickly. The question that inmediately arises is: What proximity index is the ideal one for our purpose?. In this paper we compare several proximity indexes in their ability to find the most similar word to the query utterance among those included in the dictionary using less time for that purpose. According to our experiments a permutant-based index requires less distance evaluations at searching to find the nearest neighbor so it is faster than the pivot-based indexes included in our experiments such as the Fixed Query Array (FQA), the Burkhard-Keller Tree (BKT), the Fixed Query Tree (FQT) and the Fixed Height Fixed Query Tree (FHFQT).\",\"PeriodicalId\":307120,\"journal\":{\"name\":\"2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROPEC.2013.6702720\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROPEC.2013.6702720","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

具有小字典的孤立词识别系统可以执行顺序搜索，即将查询话语与字典中的每一个单独的话语进行比较。然而，当处理由数千个单词组成的字典时，顺序搜索不再是一个有效的策略，因为我们最终会得到这样一个缓慢的识别系统，这肯定没有实际用途。解决这个问题的一种方法是使用接近索引，这样可以快速找到最相似的单词。立即出现的问题是:对于我们的目的，什么接近指数是理想的?在本文中，我们比较了几种接近索引在字典中使用更少的时间找到与查询话语最相似的单词的能力。根据我们的实验，基于置换的索引在寻找最近邻居时需要较少的距离评估，因此它比我们实验中包含的基于pivot的索引更快，例如固定查询数组(FQA)， burk硬-凯勒树(BKT)，固定查询树(FQT)和固定高度固定查询树(FHFQT)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On the use of proximity indexes for isolated word recognition

An isolated word recognition system with a small dictionary may perform sequencial search, that is, compare the query utterance with each and every single one in the dictionary. However, when dealing with dictionaries made out of thousands of words, sequencial search is no longer a valid strategy since we would end up with such a slow recognition system that would surely be of no practical use. One approach to solve this problem is the use of proximity indexes so that the most similar word can be found quickly. The question that inmediately arises is: What proximity index is the ideal one for our purpose?. In this paper we compare several proximity indexes in their ability to find the most similar word to the query utterance among those included in the dictionary using less time for that purpose. According to our experiments a permutant-based index requires less distance evaluations at searching to find the nearest neighbor so it is faster than the pivot-based indexes included in our experiments such as the Fixed Query Array (FQA), the Burkhard-Keller Tree (BKT), the Fixed Query Tree (FQT) and the Fixed Height Fixed Query Tree (FHFQT).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)

自引率

0.00%

发文量