邻近索引在孤立词识别中的应用

Jose Martin Ruiz Perez, A. Camarena-Ibarrola
{"title":"邻近索引在孤立词识别中的应用","authors":"Jose Martin Ruiz Perez, A. Camarena-Ibarrola","doi":"10.1109/ROPEC.2013.6702720","DOIUrl":null,"url":null,"abstract":"An isolated word recognition system with a small dictionary may perform sequencial search, that is, compare the query utterance with each and every single one in the dictionary. However, when dealing with dictionaries made out of thousands of words, sequencial search is no longer a valid strategy since we would end up with such a slow recognition system that would surely be of no practical use. One approach to solve this problem is the use of proximity indexes so that the most similar word can be found quickly. The question that inmediately arises is: What proximity index is the ideal one for our purpose?. In this paper we compare several proximity indexes in their ability to find the most similar word to the query utterance among those included in the dictionary using less time for that purpose. According to our experiments a permutant-based index requires less distance evaluations at searching to find the nearest neighbor so it is faster than the pivot-based indexes included in our experiments such as the Fixed Query Array (FQA), the Burkhard-Keller Tree (BKT), the Fixed Query Tree (FQT) and the Fixed Height Fixed Query Tree (FHFQT).","PeriodicalId":307120,"journal":{"name":"2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On the use of proximity indexes for isolated word recognition\",\"authors\":\"Jose Martin Ruiz Perez, A. Camarena-Ibarrola\",\"doi\":\"10.1109/ROPEC.2013.6702720\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An isolated word recognition system with a small dictionary may perform sequencial search, that is, compare the query utterance with each and every single one in the dictionary. However, when dealing with dictionaries made out of thousands of words, sequencial search is no longer a valid strategy since we would end up with such a slow recognition system that would surely be of no practical use. One approach to solve this problem is the use of proximity indexes so that the most similar word can be found quickly. The question that inmediately arises is: What proximity index is the ideal one for our purpose?. In this paper we compare several proximity indexes in their ability to find the most similar word to the query utterance among those included in the dictionary using less time for that purpose. According to our experiments a permutant-based index requires less distance evaluations at searching to find the nearest neighbor so it is faster than the pivot-based indexes included in our experiments such as the Fixed Query Array (FQA), the Burkhard-Keller Tree (BKT), the Fixed Query Tree (FQT) and the Fixed Height Fixed Query Tree (FHFQT).\",\"PeriodicalId\":307120,\"journal\":{\"name\":\"2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROPEC.2013.6702720\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Autumn Meeting on Power Electronics and Computing (ROPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROPEC.2013.6702720","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

具有小字典的孤立词识别系统可以执行顺序搜索,即将查询话语与字典中的每一个单独的话语进行比较。然而,当处理由数千个单词组成的字典时,顺序搜索不再是一个有效的策略,因为我们最终会得到这样一个缓慢的识别系统,这肯定没有实际用途。解决这个问题的一种方法是使用接近索引,这样可以快速找到最相似的单词。立即出现的问题是:对于我们的目的,什么接近指数是理想的?在本文中,我们比较了几种接近索引在字典中使用更少的时间找到与查询话语最相似的单词的能力。根据我们的实验,基于置换的索引在寻找最近邻居时需要较少的距离评估,因此它比我们实验中包含的基于pivot的索引更快,例如固定查询数组(FQA), burk硬-凯勒树(BKT),固定查询树(FQT)和固定高度固定查询树(FHFQT)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
On the use of proximity indexes for isolated word recognition
An isolated word recognition system with a small dictionary may perform sequencial search, that is, compare the query utterance with each and every single one in the dictionary. However, when dealing with dictionaries made out of thousands of words, sequencial search is no longer a valid strategy since we would end up with such a slow recognition system that would surely be of no practical use. One approach to solve this problem is the use of proximity indexes so that the most similar word can be found quickly. The question that inmediately arises is: What proximity index is the ideal one for our purpose?. In this paper we compare several proximity indexes in their ability to find the most similar word to the query utterance among those included in the dictionary using less time for that purpose. According to our experiments a permutant-based index requires less distance evaluations at searching to find the nearest neighbor so it is faster than the pivot-based indexes included in our experiments such as the Fixed Query Array (FQA), the Burkhard-Keller Tree (BKT), the Fixed Query Tree (FQT) and the Fixed Height Fixed Query Tree (FHFQT).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信