MetaSoundex英语和西班牙语语音匹配

K. Koneru, C. Varol
{"title":"MetaSoundex英语和西班牙语语音匹配","authors":"K. Koneru, C. Varol","doi":"10.18311/GJEIS/2018/19822","DOIUrl":null,"url":null,"abstract":"Researchers confront major problems while searching for various kinds of data in large imprecise databases, as they are not spelled correctly or in the way they were expected to be spelled. As a result, they cannot find the word they sought. Over the years of struggle, pronunciation of words was considered as one of the practices to solve the problem effectively. The technique used to acquire words based on sounds is known as “Phonetic Matching”. Soundex was the first algorithm developed and other algorithms such as Metaphone, Caverphone, DMetaphone, Phonex etc., are also used for information retrieval in different environments. The main contribution of this paper is to analyze and implement the newly proposed MetaSoundex algorithm for fixing ill-defined data in English and Spanish languages. The newly developed MetaSoundex algorithm addresses the limitations of well-known phonetic matching techniques, Metaphone and Soundex. Specifically, the new algorithm provided results that are more accurate compared to both Soundex and Metaphone algorithms and has higher precision compared to Soundex, thus reducing the noise in the considered arena.","PeriodicalId":318809,"journal":{"name":"Global Journal of Enterprise Information System","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MetaSoundex Phonetic Matching for English and Spanish\",\"authors\":\"K. Koneru, C. Varol\",\"doi\":\"10.18311/GJEIS/2018/19822\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Researchers confront major problems while searching for various kinds of data in large imprecise databases, as they are not spelled correctly or in the way they were expected to be spelled. As a result, they cannot find the word they sought. Over the years of struggle, pronunciation of words was considered as one of the practices to solve the problem effectively. The technique used to acquire words based on sounds is known as “Phonetic Matching”. Soundex was the first algorithm developed and other algorithms such as Metaphone, Caverphone, DMetaphone, Phonex etc., are also used for information retrieval in different environments. The main contribution of this paper is to analyze and implement the newly proposed MetaSoundex algorithm for fixing ill-defined data in English and Spanish languages. The newly developed MetaSoundex algorithm addresses the limitations of well-known phonetic matching techniques, Metaphone and Soundex. Specifically, the new algorithm provided results that are more accurate compared to both Soundex and Metaphone algorithms and has higher precision compared to Soundex, thus reducing the noise in the considered arena.\",\"PeriodicalId\":318809,\"journal\":{\"name\":\"Global Journal of Enterprise Information System\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Global Journal of Enterprise Information System\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18311/GJEIS/2018/19822\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Journal of Enterprise Information System","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18311/GJEIS/2018/19822","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

研究人员在不精确的大型数据库中搜索各种数据时遇到了主要问题,因为它们的拼写不正确或不符合预期的拼写方式。结果,他们找不到他们想要的词。在多年的斗争中,单词发音被认为是有效解决这一问题的做法之一。根据声音来获取单词的技术被称为“语音匹配”。Soundex是第一个开发的算法,其他算法如Metaphone, Caverphone, DMetaphone, Phonex等也用于不同环境下的信息检索。本文的主要贡献是分析和实现了新提出的MetaSoundex算法,用于修复英语和西班牙语中定义不清的数据。新开发的MetaSoundex算法解决了众所周知的语音匹配技术Metaphone和Soundex的局限性。具体来说,新算法提供的结果比Soundex和Metaphone算法更准确,比Soundex具有更高的精度,从而减少了所考虑的竞技场中的噪声。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MetaSoundex Phonetic Matching for English and Spanish
Researchers confront major problems while searching for various kinds of data in large imprecise databases, as they are not spelled correctly or in the way they were expected to be spelled. As a result, they cannot find the word they sought. Over the years of struggle, pronunciation of words was considered as one of the practices to solve the problem effectively. The technique used to acquire words based on sounds is known as “Phonetic Matching”. Soundex was the first algorithm developed and other algorithms such as Metaphone, Caverphone, DMetaphone, Phonex etc., are also used for information retrieval in different environments. The main contribution of this paper is to analyze and implement the newly proposed MetaSoundex algorithm for fixing ill-defined data in English and Spanish languages. The newly developed MetaSoundex algorithm addresses the limitations of well-known phonetic matching techniques, Metaphone and Soundex. Specifically, the new algorithm provided results that are more accurate compared to both Soundex and Metaphone algorithms and has higher precision compared to Soundex, thus reducing the noise in the considered arena.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信