Maria del Pilar Angeles, Adrian Espino-Gamez, Jonathan Gil-Moncada
{"title":"Comparison of a Modified Spanish Phonetic, Soundex, and Phonex coding functions during data matching process","authors":"Maria del Pilar Angeles, Adrian Espino-Gamez, Jonathan Gil-Moncada","doi":"10.1109/ICIEV.2015.7334028","DOIUrl":null,"url":null,"abstract":"The present paper is aimed to help native spanish speakers to identify an open and effective spanish encoding function during data matching process. We present the implementation and enhancement of the encoding algorithm Spanish Phonetic Soundex [1]. We have carried out an evaluation of data matching considering Spanish Phonetic Soundex, Soundex [2], [3] and Phonex [4] in terms of precision-recall and f-measure. As far as we know, such comparison against these phonetic functions has not been presented before. We suggest spanish speaker users a Modified Spanish Phonetic Soundex function, that has a better performance in terms of precision, f-measure and similarity values derived from the encoding phase than the common phonetic coding functions utilized until now.","PeriodicalId":367355,"journal":{"name":"2015 International Conference on Informatics, Electronics & Vision (ICIEV)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Informatics, Electronics & Vision (ICIEV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIEV.2015.7334028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
The present paper is aimed to help native spanish speakers to identify an open and effective spanish encoding function during data matching process. We present the implementation and enhancement of the encoding algorithm Spanish Phonetic Soundex [1]. We have carried out an evaluation of data matching considering Spanish Phonetic Soundex, Soundex [2], [3] and Phonex [4] in terms of precision-recall and f-measure. As far as we know, such comparison against these phonetic functions has not been presented before. We suggest spanish speaker users a Modified Spanish Phonetic Soundex function, that has a better performance in terms of precision, f-measure and similarity values derived from the encoding phase than the common phonetic coding functions utilized until now.