{"title":"语音编码在病人数据相似度分析中的应用:孟加拉视角","authors":"Abir Bin Ayub Khan, M. Ghazanfar, S. I. Khan","doi":"10.1109/R10-HTC.2017.8289046","DOIUrl":null,"url":null,"abstract":"Due to illiteracy and lack of standardized healthcare systems, patients in Bangladesh usually, provide misspelled names while making their entry via these systems. Different records with slightly misspelled names are thus generated which makes data mining and others tasks quite challenging and inefficient. In this paper, we have looked into the underlying problem of misspelled names of patients in healthcare systems and proposed a modified version of NameSignificance algorithm. Our proposed algorithm has performed significantly better than the existing solutions like NameSignificance, Modified Soundex, Double Metaphone encoding for Bangla as we founded our algorithm on the phonetic nature of Bengali names written in English transliterated form. Our algorithm achieved a staggering 77% matching of names whereas relevant algorithm could not pass 70% correct matches. This proposed method could pave the way for better record linkage and data analysis of the medical patient dataset. Our syllable based approach also helped us identify the reason behind the wrong matches appeared through our algorithm which will surely pave the way to purify our algorithm in the future.","PeriodicalId":411099,"journal":{"name":"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Application of phonetic encoding for analyzing similarity of patient's data: Bangladesh perspective\",\"authors\":\"Abir Bin Ayub Khan, M. Ghazanfar, S. I. Khan\",\"doi\":\"10.1109/R10-HTC.2017.8289046\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to illiteracy and lack of standardized healthcare systems, patients in Bangladesh usually, provide misspelled names while making their entry via these systems. Different records with slightly misspelled names are thus generated which makes data mining and others tasks quite challenging and inefficient. In this paper, we have looked into the underlying problem of misspelled names of patients in healthcare systems and proposed a modified version of NameSignificance algorithm. Our proposed algorithm has performed significantly better than the existing solutions like NameSignificance, Modified Soundex, Double Metaphone encoding for Bangla as we founded our algorithm on the phonetic nature of Bengali names written in English transliterated form. Our algorithm achieved a staggering 77% matching of names whereas relevant algorithm could not pass 70% correct matches. This proposed method could pave the way for better record linkage and data analysis of the medical patient dataset. Our syllable based approach also helped us identify the reason behind the wrong matches appeared through our algorithm which will surely pave the way to purify our algorithm in the future.\",\"PeriodicalId\":411099,\"journal\":{\"name\":\"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/R10-HTC.2017.8289046\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/R10-HTC.2017.8289046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of phonetic encoding for analyzing similarity of patient's data: Bangladesh perspective
Due to illiteracy and lack of standardized healthcare systems, patients in Bangladesh usually, provide misspelled names while making their entry via these systems. Different records with slightly misspelled names are thus generated which makes data mining and others tasks quite challenging and inefficient. In this paper, we have looked into the underlying problem of misspelled names of patients in healthcare systems and proposed a modified version of NameSignificance algorithm. Our proposed algorithm has performed significantly better than the existing solutions like NameSignificance, Modified Soundex, Double Metaphone encoding for Bangla as we founded our algorithm on the phonetic nature of Bengali names written in English transliterated form. Our algorithm achieved a staggering 77% matching of names whereas relevant algorithm could not pass 70% correct matches. This proposed method could pave the way for better record linkage and data analysis of the medical patient dataset. Our syllable based approach also helped us identify the reason behind the wrong matches appeared through our algorithm which will surely pave the way to purify our algorithm in the future.