{"title":"Similarity analysis of patients' data: Bangladesh perspective","authors":"S. I. Khan, A. S. M. L. Hoque","doi":"10.1109/MEDITEC.2016.7835390","DOIUrl":null,"url":null,"abstract":"Misspelling of names is a major problem of real world datasets and a single person is identified differently as its consequence. In Bangladesh, it is common that many people, in real, do not know their full name and many of Bangladeshi citizens are unable to pronounce their name correctly, even in the mother tongue. The Same person provides a different version of their name during taking a public service e.g., treatment in hospital. In almost all healthcare centers, a patient is asked and he reports his demographic data i.e. name, age, etc. orally. This creates ambiguity with misspelled names. In this paper, we have provided an algorithm to identify the same person correctly from the variation of names. Experimental results show that our proposed technique can successfully link records with high accuracy for noisy data like misspelled patient names etc.","PeriodicalId":325916,"journal":{"name":"2016 International Conference on Medical Engineering, Health Informatics and Technology (MediTec)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Medical Engineering, Health Informatics and Technology (MediTec)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MEDITEC.2016.7835390","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Misspelling of names is a major problem of real world datasets and a single person is identified differently as its consequence. In Bangladesh, it is common that many people, in real, do not know their full name and many of Bangladeshi citizens are unable to pronounce their name correctly, even in the mother tongue. The Same person provides a different version of their name during taking a public service e.g., treatment in hospital. In almost all healthcare centers, a patient is asked and he reports his demographic data i.e. name, age, etc. orally. This creates ambiguity with misspelled names. In this paper, we have provided an algorithm to identify the same person correctly from the variation of names. Experimental results show that our proposed technique can successfully link records with high accuracy for noisy data like misspelled patient names etc.