Debby D. Wang, S. Ng, Siti Nabilah Binte Abdul, Sravan Ramachandran, Srinath Sridharan, Xin Quan Tan
{"title":"Imputation of Missing Diagnosis of Diabetes in an Administrative EMR System","authors":"Debby D. Wang, S. Ng, Siti Nabilah Binte Abdul, Sravan Ramachandran, Srinath Sridharan, Xin Quan Tan","doi":"10.1109/BMEICON.2018.8609956","DOIUrl":null,"url":null,"abstract":"Administrative electronic medical records (EMRs) contain rich patient data and are an important data source for health informatics studies. Prevalent in such EMRs, poor/missing diagnosis coding is intractable while can be mitigated by imputation techniques. In this work, based on an administrative EMR database in Singapore, we adopted popular machine learning methods to model the relations between diseases and healthcare utilization features, and used the model to impute missing diagnosis of diabetes. Further, this was partially validated with supplementary clinical data. The structured method in this work can be easily extended to other diseases and would benefit other works in health services and research.","PeriodicalId":232271,"journal":{"name":"2018 11th Biomedical Engineering International Conference (BMEiCON)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 11th Biomedical Engineering International Conference (BMEiCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BMEICON.2018.8609956","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Administrative electronic medical records (EMRs) contain rich patient data and are an important data source for health informatics studies. Prevalent in such EMRs, poor/missing diagnosis coding is intractable while can be mitigated by imputation techniques. In this work, based on an administrative EMR database in Singapore, we adopted popular machine learning methods to model the relations between diseases and healthcare utilization features, and used the model to impute missing diagnosis of diabetes. Further, this was partially validated with supplementary clinical data. The structured method in this work can be easily extended to other diseases and would benefit other works in health services and research.