{"title":"Vietnamese Electronic Medical Record Management with Text Preprocessing for Spelling Errors","authors":"Khang Tran, Anh Nguyen, C. Vo, P. Nguyen","doi":"10.1109/NICS56915.2022.10013386","DOIUrl":null,"url":null,"abstract":"With an important development of information technology, electronic medical records (EMRs) play a crucial role in medicine and its related fields such as pharmacology, bioinformatics, and healthcare research. An EMR management system is thus significant so that EMRs can be manipulated conveniently by their key users, who are doctors, nurses, and patients, and further exploited by other researchers. Nonetheless, there is no consideration on the EMRs when they are committed to the system. Their clinical texts are especially left behind upon their entry. Such a circumstance is understandable because they are created in a high-pressure working environment and normally late examined as needed in other tasks. In order to fill this gap, we propose an EMR management system with text preprocessing for spelling error correction on clinical texts. Our system is dedicated to Vietnamese EMRs with a practical demonstration confirmed by doctors and medical students. In addition, the spelling error correction method is defined to cover a wide diversity of spelling errors in a novel hybrid manner, by effectively combining a rule-based approach, dictionaries, n-grams, and a pre-trained monolingual sequence-to-sequence model. Indeed, the experimental results show that the method has better performance on real EMRs as compared to one of the most recent existing ones. As a result, it is promising that our work can enhance and make Vietnamese EMRs more utilizable.","PeriodicalId":381028,"journal":{"name":"2022 9th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS56915.2022.10013386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With an important development of information technology, electronic medical records (EMRs) play a crucial role in medicine and its related fields such as pharmacology, bioinformatics, and healthcare research. An EMR management system is thus significant so that EMRs can be manipulated conveniently by their key users, who are doctors, nurses, and patients, and further exploited by other researchers. Nonetheless, there is no consideration on the EMRs when they are committed to the system. Their clinical texts are especially left behind upon their entry. Such a circumstance is understandable because they are created in a high-pressure working environment and normally late examined as needed in other tasks. In order to fill this gap, we propose an EMR management system with text preprocessing for spelling error correction on clinical texts. Our system is dedicated to Vietnamese EMRs with a practical demonstration confirmed by doctors and medical students. In addition, the spelling error correction method is defined to cover a wide diversity of spelling errors in a novel hybrid manner, by effectively combining a rule-based approach, dictionaries, n-grams, and a pre-trained monolingual sequence-to-sequence model. Indeed, the experimental results show that the method has better performance on real EMRs as compared to one of the most recent existing ones. As a result, it is promising that our work can enhance and make Vietnamese EMRs more utilizable.