Natural language parsing of patient complaints in Indonesian language

Chanifah Indah Ratnasari, S. Kusumadewi, L. Rosita
{"title":"Natural language parsing of patient complaints in Indonesian language","authors":"Chanifah Indah Ratnasari, S. Kusumadewi, L. Rosita","doi":"10.1109/TICST.2015.7369373","DOIUrl":null,"url":null,"abstract":"In Indonesia, patient complaints are recorded in the form of free-text data or a narrative text by the doctor when taking the medical history or conducting the medical interview. This text, although recorded in electronic medical records (EMR), is difficult to process computationally because the computer does not recognize natural language. The structure of the Indonesian language differs from that of English. Moreover, the language of patient complaints is structured differently from the Indonesian language in general. It does not consist of the S-P-O-K (Subject-Predicate-Object-Adverb) structures that are used in Indonesian sentences. Moreover, there is a wide range of local languages in Indonesia. Based on data on patient complaints obtained from physicians, this study develops production rules for mapping patient complaints. The aim of the study is to develop a parsing method that automatically maps patient complaints from an unstructured text into a structured text that can be recognized by the computer. In the parsing process developed in this research, a narrative text that has been split into words and/or separated phrases/clauses is used to conduct a suitability search of the lexicon. The lexicon that exceeded the minimum suitability value (threshold) and the highest (maximum) suitability value was selected as the candidate for the lexicon. This study was conducted with consideration for the important information in the free text of patient complaints and could be used subsequently to support a wide variety of clinical decisions.","PeriodicalId":251893,"journal":{"name":"2015 International Conference on Science and Technology (TICST)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Science and Technology (TICST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TICST.2015.7369373","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In Indonesia, patient complaints are recorded in the form of free-text data or a narrative text by the doctor when taking the medical history or conducting the medical interview. This text, although recorded in electronic medical records (EMR), is difficult to process computationally because the computer does not recognize natural language. The structure of the Indonesian language differs from that of English. Moreover, the language of patient complaints is structured differently from the Indonesian language in general. It does not consist of the S-P-O-K (Subject-Predicate-Object-Adverb) structures that are used in Indonesian sentences. Moreover, there is a wide range of local languages in Indonesia. Based on data on patient complaints obtained from physicians, this study develops production rules for mapping patient complaints. The aim of the study is to develop a parsing method that automatically maps patient complaints from an unstructured text into a structured text that can be recognized by the computer. In the parsing process developed in this research, a narrative text that has been split into words and/or separated phrases/clauses is used to conduct a suitability search of the lexicon. The lexicon that exceeded the minimum suitability value (threshold) and the highest (maximum) suitability value was selected as the candidate for the lexicon. This study was conducted with consideration for the important information in the free text of patient complaints and could be used subsequently to support a wide variety of clinical decisions.
印尼语患者主诉的自然语言分析
在印度尼西亚,医生在记录病史或进行医疗面谈时,以自由文本数据或叙述文本的形式记录病人的投诉。该文本虽然记录在电子病历(EMR)中,但由于计算机不能识别自然语言,因此难以进行计算处理。印尼语的结构与英语不同。此外,患者投诉的语言结构与一般的印度尼西亚语不同。它不包括印尼句子中使用的S-P-O-K(主语-谓语-宾语-副词)结构。此外,印度尼西亚有各种各样的当地语言。基于从医生处获得的患者投诉数据,本研究开发了患者投诉映射的生成规则。这项研究的目的是开发一种解析方法,自动将患者的投诉从非结构化文本映射到计算机可以识别的结构化文本。在本研究开发的解析过程中,使用被分割成单词和/或分离的短语/分句的叙事文本来进行词汇的适用性搜索。选择超过最小适合度值(阈值)和最高适合度值(最大值)的词汇作为候选词汇。本研究的开展考虑了患者投诉自由文本中的重要信息,并可随后用于支持各种临床决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信