An Improved Word Vector-Based Symptom Extraction Method for Traditional Chinese Medical Record Analysis

Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li
{"title":"An Improved Word Vector-Based Symptom Extraction Method for Traditional Chinese Medical Record Analysis","authors":"Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li","doi":"10.1109/ITME53901.2021.00082","DOIUrl":null,"url":null,"abstract":"Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.","PeriodicalId":6774,"journal":{"name":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","volume":"17 1","pages":"379-384"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME53901.2021.00082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.
一种改进的基于词向量的病案症状提取方法
中医病案症状提取与规范在智能诊断中具有重要作用。近年来,由于词向量模型具有强大的性能,在自然语言处理任务中得到了广泛的应用。然而,单纯以词向量模型为核心进行文本分析很难同时满足时间和精度要求。为了改善这种情况,我们引入了一种改进的基于词向量的中医症状提取方法,该方法可以提取和规范中文原始医学文本中的症状。我们将该方法设计为三个部分:分词、词向量生成和术语替换。在我们的数据集上的实验结果表明,我们的方法在医学症状提取和去除冗余词方面有很好的效果。与其他基线词向量表示模型相比,我们的方法在效率和准确性方面表现良好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信