An Improved Word Vector-Based Symptom Extraction Method for Traditional Chinese Medical Record Analysis

2021 11th International Conference on Information Technology in Medicine and Education (ITME) Pub Date : 2021-11-01 DOI:10.1109/ITME53901.2021.00082

Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li

{"title":"An Improved Word Vector-Based Symptom Extraction Method for Traditional Chinese Medical Record Analysis","authors":"Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li","doi":"10.1109/ITME53901.2021.00082","DOIUrl":null,"url":null,"abstract":"Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.","PeriodicalId":6774,"journal":{"name":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","volume":"17 1","pages":"379-384"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME53901.2021.00082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.

查看原文本刊更多论文

一种改进的基于词向量的病案症状提取方法

中医病案症状提取与规范在智能诊断中具有重要作用。近年来，由于词向量模型具有强大的性能，在自然语言处理任务中得到了广泛的应用。然而，单纯以词向量模型为核心进行文本分析很难同时满足时间和精度要求。为了改善这种情况，我们引入了一种改进的基于词向量的中医症状提取方法，该方法可以提取和规范中文原始医学文本中的症状。我们将该方法设计为三个部分:分词、词向量生成和术语替换。在我们的数据集上的实验结果表明，我们的方法在医学症状提取和去除冗余词方面有很好的效果。与其他基线词向量表示模型相比，我们的方法在效率和准确性方面表现良好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 11th International Conference on Information Technology in Medicine and Education (ITME)

自引率

0.00%

发文量