A NLP Application of Automated Symptom Information Extraction from TCM Medical Cases

Cheng Qiang, Du Zhong-min
{"title":"A NLP Application of Automated Symptom Information Extraction from TCM Medical Cases","authors":"Cheng Qiang, Du Zhong-min","doi":"10.1109/ICCS56273.2022.9988199","DOIUrl":null,"url":null,"abstract":"Starting from natural language processing (NLP) technology, we compare the extraction results of TFIDF and Word2vec methods to explore research ideas that are more applicable to automated extraction of symptom information of traditional Chinese medicine (TCM) medical cases, and provide reference for the development of automated analysis of TCM medical cases. On the basis of constructed medical case dictionary, TFIDF and Word2vec methods were used to extract symptoms from heart cases, and the results were compared and analyzed. In medical cases, the onset of patients was often accompanied by palpitations, chest tightness, chest pain, shortness of breath, dizziness and other symptoms, and certain associations between symptoms were also found. The results of experimental evaluation showed that accuracy and recall rates of Word2vec method extraction were higher than those of TFIDF method. Compared with TFIDF method, Word2vec method is more effective when applied to the task of automated symptom information extraction from TCM cases.","PeriodicalId":382726,"journal":{"name":"2022 IEEE 2nd International Conference on Computer Systems (ICCS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 2nd International Conference on Computer Systems (ICCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCS56273.2022.9988199","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Starting from natural language processing (NLP) technology, we compare the extraction results of TFIDF and Word2vec methods to explore research ideas that are more applicable to automated extraction of symptom information of traditional Chinese medicine (TCM) medical cases, and provide reference for the development of automated analysis of TCM medical cases. On the basis of constructed medical case dictionary, TFIDF and Word2vec methods were used to extract symptoms from heart cases, and the results were compared and analyzed. In medical cases, the onset of patients was often accompanied by palpitations, chest tightness, chest pain, shortness of breath, dizziness and other symptoms, and certain associations between symptoms were also found. The results of experimental evaluation showed that accuracy and recall rates of Word2vec method extraction were higher than those of TFIDF method. Compared with TFIDF method, Word2vec method is more effective when applied to the task of automated symptom information extraction from TCM cases.
中医病例症状信息自动提取的NLP应用
我们从自然语言处理(NLP)技术出发,比较TFIDF和Word2vec方法的提取结果,探索更适用于中医病例症状信息自动提取的研究思路,为中医病例自动分析的发展提供参考。在构建病例词典的基础上,采用TFIDF和Word2vec方法对心脏病例进行症状提取,并对结果进行对比分析。在医疗病例中,患者发病时往往伴有心悸、胸闷、胸痛、呼吸短促、头晕和其他症状,并且还发现了症状之间的某些关联。实验评价结果表明,Word2vec方法提取的准确率和召回率均高于TFIDF方法。与TFIDF方法相比,Word2vec方法在中医病例症状信息自动提取任务中更为有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信