{"title":"中医病例症状信息自动提取的NLP应用","authors":"Cheng Qiang, Du Zhong-min","doi":"10.1109/ICCS56273.2022.9988199","DOIUrl":null,"url":null,"abstract":"Starting from natural language processing (NLP) technology, we compare the extraction results of TFIDF and Word2vec methods to explore research ideas that are more applicable to automated extraction of symptom information of traditional Chinese medicine (TCM) medical cases, and provide reference for the development of automated analysis of TCM medical cases. On the basis of constructed medical case dictionary, TFIDF and Word2vec methods were used to extract symptoms from heart cases, and the results were compared and analyzed. In medical cases, the onset of patients was often accompanied by palpitations, chest tightness, chest pain, shortness of breath, dizziness and other symptoms, and certain associations between symptoms were also found. The results of experimental evaluation showed that accuracy and recall rates of Word2vec method extraction were higher than those of TFIDF method. Compared with TFIDF method, Word2vec method is more effective when applied to the task of automated symptom information extraction from TCM cases.","PeriodicalId":382726,"journal":{"name":"2022 IEEE 2nd International Conference on Computer Systems (ICCS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A NLP Application of Automated Symptom Information Extraction from TCM Medical Cases\",\"authors\":\"Cheng Qiang, Du Zhong-min\",\"doi\":\"10.1109/ICCS56273.2022.9988199\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Starting from natural language processing (NLP) technology, we compare the extraction results of TFIDF and Word2vec methods to explore research ideas that are more applicable to automated extraction of symptom information of traditional Chinese medicine (TCM) medical cases, and provide reference for the development of automated analysis of TCM medical cases. On the basis of constructed medical case dictionary, TFIDF and Word2vec methods were used to extract symptoms from heart cases, and the results were compared and analyzed. In medical cases, the onset of patients was often accompanied by palpitations, chest tightness, chest pain, shortness of breath, dizziness and other symptoms, and certain associations between symptoms were also found. The results of experimental evaluation showed that accuracy and recall rates of Word2vec method extraction were higher than those of TFIDF method. Compared with TFIDF method, Word2vec method is more effective when applied to the task of automated symptom information extraction from TCM cases.\",\"PeriodicalId\":382726,\"journal\":{\"name\":\"2022 IEEE 2nd International Conference on Computer Systems (ICCS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 2nd International Conference on Computer Systems (ICCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCS56273.2022.9988199\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 2nd International Conference on Computer Systems (ICCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCS56273.2022.9988199","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A NLP Application of Automated Symptom Information Extraction from TCM Medical Cases
Starting from natural language processing (NLP) technology, we compare the extraction results of TFIDF and Word2vec methods to explore research ideas that are more applicable to automated extraction of symptom information of traditional Chinese medicine (TCM) medical cases, and provide reference for the development of automated analysis of TCM medical cases. On the basis of constructed medical case dictionary, TFIDF and Word2vec methods were used to extract symptoms from heart cases, and the results were compared and analyzed. In medical cases, the onset of patients was often accompanied by palpitations, chest tightness, chest pain, shortness of breath, dizziness and other symptoms, and certain associations between symptoms were also found. The results of experimental evaluation showed that accuracy and recall rates of Word2vec method extraction were higher than those of TFIDF method. Compared with TFIDF method, Word2vec method is more effective when applied to the task of automated symptom information extraction from TCM cases.