{"title":"数据挖掘算法探索新冠肺炎患者轨迹的初步研究","authors":"Shuliang Chen, Ce Zhang, P. Ren","doi":"10.3760/CMA.J.CN113565-20200212-00013","DOIUrl":null,"url":null,"abstract":"Objective \nTo explore data mining methods and tools for the activity paths of confirmed patients, and provide data analysis tools for epidemic control. \n \n \nMethods \nThe data used came from the trajectory data of confirmed cases collected by Tencent. The jieba word segmentation and word cloud map function of Python 3.6 were used to calculate the high-frequency vocabulary in the trajectory of confirmed patients. The epidemic prevention and control strategy was developed based on the high-frequency vocabulary. \n \n \nResults \nTaking Guangdong Province, the second most confirmed patients in the country, as an example, the key areas of epidemic control obtained through data mining involve Wuhan (epidemiological history), Zhuhai and Guangzhou. The key control activities include family visiting, traveling and shopping. Means of transportation include self-driving, trains and airplanes; the key patients studied were Li and Ding; the symptoms of this patient group were mainly fever and cough. \n \n \nConclusions \nThe data mining algorithm in this paper can provide an advantageous tool for epidemic prevention and control, also assist frontline personnel to adjust the deployment of epidemic prevention and control according to their priorities. \n \n \nKey words: \nCOVID-19; Epidemic prevention and control; Data mining; Python","PeriodicalId":59555,"journal":{"name":"中华医学科研管理杂志","volume":"33 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Preliminary study on exploring the trajectory of patients with COVID-19 by Data mining algorithms\",\"authors\":\"Shuliang Chen, Ce Zhang, P. Ren\",\"doi\":\"10.3760/CMA.J.CN113565-20200212-00013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective \\nTo explore data mining methods and tools for the activity paths of confirmed patients, and provide data analysis tools for epidemic control. \\n \\n \\nMethods \\nThe data used came from the trajectory data of confirmed cases collected by Tencent. The jieba word segmentation and word cloud map function of Python 3.6 were used to calculate the high-frequency vocabulary in the trajectory of confirmed patients. The epidemic prevention and control strategy was developed based on the high-frequency vocabulary. \\n \\n \\nResults \\nTaking Guangdong Province, the second most confirmed patients in the country, as an example, the key areas of epidemic control obtained through data mining involve Wuhan (epidemiological history), Zhuhai and Guangzhou. The key control activities include family visiting, traveling and shopping. Means of transportation include self-driving, trains and airplanes; the key patients studied were Li and Ding; the symptoms of this patient group were mainly fever and cough. \\n \\n \\nConclusions \\nThe data mining algorithm in this paper can provide an advantageous tool for epidemic prevention and control, also assist frontline personnel to adjust the deployment of epidemic prevention and control according to their priorities. \\n \\n \\nKey words: \\nCOVID-19; Epidemic prevention and control; Data mining; Python\",\"PeriodicalId\":59555,\"journal\":{\"name\":\"中华医学科研管理杂志\",\"volume\":\"33 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"中华医学科研管理杂志\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3760/CMA.J.CN113565-20200212-00013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华医学科研管理杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/CMA.J.CN113565-20200212-00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Preliminary study on exploring the trajectory of patients with COVID-19 by Data mining algorithms
Objective
To explore data mining methods and tools for the activity paths of confirmed patients, and provide data analysis tools for epidemic control.
Methods
The data used came from the trajectory data of confirmed cases collected by Tencent. The jieba word segmentation and word cloud map function of Python 3.6 were used to calculate the high-frequency vocabulary in the trajectory of confirmed patients. The epidemic prevention and control strategy was developed based on the high-frequency vocabulary.
Results
Taking Guangdong Province, the second most confirmed patients in the country, as an example, the key areas of epidemic control obtained through data mining involve Wuhan (epidemiological history), Zhuhai and Guangzhou. The key control activities include family visiting, traveling and shopping. Means of transportation include self-driving, trains and airplanes; the key patients studied were Li and Ding; the symptoms of this patient group were mainly fever and cough.
Conclusions
The data mining algorithm in this paper can provide an advantageous tool for epidemic prevention and control, also assist frontline personnel to adjust the deployment of epidemic prevention and control according to their priorities.
Key words:
COVID-19; Epidemic prevention and control; Data mining; Python