数据挖掘算法探索新冠肺炎患者轨迹的初步研究

Shuliang Chen, Ce Zhang, P. Ren
{"title":"数据挖掘算法探索新冠肺炎患者轨迹的初步研究","authors":"Shuliang Chen, Ce Zhang, P. Ren","doi":"10.3760/CMA.J.CN113565-20200212-00013","DOIUrl":null,"url":null,"abstract":"Objective \nTo explore data mining methods and tools for the activity paths of confirmed patients, and provide data analysis tools for epidemic control. \n \n \nMethods \nThe data used came from the trajectory data of confirmed cases collected by Tencent. The jieba word segmentation and word cloud map function of Python 3.6 were used to calculate the high-frequency vocabulary in the trajectory of confirmed patients. The epidemic prevention and control strategy was developed based on the high-frequency vocabulary. \n \n \nResults \nTaking Guangdong Province, the second most confirmed patients in the country, as an example, the key areas of epidemic control obtained through data mining involve Wuhan (epidemiological history), Zhuhai and Guangzhou. The key control activities include family visiting, traveling and shopping. Means of transportation include self-driving, trains and airplanes; the key patients studied were Li and Ding; the symptoms of this patient group were mainly fever and cough. \n \n \nConclusions \nThe data mining algorithm in this paper can provide an advantageous tool for epidemic prevention and control, also assist frontline personnel to adjust the deployment of epidemic prevention and control according to their priorities. \n \n \nKey words: \nCOVID-19; Epidemic prevention and control; Data mining; Python","PeriodicalId":59555,"journal":{"name":"中华医学科研管理杂志","volume":"33 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Preliminary study on exploring the trajectory of patients with COVID-19 by Data mining algorithms\",\"authors\":\"Shuliang Chen, Ce Zhang, P. Ren\",\"doi\":\"10.3760/CMA.J.CN113565-20200212-00013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective \\nTo explore data mining methods and tools for the activity paths of confirmed patients, and provide data analysis tools for epidemic control. \\n \\n \\nMethods \\nThe data used came from the trajectory data of confirmed cases collected by Tencent. The jieba word segmentation and word cloud map function of Python 3.6 were used to calculate the high-frequency vocabulary in the trajectory of confirmed patients. The epidemic prevention and control strategy was developed based on the high-frequency vocabulary. \\n \\n \\nResults \\nTaking Guangdong Province, the second most confirmed patients in the country, as an example, the key areas of epidemic control obtained through data mining involve Wuhan (epidemiological history), Zhuhai and Guangzhou. The key control activities include family visiting, traveling and shopping. Means of transportation include self-driving, trains and airplanes; the key patients studied were Li and Ding; the symptoms of this patient group were mainly fever and cough. \\n \\n \\nConclusions \\nThe data mining algorithm in this paper can provide an advantageous tool for epidemic prevention and control, also assist frontline personnel to adjust the deployment of epidemic prevention and control according to their priorities. \\n \\n \\nKey words: \\nCOVID-19; Epidemic prevention and control; Data mining; Python\",\"PeriodicalId\":59555,\"journal\":{\"name\":\"中华医学科研管理杂志\",\"volume\":\"33 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"中华医学科研管理杂志\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3760/CMA.J.CN113565-20200212-00013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华医学科研管理杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/CMA.J.CN113565-20200212-00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

目的探讨确诊患者活动路径的数据挖掘方法和工具,为疫情控制提供数据分析工具。方法所用数据来自腾讯收集的确诊病例轨迹数据。使用Python 3.6的揭坝分词和单词云图功能计算确诊患者轨迹中的高频词汇。疫情防控策略是根据高频词汇制定的。结果以全国确诊病例第二多的广东省为例,通过数据挖掘获得的疫情控制重点地区涉及武汉(流行病学史)、珠海和广州。关键控制活动包括探亲、旅游和购物。交通工具包括自动驾驶、火车和飞机;研究的重点患者为李和丁;该患者组症状以发热、咳嗽为主。结论本文的数据挖掘算法可以为疫情防控提供一个有利的工具,也可以帮助一线人员根据自己的优先事项调整疫情防控的部署。关键词:新冠肺炎;流行病预防和控制;数据挖掘;Python
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Preliminary study on exploring the trajectory of patients with COVID-19 by Data mining algorithms
Objective To explore data mining methods and tools for the activity paths of confirmed patients, and provide data analysis tools for epidemic control. Methods The data used came from the trajectory data of confirmed cases collected by Tencent. The jieba word segmentation and word cloud map function of Python 3.6 were used to calculate the high-frequency vocabulary in the trajectory of confirmed patients. The epidemic prevention and control strategy was developed based on the high-frequency vocabulary. Results Taking Guangdong Province, the second most confirmed patients in the country, as an example, the key areas of epidemic control obtained through data mining involve Wuhan (epidemiological history), Zhuhai and Guangzhou. The key control activities include family visiting, traveling and shopping. Means of transportation include self-driving, trains and airplanes; the key patients studied were Li and Ding; the symptoms of this patient group were mainly fever and cough. Conclusions The data mining algorithm in this paper can provide an advantageous tool for epidemic prevention and control, also assist frontline personnel to adjust the deployment of epidemic prevention and control according to their priorities. Key words: COVID-19; Epidemic prevention and control; Data mining; Python
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
3322
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信