医疗数据二次使用中的数据探索

2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW) Pub Date : 2011-11-12 DOI:10.1109/BIBM.2011.129

Jian Wang

{"title":"医疗数据二次使用中的数据探索","authors":"Jian Wang","doi":"10.1109/BIBM.2011.129","DOIUrl":null,"url":null,"abstract":"Real world data sets (as opposed to data from randomized, controlled clinical trials) are becoming increasing available from the healthcare industry. Large databases from EMRs/EHRs, insurance claims, pharmacy records, disease registries etc present unique challenges when they are utilized to support pharmaceutical R&D activities. Such \"secondary use\" of healthcare data usually starts with an exploratory phase when the researcher takes a high-level view of the available data and starts to \"connect the dots\". Data exploration is a highly dynamic process: exploratory paths change frequently, sometimes converging, other times diverging, and often resulting in dead ends. Only a small subset of exploratory results end up being formally analyzed to derive quantitative insights. Because of this dynamic nature of data exploration, it is critical that researchers who generate hypotheses, the domain experts, can directly explore in the available data space. Data exploration on large healthcare data sets is often a bottleneck because these data sets tend to be poorly understood in terms of their quality, completeness, consistency, etc. We will discuss this emerging landscape, focusing on case studies to illustrate the powerful convergence of real-world data and technological advancements to help leverage this data.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"43 1","pages":"658-658"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data Exploration in Secondary Use of Healthcare Data\",\"authors\":\"Jian Wang\",\"doi\":\"10.1109/BIBM.2011.129\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real world data sets (as opposed to data from randomized, controlled clinical trials) are becoming increasing available from the healthcare industry. Large databases from EMRs/EHRs, insurance claims, pharmacy records, disease registries etc present unique challenges when they are utilized to support pharmaceutical R&D activities. Such \\\"secondary use\\\" of healthcare data usually starts with an exploratory phase when the researcher takes a high-level view of the available data and starts to \\\"connect the dots\\\". Data exploration is a highly dynamic process: exploratory paths change frequently, sometimes converging, other times diverging, and often resulting in dead ends. Only a small subset of exploratory results end up being formally analyzed to derive quantitative insights. Because of this dynamic nature of data exploration, it is critical that researchers who generate hypotheses, the domain experts, can directly explore in the available data space. Data exploration on large healthcare data sets is often a bottleneck because these data sets tend to be poorly understood in terms of their quality, completeness, consistency, etc. We will discuss this emerging landscape, focusing on case studies to illustrate the powerful convergence of real-world data and technological advancements to help leverage this data.\",\"PeriodicalId\":6345,\"journal\":{\"name\":\"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)\",\"volume\":\"43 1\",\"pages\":\"658-658\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBM.2011.129\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2011.129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

现实世界的数据集(相对于随机对照临床试验的数据)越来越多地来自医疗保健行业。来自电子病历/电子病历、保险索赔、药房记录、疾病登记等的大型数据库在用于支持药物研发活动时面临着独特的挑战。医疗保健数据的这种“二次使用”通常始于探索阶段，即研究人员对可用数据进行高级视图并开始“连接点”。数据探索是一个高度动态的过程:探索路径经常变化，有时收敛，有时发散，并且经常导致死胡同。只有一小部分探索性结果最终被正式分析，以获得定量的见解。由于数据探索的这种动态性质，产生假设的研究人员，即领域专家，可以直接在可用的数据空间中进行探索，这一点至关重要。对大型医疗保健数据集的数据探索通常是一个瓶颈，因为这些数据集在质量、完整性、一致性等方面往往难以理解。我们将讨论这一新兴领域，重点关注案例研究，以说明现实世界数据和技术进步的强大融合，以帮助利用这些数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Data Exploration in Secondary Use of Healthcare Data

Real world data sets (as opposed to data from randomized, controlled clinical trials) are becoming increasing available from the healthcare industry. Large databases from EMRs/EHRs, insurance claims, pharmacy records, disease registries etc present unique challenges when they are utilized to support pharmaceutical R&D activities. Such "secondary use" of healthcare data usually starts with an exploratory phase when the researcher takes a high-level view of the available data and starts to "connect the dots". Data exploration is a highly dynamic process: exploratory paths change frequently, sometimes converging, other times diverging, and often resulting in dead ends. Only a small subset of exploratory results end up being formally analyzed to derive quantitative insights. Because of this dynamic nature of data exploration, it is critical that researchers who generate hypotheses, the domain experts, can directly explore in the available data space. Data exploration on large healthcare data sets is often a bottleneck because these data sets tend to be poorly understood in terms of their quality, completeness, consistency, etc. We will discuss this emerging landscape, focusing on case studies to illustrate the powerful convergence of real-world data and technological advancements to help leverage this data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)

自引率

0.00%

发文量