展望回顾：在未来的分布式数据网络分析中，生成式人工智能是否会使通用数据模型过时？

IF 3.4 3区医学 Q2 PHARMACOLOGY & PHARMACY

Therapeutic Advances in Drug Safety Pub Date : 2025-04-21 eCollection Date: 2025-01-01 DOI:10.1177/20420986251332743

Jeffery L Painter, Darmendra Ramcharran, Andrew Bate

{"title":"展望回顾：在未来的分布式数据网络分析中，生成式人工智能是否会使通用数据模型过时？","authors":"Jeffery L Painter, Darmendra Ramcharran, Andrew Bate","doi":"10.1177/20420986251332743","DOIUrl":null,"url":null,"abstract":"Integrating real-world healthcare data is challenging due to diverse formats and terminologies, making standardization resource-intensive. While Common Data Models (CDMs) facilitate interoperability, they often cause information loss, exhibit semantic inconsistencies, and are labor-intensive to implement and update. We explore how generative artificial intelligence (GenAI), especially large language models (LLMs), could make CDMs obsolete in quantitative healthcare data analysis by interpreting natural language queries and generating code, enabling direct interaction with raw data. Knowledge graphs (KGs) standardize relationships and semantics across heterogeneous data, preserving integrity. This perspective review proposes a fourth generation of distributed data network analysis, building on previous generations categorized by their approach to data standardization and utilization. It emphasizes the potential of GenAI to overcome the limitations CDMs with GenAI-enabled access, KGs, and automatic code generation. A data commons may further enhance this capability, and KGs may well be needed to enable effective GenAI. Addressing privacy, security, and governance is critical; any new method must ensure protections comparable to CDM-based models. Our approach would aim to enable efficient, real-time analyses across diverse datasets and enhance patient safety. We recommend prioritizing research to assess how GenAI can transform quantitative healthcare data analysis by overcoming current limitations.","PeriodicalId":23012,"journal":{"name":"Therapeutic Advances in Drug Safety","volume":"16 ","pages":"20420986251332743"},"PeriodicalIF":3.4000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033412/pdf/","citationCount":"0","resultStr":"{\"title\":\"Perspective review: Will generative AI make common data models obsolete in future analyses of distributed data networks?\",\"authors\":\"Jeffery L Painter, Darmendra Ramcharran, Andrew Bate\",\"doi\":\"10.1177/20420986251332743\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Integrating real-world healthcare data is challenging due to diverse formats and terminologies, making standardization resource-intensive. While Common Data Models (CDMs) facilitate interoperability, they often cause information loss, exhibit semantic inconsistencies, and are labor-intensive to implement and update. We explore how generative artificial intelligence (GenAI), especially large language models (LLMs), could make CDMs obsolete in quantitative healthcare data analysis by interpreting natural language queries and generating code, enabling direct interaction with raw data. Knowledge graphs (KGs) standardize relationships and semantics across heterogeneous data, preserving integrity. This perspective review proposes a fourth generation of distributed data network analysis, building on previous generations categorized by their approach to data standardization and utilization. It emphasizes the potential of GenAI to overcome the limitations CDMs with GenAI-enabled access, KGs, and automatic code generation. A data commons may further enhance this capability, and KGs may well be needed to enable effective GenAI. Addressing privacy, security, and governance is critical; any new method must ensure protections comparable to CDM-based models. Our approach would aim to enable efficient, real-time analyses across diverse datasets and enhance patient safety. We recommend prioritizing research to assess how GenAI can transform quantitative healthcare data analysis by overcoming current limitations.\",\"PeriodicalId\":23012,\"journal\":{\"name\":\"Therapeutic Advances in Drug Safety\",\"volume\":\"16 \",\"pages\":\"20420986251332743\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033412/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Therapeutic Advances in Drug Safety\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/20420986251332743\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Therapeutic Advances in Drug Safety","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/20420986251332743","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 0

摘要

由于格式和术语的多样性，集成现实世界的医疗保健数据具有挑战性，这使得标准化需要大量资源。虽然公共数据模型（cdm）促进了互操作性，但它们通常会导致信息丢失，表现出语义不一致，并且在实现和更新方面需要大量的劳动。我们探讨了生成式人工智能（GenAI），特别是大型语言模型（llm）如何通过解释自然语言查询和生成代码，使cdm在定量医疗数据分析中过时，从而实现与原始数据的直接交互。知识图（KGs）标准化了异构数据之间的关系和语义，保持了完整性。这一观点提出了第四代分布式数据网络分析，建立在前几代的基础上，根据他们的数据标准化和利用方法进行分类。它强调了GenAI的潜力，通过支持GenAI的访问、KGs和自动代码生成来克服cdm的局限性。数据共享可能进一步增强这种能力，并且很可能需要kg来实现有效的GenAI。解决隐私、安全和治理问题至关重要；任何新方法都必须确保与基于cdm的模型相媲美的保护。我们的方法旨在实现跨不同数据集的高效、实时分析，并提高患者的安全性。我们建议优先研究，以评估GenAI如何通过克服当前的局限性来改变定量医疗保健数据分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Perspective review: Will generative AI make common data models obsolete in future analyses of distributed data networks?

查看原文本刊更多论文

Perspective review: Will generative AI make common data models obsolete in future analyses of distributed data networks?

Integrating real-world healthcare data is challenging due to diverse formats and terminologies, making standardization resource-intensive. While Common Data Models (CDMs) facilitate interoperability, they often cause information loss, exhibit semantic inconsistencies, and are labor-intensive to implement and update. We explore how generative artificial intelligence (GenAI), especially large language models (LLMs), could make CDMs obsolete in quantitative healthcare data analysis by interpreting natural language queries and generating code, enabling direct interaction with raw data. Knowledge graphs (KGs) standardize relationships and semantics across heterogeneous data, preserving integrity. This perspective review proposes a fourth generation of distributed data network analysis, building on previous generations categorized by their approach to data standardization and utilization. It emphasizes the potential of GenAI to overcome the limitations CDMs with GenAI-enabled access, KGs, and automatic code generation. A data commons may further enhance this capability, and KGs may well be needed to enable effective GenAI. Addressing privacy, security, and governance is critical; any new method must ensure protections comparable to CDM-based models. Our approach would aim to enable efficient, real-time analyses across diverse datasets and enhance patient safety. We recommend prioritizing research to assess how GenAI can transform quantitative healthcare data analysis by overcoming current limitations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Therapeutic Advances in Drug Safety Medicine-Pharmacology (medical)

CiteScore

6.70

自引率

4.50%

发文量

审稿时长

9 weeks

期刊介绍： Therapeutic Advances in Drug Safety delivers the highest quality peer-reviewed articles, reviews, and scholarly comment on pioneering efforts and innovative studies pertaining to the safe use of drugs in patients. The journal has a strong clinical and pharmacological focus and is aimed at clinicians and researchers in drug safety, providing a forum in print and online for publishing the highest quality articles in this area. The editors welcome articles of current interest on research across all areas of drug safety, including therapeutic drug monitoring, pharmacoepidemiology, adverse drug reactions, drug interactions, pharmacokinetics, pharmacovigilance, medication/prescribing errors, risk management, ethics and regulation.