Yih Chung Tham, Jocelyn Hui Lin Goh, Paul Nderitu, Yukun Zhou, An Ran Ran, Sahana Srinivasan, Gabriel Dawei Yang, Gatera Fiston Kitema, Polly Rawlinson, Hongyang Jiang, Ke Zou, Carol Y. Cheung, Pearse A. Keane
{"title":"Building the world’s first truly global medical foundation model","authors":"Yih Chung Tham, Jocelyn Hui Lin Goh, Paul Nderitu, Yukun Zhou, An Ran Ran, Sahana Srinivasan, Gabriel Dawei Yang, Gatera Fiston Kitema, Polly Rawlinson, Hongyang Jiang, Ke Zou, Carol Y. Cheung, Pearse A. Keane","doi":"10.1038/s41591-025-03859-5","DOIUrl":null,"url":null,"abstract":"<p>Since 2022, the field of medical artificial intelligence (AI) has begun a shift toward foundation models, machine learning systems that are trained on broad data at scale and are adaptable to a wide range of downstream tasks<sup>1,2</sup>. Medical foundation models are rapidly evolving, driven by the synergy of expanding medical data repositories, advances in neural network architecture (especially transformers), self-supervised learning approaches and computing power. Medical foundation models are capable of performing, or can be adapted to perform, a range of medical tasks with a minimal amount of annotated data. To date, several promising breakthroughs with foundation models have been demonstrated across diverse medical domains, including pathology, radiology and ophthalmology<sup>3,4,5</sup>.</p><p>Nevertheless, training robust medical foundation models requires large, diverse and clinically useful representative data<sup>6</sup>. Assembling such datasets remains a major challenge for the research community because of strict data-sharing regulations intended to protect patient privacy and ensure ethical compliance. For these reasons, most existing foundational models are trained on datasets that are geographically and demographically ‘narrow’ (that is, not globally representative), limiting their generalizability and effectiveness, particularly in under-represented regions and populations<sup>6,7,8</sup>.</p>","PeriodicalId":19037,"journal":{"name":"Nature Medicine","volume":"52 1","pages":""},"PeriodicalIF":50.0000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41591-025-03859-5","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Since 2022, the field of medical artificial intelligence (AI) has begun a shift toward foundation models, machine learning systems that are trained on broad data at scale and are adaptable to a wide range of downstream tasks1,2. Medical foundation models are rapidly evolving, driven by the synergy of expanding medical data repositories, advances in neural network architecture (especially transformers), self-supervised learning approaches and computing power. Medical foundation models are capable of performing, or can be adapted to perform, a range of medical tasks with a minimal amount of annotated data. To date, several promising breakthroughs with foundation models have been demonstrated across diverse medical domains, including pathology, radiology and ophthalmology3,4,5.
Nevertheless, training robust medical foundation models requires large, diverse and clinically useful representative data6. Assembling such datasets remains a major challenge for the research community because of strict data-sharing regulations intended to protect patient privacy and ensure ethical compliance. For these reasons, most existing foundational models are trained on datasets that are geographically and demographically ‘narrow’ (that is, not globally representative), limiting their generalizability and effectiveness, particularly in under-represented regions and populations6,7,8.
期刊介绍:
Nature Medicine is a monthly journal publishing original peer-reviewed research in all areas of medicine. The publication focuses on originality, timeliness, interdisciplinary interest, and the impact on improving human health. In addition to research articles, Nature Medicine also publishes commissioned content such as News, Reviews, and Perspectives. This content aims to provide context for the latest advances in translational and clinical research, reaching a wide audience of M.D. and Ph.D. readers. All editorial decisions for the journal are made by a team of full-time professional editors.
Nature Medicine consider all types of clinical research, including:
-Case-reports and small case series
-Clinical trials, whether phase 1, 2, 3 or 4
-Observational studies
-Meta-analyses
-Biomarker studies
-Public and global health studies
Nature Medicine is also committed to facilitating communication between translational and clinical researchers. As such, we consider “hybrid” studies with preclinical and translational findings reported alongside data from clinical studies.