Real-world deployment of a fine-tuned pathology foundation model for lung cancer biomarker detection

IF 58.7 1区 医学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Gabriele Campanella, Neeraj Kumar, Swaraj Nanda, Siddharth Singi, Eugene Fluder, Ricky Kwan, Silke Muehlstedt, Nicole Pfarr, Peter J. Schüffler, Ida Häggström, Noora Neittaanmäki, Levent M. Akyürek, Alina Basnet, Tamara Jamaspishvili, Michel R. Nasr, Matthew M. Croken, Fred R. Hirsch, Arielle Elkrief, Helena Yu, Orly Ardon, Gregory M. Goldgof, Meera Hameed, Jane Houldsworth, Maria Arcila, Thomas J. Fuchs, Chad Vanderbilt
{"title":"Real-world deployment of a fine-tuned pathology foundation model for lung cancer biomarker detection","authors":"Gabriele Campanella, Neeraj Kumar, Swaraj Nanda, Siddharth Singi, Eugene Fluder, Ricky Kwan, Silke Muehlstedt, Nicole Pfarr, Peter J. Schüffler, Ida Häggström, Noora Neittaanmäki, Levent M. Akyürek, Alina Basnet, Tamara Jamaspishvili, Michel R. Nasr, Matthew M. Croken, Fred R. Hirsch, Arielle Elkrief, Helena Yu, Orly Ardon, Gregory M. Goldgof, Meera Hameed, Jane Houldsworth, Maria Arcila, Thomas J. Fuchs, Chad Vanderbilt","doi":"10.1038/s41591-025-03780-x","DOIUrl":null,"url":null,"abstract":"<p>Artificial intelligence models using digital histopathology slides stained with hematoxylin and eosin offer promising, tissue-preserving diagnostic tools for patients with cancer. Despite their advantages, their clinical utility in real-world settings remains unproven. Assessing EGFR mutations in lung adenocarcinoma demands rapid, accurate and cost-effective tests that preserve tissue for genomic sequencing. PCR-based assays provide rapid results but with reduced accuracy compared with next-generation sequencing and require additional tissue. Computational biomarkers leveraging modern foundation models can address these limitations. Here we assembled a large international clinical dataset of digital lung adenocarcinoma slides (<i>N</i> = 8,461) to develop a computational EGFR biomarker. Our model fine-tunes an open-source foundation model, improving task-specific performance with out-of-center generalization and clinical-grade accuracy on primary and metastatic specimens (mean area under the curve: internal 0.847, external 0.870). To evaluate real-world clinical translation, we conducted a prospective silent trial of the biomarker on primary samples, achieving an area under the curve of 0.890. The artificial-intelligence-assisted workflow reduced the number of rapid molecular tests needed by up to 43% while maintaining the current clinical standard performance. Our retrospective and prospective analyses demonstrate the real-world clinical utility of a computational pathology biomarker.</p>","PeriodicalId":19037,"journal":{"name":"Nature Medicine","volume":"21 1","pages":""},"PeriodicalIF":58.7000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41591-025-03780-x","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Artificial intelligence models using digital histopathology slides stained with hematoxylin and eosin offer promising, tissue-preserving diagnostic tools for patients with cancer. Despite their advantages, their clinical utility in real-world settings remains unproven. Assessing EGFR mutations in lung adenocarcinoma demands rapid, accurate and cost-effective tests that preserve tissue for genomic sequencing. PCR-based assays provide rapid results but with reduced accuracy compared with next-generation sequencing and require additional tissue. Computational biomarkers leveraging modern foundation models can address these limitations. Here we assembled a large international clinical dataset of digital lung adenocarcinoma slides (N = 8,461) to develop a computational EGFR biomarker. Our model fine-tunes an open-source foundation model, improving task-specific performance with out-of-center generalization and clinical-grade accuracy on primary and metastatic specimens (mean area under the curve: internal 0.847, external 0.870). To evaluate real-world clinical translation, we conducted a prospective silent trial of the biomarker on primary samples, achieving an area under the curve of 0.890. The artificial-intelligence-assisted workflow reduced the number of rapid molecular tests needed by up to 43% while maintaining the current clinical standard performance. Our retrospective and prospective analyses demonstrate the real-world clinical utility of a computational pathology biomarker.

Abstract Image

真实世界部署微调病理基础模型肺癌生物标志物检测
使用苏木精和伊红染色的数字组织病理学切片的人工智能模型为癌症患者提供了有希望的、保存组织的诊断工具。尽管它们具有优势,但它们在现实世界中的临床应用仍未得到证实。评估肺腺癌中的EGFR突变需要快速、准确和具有成本效益的检测,以保存组织以进行基因组测序。基于pcr的分析提供快速结果,但与下一代测序相比准确性降低,并且需要额外的组织。利用现代基础模型的计算生物标志物可以解决这些限制。在这里,我们收集了一个大型的国际临床数据集的数字肺腺癌幻灯片(N = 8,461),以开发一个计算EGFR生物标志物。我们的模型对开源基础模型进行了微调,通过偏离中心的泛化和原发性和转移性标本的临床级准确性提高了特定任务的性能(平均曲线下面积:内部0.847,外部0.870)。为了评估真实世界的临床翻译,我们对主要样本进行了生物标志物的前瞻性沉默试验,曲线下面积为0.890。人工智能辅助的工作流程将所需的快速分子测试数量减少了43%,同时保持了当前的临床标准性能。我们的回顾性和前瞻性分析证明了计算病理学生物标志物在现实世界中的临床应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Nature Medicine
Nature Medicine 医学-生化与分子生物学
CiteScore
100.90
自引率
0.70%
发文量
525
审稿时长
1 months
期刊介绍: Nature Medicine is a monthly journal publishing original peer-reviewed research in all areas of medicine. The publication focuses on originality, timeliness, interdisciplinary interest, and the impact on improving human health. In addition to research articles, Nature Medicine also publishes commissioned content such as News, Reviews, and Perspectives. This content aims to provide context for the latest advances in translational and clinical research, reaching a wide audience of M.D. and Ph.D. readers. All editorial decisions for the journal are made by a team of full-time professional editors. Nature Medicine consider all types of clinical research, including: -Case-reports and small case series -Clinical trials, whether phase 1, 2, 3 or 4 -Observational studies -Meta-analyses -Biomarker studies -Public and global health studies Nature Medicine is also committed to facilitating communication between translational and clinical researchers. As such, we consider “hybrid” studies with preclinical and translational findings reported alongside data from clinical studies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信