Retrieval Augmented Generation-Enabled Large Language Model for Risk Stratification of Cutaneous Squamous Cell Carcinoma.

IF 11 1区 医学 Q1 DERMATOLOGY
Neil K Jairath, Vartan Pahalyants, Shayan Cheraghlou, Derek Maas, Nayoung Lee, Maressa C Criscito, Mary L Stevenson, Apoorva Mehta, Zachary Leibovit-Reiben, Alyssa Stockard, Nicole Doudican, Aaron Mangold, John A Carucci
{"title":"Retrieval Augmented Generation-Enabled Large Language Model for Risk Stratification of Cutaneous Squamous Cell Carcinoma.","authors":"Neil K Jairath, Vartan Pahalyants, Shayan Cheraghlou, Derek Maas, Nayoung Lee, Maressa C Criscito, Mary L Stevenson, Apoorva Mehta, Zachary Leibovit-Reiben, Alyssa Stockard, Nicole Doudican, Aaron Mangold, John A Carucci","doi":"10.1001/jamadermatol.2025.1614","DOIUrl":null,"url":null,"abstract":"<p><strong>Importance: </strong>There exists substantial heterogeneity in outcomes within T stages for patients with cutaneous squamous cell carcinoma (cSCC).</p><p><strong>Objective: </strong>To determine whether a customized generative pretrained transformer model, trained on a comprehensive dataset with more than 1 trillion parameters and equipped with relevant focused context and retrieval augmented generation (RAG), could excel in aggregating and interpreting vast quantities of data to develop a novel class-based risk stratification system that outperforms the current standards.</p><p><strong>Design, setting, and participants: </strong>To build the RAG knowledge base, a systematic review of the literature was conducted that addressed risk factors for poor outcomes in cSCC. Using the RAG-enabled generative pretrained transformer (GPT) model, we developed a novel class-based risk stratification system that assigned point values for risk factors, culminating in a GPT-based prognostication system called the artificial intelligence-derived risk score (AIRIS). The system's performance was validated on a combined prospective and retrospective cohort of 2379 primary cSCC tumors (1996-2023) with at least 36 months of follow-up, against Brigham and Women's Hospital (BWH) and American Joint Committee on Cancer Staging Manual, eighth edition (AJCC8) systems in stratifying risk for locoregional recurrence (LR), nodal metastasis (NM), distant metastasis (DM), and disease-specific death (DSD).</p><p><strong>Main outcomes and measures: </strong>Performance metrics evaluated included distinctiveness, homogeneity, and monotonicity, as defined by the AJCC8, as well as sensitivity, specificity, positive predictive value, negative predictive value, accuracy, the area under the receiver operating characteristic curve, and concordance.</p><p><strong>Results: </strong>The median age at diagnosis was 73 (IQR, 64-81) years, with 38.5% female patients and 61.5% male patients. The AIRIS prognostication system demonstrated superior sensitivity across all outcomes (LR, 49.1%; NM, 73.7%; DM, 82.5%; and DSD, 72.2%) and the highest area under the receiver operating characteristic curve values (LR, 0.69; NM, 0.81; DM, 0.85; and DSD, 0.80), indicating significantly enhanced discriminative capability compared with the BWH and AJCC8 systems. While all systems were comparably distinctive, the AIRIS prognostication system consistently demonstrated the lowest proportion of tumors exhibiting poor outcomes in low-risk categories, suggesting its improved homogeneity and monotonicity.</p><p><strong>Conclusions and relevance: </strong>The results of this diagnostic study suggest that the AIRIS system outperforms the existing BWH and AJCC8 prognostication systems, potentially providing a more effective tool for predicting poor outcomes in cSCC. This study illustrates the potential of large language models in refining prognostic tools, offering implications for treating patients with cancer.</p>","PeriodicalId":14734,"journal":{"name":"JAMA dermatology","volume":" ","pages":"796-804"},"PeriodicalIF":11.0000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12159850/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMA dermatology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1001/jamadermatol.2025.1614","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DERMATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Importance: There exists substantial heterogeneity in outcomes within T stages for patients with cutaneous squamous cell carcinoma (cSCC).

Objective: To determine whether a customized generative pretrained transformer model, trained on a comprehensive dataset with more than 1 trillion parameters and equipped with relevant focused context and retrieval augmented generation (RAG), could excel in aggregating and interpreting vast quantities of data to develop a novel class-based risk stratification system that outperforms the current standards.

Design, setting, and participants: To build the RAG knowledge base, a systematic review of the literature was conducted that addressed risk factors for poor outcomes in cSCC. Using the RAG-enabled generative pretrained transformer (GPT) model, we developed a novel class-based risk stratification system that assigned point values for risk factors, culminating in a GPT-based prognostication system called the artificial intelligence-derived risk score (AIRIS). The system's performance was validated on a combined prospective and retrospective cohort of 2379 primary cSCC tumors (1996-2023) with at least 36 months of follow-up, against Brigham and Women's Hospital (BWH) and American Joint Committee on Cancer Staging Manual, eighth edition (AJCC8) systems in stratifying risk for locoregional recurrence (LR), nodal metastasis (NM), distant metastasis (DM), and disease-specific death (DSD).

Main outcomes and measures: Performance metrics evaluated included distinctiveness, homogeneity, and monotonicity, as defined by the AJCC8, as well as sensitivity, specificity, positive predictive value, negative predictive value, accuracy, the area under the receiver operating characteristic curve, and concordance.

Results: The median age at diagnosis was 73 (IQR, 64-81) years, with 38.5% female patients and 61.5% male patients. The AIRIS prognostication system demonstrated superior sensitivity across all outcomes (LR, 49.1%; NM, 73.7%; DM, 82.5%; and DSD, 72.2%) and the highest area under the receiver operating characteristic curve values (LR, 0.69; NM, 0.81; DM, 0.85; and DSD, 0.80), indicating significantly enhanced discriminative capability compared with the BWH and AJCC8 systems. While all systems were comparably distinctive, the AIRIS prognostication system consistently demonstrated the lowest proportion of tumors exhibiting poor outcomes in low-risk categories, suggesting its improved homogeneity and monotonicity.

Conclusions and relevance: The results of this diagnostic study suggest that the AIRIS system outperforms the existing BWH and AJCC8 prognostication systems, potentially providing a more effective tool for predicting poor outcomes in cSCC. This study illustrates the potential of large language models in refining prognostic tools, offering implications for treating patients with cancer.

皮肤鳞状细胞癌风险分层的检索增强生成大语言模型。
重要性:皮肤鳞状细胞癌(cSCC)患者的T期预后存在很大的异质性。目的:确定一个定制的生成式预训练变压器模型,在超过1万亿个参数的综合数据集上进行训练,并配备相关的聚焦上下文和检索增强生成(RAG),是否能够在聚合和解释大量数据方面表现出色,从而开发出优于当前标准的基于类别的新型风险分层系统。设计、设置和参与者:为了建立RAG知识库,对cSCC不良结局的危险因素进行了系统的文献回顾。使用支持rag的生成预训练变压器(GPT)模型,我们开发了一种新的基于类别的风险分层系统,该系统为风险因素分配了分值,最终形成了基于GPT的预测系统,称为人工智能衍生风险评分(AIRIS)。该系统的性能在2379例原发性cSCC肿瘤(1996-2023)的联合前瞻性和回顾性队列中得到验证,随访至少36个月,对照布里格姆妇女医院(BWH)和美国癌症分期手册联合委员会第八版(AJCC8)系统,对局部复发(LR)、淋巴结转移(NM)、远处转移(DM)和疾病特异性死亡(DSD)的风险进行分层。主要结果和测量指标:评估的性能指标包括AJCC8定义的独特性、同质性和单调性,以及敏感性、特异性、阳性预测值、阴性预测值、准确性、受试者工作特征曲线下面积和一致性。结果:诊断时中位年龄73岁(IQR, 64-81)岁,其中女性占38.5%,男性占61.5%。AIRIS预测系统在所有结局中均表现出优越的敏感性(LR, 49.1%;海里,73.7%;DM, 82.5%;和DSD, 72.2%)和受试者工作特征曲线值下的最高面积(LR, 0.69;NM, 0.81;DM, 0.85;DSD为0.80),表明与BWH和AJCC8系统相比,识别能力显著增强。虽然所有系统都具有相当的独特性,但AIRIS预测系统始终显示低风险类别中表现出不良预后的肿瘤比例最低,这表明其改善了同质性和单调性。结论和相关性:本诊断研究的结果表明,AIRIS系统优于现有的BWH和AJCC8预测系统,可能为预测cSCC的不良预后提供更有效的工具。这项研究说明了大型语言模型在改进预后工具方面的潜力,为治疗癌症患者提供了启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
JAMA dermatology
JAMA dermatology DERMATOLOGY-
CiteScore
14.10
自引率
5.50%
发文量
300
期刊介绍: JAMA Dermatology is an international peer-reviewed journal that has been in continuous publication since 1882. It began publication by the American Medical Association in 1920 as Archives of Dermatology and Syphilology. The journal publishes material that helps in the development and testing of the effectiveness of diagnosis and treatment in medical and surgical dermatology, pediatric and geriatric dermatology, and oncologic and aesthetic dermatologic surgery. JAMA Dermatology is a member of the JAMA Network, a consortium of peer-reviewed, general medical and specialty publications. It is published online weekly, every Wednesday, and in 12 print/online issues a year. The mission of the journal is to elevate the art and science of health and diseases of skin, hair, nails, and mucous membranes, and their treatment, with the aim of enabling dermatologists to deliver evidence-based, high-value medical and surgical dermatologic care. The journal publishes a broad range of innovative studies and trials that shift research and clinical practice paradigms, expand the understanding of the burden of dermatologic diseases and key outcomes, improve the practice of dermatology, and ensure equitable care to all patients. It also features research and opinion examining ethical, moral, socioeconomic, educational, and political issues relevant to dermatologists, aiming to enable ongoing improvement to the workforce, scope of practice, and the training of future dermatologists. JAMA Dermatology aims to be a leader in developing initiatives to improve diversity, equity, and inclusion within the specialty and within dermatology medical publishing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信