Meng-Fei Xian, Wen-Tong Lan, Zhe Zhang, Ming-De Li, Xin-Xin Lin, Yang Huang, Hui Huang, Li-Da Chen, Qing-Hua Huang, Wei Wang
{"title":"Enhancing hepatocellular carcinoma diagnosis in non-high-risk patients: a customized ChatGPT model integrating contrast-enhanced ultrasound.","authors":"Meng-Fei Xian, Wen-Tong Lan, Zhe Zhang, Ming-De Li, Xin-Xin Lin, Yang Huang, Hui Huang, Li-Da Chen, Qing-Hua Huang, Wei Wang","doi":"10.1007/s11547-025-01994-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aims to improve hepatocellular carcinoma (HCC) diagnostic accuracy in non-high-risk populations by utilizing GPTs that incorporate integrated risk coefficients, and to explore its feasibility.</p><p><strong>Material and methods: </strong>Between August 2016 and June 2019, patients with focal liver lesions (FLLs) in non-high-risk populations, confirmed by histopathology or clinical/imaging evidence, were retrospectively included. A logistic regression model was developed using baseline characteristics and contrast-enhanced ultrasound (CEUS) features to identify independent HCC risk factors. Three ChatGPT-based models were evaluated: ChatGPT 4o (a general-purpose model developed by OpenAI), BaseGPT (a customized model with HCC diagnostic knowledge), and RiskGPT (a further customized model integrating HCC knowledge and identified risk factors). Their intra-agreement and diagnostic performance were compared.</p><p><strong>Results: </strong>Logistic regression identified male, obesity, HBcAb or HBeAb positivity, elevated alpha-fetoprotein, and mild washout on CEUS as associated with HCC. RiskGPT achieved the highest area under a receiver operating characteristic curve (AUC) (0.89) and demonstrated superior accuracy (90.3%) in HCC identification; significantly outperforming both ChatGPT 4o (AUC 0.79, P = 0.002; accuracy 83.1%, P = 0.02) and BaseGPT (AUC 0.81, P = 0.008; accuracy 80.6%, P = 0.002). RiskGPT demonstrated superior sensitivity compared to ChatGPT 4o (85.5% vs. 66.3%) and outperformed BaseGPT in specificity (92.7% vs. 80.6%) and positive predictive value (85.5% vs. 67.7%) (all P < 0.001). Additionally, RiskGPT showed substantial intra-consistency in diagnosing FLLs, with a κ value of 0.78.</p><p><strong>Conclusion: </strong>RiskGPT improves HCC diagnostic accuracy in non-high-risk patients by integrating clinical, imaging features, and risk coefficients, demonstrating significant diagnostic potential.</p>","PeriodicalId":20817,"journal":{"name":"Radiologia Medica","volume":" ","pages":"1013-1023"},"PeriodicalIF":4.8000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiologia Medica","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11547-025-01994-0","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/15 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: This study aims to improve hepatocellular carcinoma (HCC) diagnostic accuracy in non-high-risk populations by utilizing GPTs that incorporate integrated risk coefficients, and to explore its feasibility.
Material and methods: Between August 2016 and June 2019, patients with focal liver lesions (FLLs) in non-high-risk populations, confirmed by histopathology or clinical/imaging evidence, were retrospectively included. A logistic regression model was developed using baseline characteristics and contrast-enhanced ultrasound (CEUS) features to identify independent HCC risk factors. Three ChatGPT-based models were evaluated: ChatGPT 4o (a general-purpose model developed by OpenAI), BaseGPT (a customized model with HCC diagnostic knowledge), and RiskGPT (a further customized model integrating HCC knowledge and identified risk factors). Their intra-agreement and diagnostic performance were compared.
Results: Logistic regression identified male, obesity, HBcAb or HBeAb positivity, elevated alpha-fetoprotein, and mild washout on CEUS as associated with HCC. RiskGPT achieved the highest area under a receiver operating characteristic curve (AUC) (0.89) and demonstrated superior accuracy (90.3%) in HCC identification; significantly outperforming both ChatGPT 4o (AUC 0.79, P = 0.002; accuracy 83.1%, P = 0.02) and BaseGPT (AUC 0.81, P = 0.008; accuracy 80.6%, P = 0.002). RiskGPT demonstrated superior sensitivity compared to ChatGPT 4o (85.5% vs. 66.3%) and outperformed BaseGPT in specificity (92.7% vs. 80.6%) and positive predictive value (85.5% vs. 67.7%) (all P < 0.001). Additionally, RiskGPT showed substantial intra-consistency in diagnosing FLLs, with a κ value of 0.78.
Conclusion: RiskGPT improves HCC diagnostic accuracy in non-high-risk patients by integrating clinical, imaging features, and risk coefficients, demonstrating significant diagnostic potential.
期刊介绍:
Felice Perussia founded La radiologia medica in 1914. It is a peer-reviewed journal and serves as the official journal of the Italian Society of Medical and Interventional Radiology (SIRM). The primary purpose of the journal is to disseminate information related to Radiology, especially advancements in diagnostic imaging and related disciplines. La radiologia medica welcomes original research on both fundamental and clinical aspects of modern radiology, with a particular focus on diagnostic and interventional imaging techniques. It also covers topics such as radiotherapy, nuclear medicine, radiobiology, health physics, and artificial intelligence in the context of clinical implications. The journal includes various types of contributions such as original articles, review articles, editorials, short reports, and letters to the editor. With an esteemed Editorial Board and a selection of insightful reports, the journal is an indispensable resource for radiologists and professionals in related fields. Ultimately, La radiologia medica aims to serve as a platform for international collaboration and knowledge sharing within the radiological community.