Diagnostic Performance of ChatGPT-4.0 in Histopathological Analysis of Gliomas: A Single Institution Experience.

IF 1.2 4区医学 Q4 CLINICAL NEUROLOGY

Neuropathology Pub Date : 2025-08-01 DOI:10.1111/neup.70023

Manuel Mazzucchelli, Serena Salzano, Rosario Caltabiano, Gaetano Magro, Francesco Certo, Giuseppe Barbagallo, Giuseppe Broggi

{"title":"Diagnostic Performance of ChatGPT-4.0 in Histopathological Analysis of Gliomas: A Single Institution Experience.","authors":"Manuel Mazzucchelli, Serena Salzano, Rosario Caltabiano, Gaetano Magro, Francesco Certo, Giuseppe Barbagallo, Giuseppe Broggi","doi":"10.1111/neup.70023","DOIUrl":null,"url":null,"abstract":"<p><p>This study aimed to evaluate the performance of ChatGPT-4.0 as a diagnostic support tool for pathologists in identifying different types of gliomas based on histopathological data and to compare its performance with that of another artificial intelligence tool (Gemini 2.5 Pro). A retrospective analysis was performed on 25 cases with histopathological descriptions. The dataset, anonymized for patient confidentiality, included clinical details such as age, sex, and site, along with two histological images for each case, obtained from the archive files of the Anatomic Pathology section, Department of Medical, Surgical Sciences and Advanced Technologies \"G.F. Ingrassia\" University of Catania, Italy. ChatGPT-4.0 was tasked with generating diagnoses, which were classified as correct, similar, or different when compared to the pathologists' conclusions and the diagnoses provided by Gemini. ChatGPT-4.0 achieved a diagnostic accuracy of 88%, correctly identifying 22 out of 25 cases. No significant differences in diagnostic performance were observed between male and female patients. The AI performed exceptionally well in diagnosing glioblastomas, with a 100% accuracy rate, while two oligodendrogliomas and one astrocytoma IDH-mutant G3 were misdiagnosed. A comparative evaluation with Gemini 2.5 Pro was also conducted, although its contribution was limited to a qualitative comparison based on the same dataset. ChatGPT-4.0 demonstrated moderate accuracy in the histopathological diagnosis of gliomas, with little variability depending on glioma subtype. While its performance highlights potential for future integration into clinical workflows, significant improvements are required to ensure its reliability and effectiveness in diagnostic applications. Trial Registration: ce 165/2015/PO.</p>","PeriodicalId":19204,"journal":{"name":"Neuropathology","volume":"45 4","pages":"e70023"},"PeriodicalIF":1.2000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12305399/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuropathology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/neup.70023","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

This study aimed to evaluate the performance of ChatGPT-4.0 as a diagnostic support tool for pathologists in identifying different types of gliomas based on histopathological data and to compare its performance with that of another artificial intelligence tool (Gemini 2.5 Pro). A retrospective analysis was performed on 25 cases with histopathological descriptions. The dataset, anonymized for patient confidentiality, included clinical details such as age, sex, and site, along with two histological images for each case, obtained from the archive files of the Anatomic Pathology section, Department of Medical, Surgical Sciences and Advanced Technologies "G.F. Ingrassia" University of Catania, Italy. ChatGPT-4.0 was tasked with generating diagnoses, which were classified as correct, similar, or different when compared to the pathologists' conclusions and the diagnoses provided by Gemini. ChatGPT-4.0 achieved a diagnostic accuracy of 88%, correctly identifying 22 out of 25 cases. No significant differences in diagnostic performance were observed between male and female patients. The AI performed exceptionally well in diagnosing glioblastomas, with a 100% accuracy rate, while two oligodendrogliomas and one astrocytoma IDH-mutant G3 were misdiagnosed. A comparative evaluation with Gemini 2.5 Pro was also conducted, although its contribution was limited to a qualitative comparison based on the same dataset. ChatGPT-4.0 demonstrated moderate accuracy in the histopathological diagnosis of gliomas, with little variability depending on glioma subtype. While its performance highlights potential for future integration into clinical workflows, significant improvements are required to ensure its reliability and effectiveness in diagnostic applications. Trial Registration: ce 165/2015/PO.

Abstract Image

查看原文本刊更多论文

ChatGPT-4.0在胶质瘤组织病理学分析中的诊断性能：单一机构经验。

本研究旨在评估ChatGPT-4.0作为病理学家根据组织病理学数据识别不同类型胶质瘤的诊断支持工具的性能，并将其与另一种人工智能工具（Gemini 2.5 Pro）的性能进行比较。回顾性分析25例有组织病理描述的病例。该数据集为患者保密而匿名化，包括临床细节，如年龄、性别和部位，以及每个病例的两张组织学图像，从意大利卡塔尼亚大学医学、外科科学和先进技术部“G.F. inggrassia”解剖病理学部门的档案文件中获得。ChatGPT-4.0的任务是生成诊断，与病理学家的结论和Gemini提供的诊断相比，这些诊断被分类为正确、相似或不同。ChatGPT-4.0的诊断准确率达到88%，正确识别了25例中的22例。男性和女性患者的诊断表现无显著差异。人工智能在胶质母细胞瘤诊断方面表现异常出色，准确率为100%，而2例少突胶质细胞瘤和1例星形细胞瘤idh -突变G3被误诊。还进行了与Gemini 2.5 Pro的比较评估，尽管其贡献仅限于基于相同数据集的定性比较。ChatGPT-4.0在胶质瘤的组织病理学诊断中表现出中等的准确性，胶质瘤亚型的差异很小。虽然其性能突出了未来集成到临床工作流程中的潜力，但仍需要进行重大改进，以确保其在诊断应用中的可靠性和有效性。试验注册：ce 165/2015/PO。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neuropathology 医学-病理学

CiteScore

4.10

自引率

4.30%

发文量

105

审稿时长

6-12 weeks

期刊介绍： Neuropathology is an international journal sponsored by the Japanese Society of Neuropathology and publishes peer-reviewed original papers dealing with all aspects of human and experimental neuropathology and related fields of research. The Journal aims to promote the international exchange of results and encourages authors from all countries to submit papers in the following categories: Original Articles, Case Reports, Short Communications, Occasional Reviews, Editorials and Letters to the Editor. All articles are peer-reviewed by at least two researchers expert in the field of the submitted paper.