Manuel Mazzucchelli, Serena Salzano, Rosario Caltabiano, Gaetano Magro, Francesco Certo, Giuseppe Barbagallo, Giuseppe Broggi
{"title":"Diagnostic Performance of ChatGPT-4.0 in Histopathological Analysis of Gliomas: A Single Institution Experience.","authors":"Manuel Mazzucchelli, Serena Salzano, Rosario Caltabiano, Gaetano Magro, Francesco Certo, Giuseppe Barbagallo, Giuseppe Broggi","doi":"10.1111/neup.70023","DOIUrl":null,"url":null,"abstract":"<p><p>This study aimed to evaluate the performance of ChatGPT-4.0 as a diagnostic support tool for pathologists in identifying different types of gliomas based on histopathological data and to compare its performance with that of another artificial intelligence tool (Gemini 2.5 Pro). A retrospective analysis was performed on 25 cases with histopathological descriptions. The dataset, anonymized for patient confidentiality, included clinical details such as age, sex, and site, along with two histological images for each case, obtained from the archive files of the Anatomic Pathology section, Department of Medical, Surgical Sciences and Advanced Technologies \"G.F. Ingrassia\" University of Catania, Italy. ChatGPT-4.0 was tasked with generating diagnoses, which were classified as correct, similar, or different when compared to the pathologists' conclusions and the diagnoses provided by Gemini. ChatGPT-4.0 achieved a diagnostic accuracy of 88%, correctly identifying 22 out of 25 cases. No significant differences in diagnostic performance were observed between male and female patients. The AI performed exceptionally well in diagnosing glioblastomas, with a 100% accuracy rate, while two oligodendrogliomas and one astrocytoma IDH-mutant G3 were misdiagnosed. A comparative evaluation with Gemini 2.5 Pro was also conducted, although its contribution was limited to a qualitative comparison based on the same dataset. ChatGPT-4.0 demonstrated moderate accuracy in the histopathological diagnosis of gliomas, with little variability depending on glioma subtype. While its performance highlights potential for future integration into clinical workflows, significant improvements are required to ensure its reliability and effectiveness in diagnostic applications. Trial Registration: ce 165/2015/PO.</p>","PeriodicalId":19204,"journal":{"name":"Neuropathology","volume":"45 4","pages":"e70023"},"PeriodicalIF":1.2000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12305399/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuropathology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/neup.70023","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
This study aimed to evaluate the performance of ChatGPT-4.0 as a diagnostic support tool for pathologists in identifying different types of gliomas based on histopathological data and to compare its performance with that of another artificial intelligence tool (Gemini 2.5 Pro). A retrospective analysis was performed on 25 cases with histopathological descriptions. The dataset, anonymized for patient confidentiality, included clinical details such as age, sex, and site, along with two histological images for each case, obtained from the archive files of the Anatomic Pathology section, Department of Medical, Surgical Sciences and Advanced Technologies "G.F. Ingrassia" University of Catania, Italy. ChatGPT-4.0 was tasked with generating diagnoses, which were classified as correct, similar, or different when compared to the pathologists' conclusions and the diagnoses provided by Gemini. ChatGPT-4.0 achieved a diagnostic accuracy of 88%, correctly identifying 22 out of 25 cases. No significant differences in diagnostic performance were observed between male and female patients. The AI performed exceptionally well in diagnosing glioblastomas, with a 100% accuracy rate, while two oligodendrogliomas and one astrocytoma IDH-mutant G3 were misdiagnosed. A comparative evaluation with Gemini 2.5 Pro was also conducted, although its contribution was limited to a qualitative comparison based on the same dataset. ChatGPT-4.0 demonstrated moderate accuracy in the histopathological diagnosis of gliomas, with little variability depending on glioma subtype. While its performance highlights potential for future integration into clinical workflows, significant improvements are required to ensure its reliability and effectiveness in diagnostic applications. Trial Registration: ce 165/2015/PO.
期刊介绍:
Neuropathology is an international journal sponsored by the Japanese Society of Neuropathology and publishes peer-reviewed original papers dealing with all aspects of human and experimental neuropathology and related fields of research. The Journal aims to promote the international exchange of results and encourages authors from all countries to submit papers in the following categories: Original Articles, Case Reports, Short Communications, Occasional Reviews, Editorials and Letters to the Editor. All articles are peer-reviewed by at least two researchers expert in the field of the submitted paper.