Vittorio De Vita, Bianca Destro Castaniti, Mariapia Vassalli, Lorenzo De Mori, Doriana Lacalaprice, Emanuele Arcà, Antonio Cristiano, Chiara Battipaglia, Pietro Eric Risuleo, Tommaso Dionisi, Francesco Andrea Causio
{"title":"复杂临床病例中大型语言模型推理的临床评估。","authors":"Vittorio De Vita, Bianca Destro Castaniti, Mariapia Vassalli, Lorenzo De Mori, Doriana Lacalaprice, Emanuele Arcà, Antonio Cristiano, Chiara Battipaglia, Pietro Eric Risuleo, Tommaso Dionisi, Francesco Andrea Causio","doi":"10.1701/4573.45794","DOIUrl":null,"url":null,"abstract":"<p><p>Large language models (LLMs) show promise in explicit reasoning for complex medical fields like psychiatry. This study assessed the clinical validity of Gemini's chain-of-thought (CoT) reasoning in 10 complex psychiatric cases, evaluated by specialists using six metrics. Results indicate high performance (average score ≥4.26/5), especially in step sufficiency and factual accuracy, suggesting that CoT reasoning by LLMs can support transparent and detailed clinical decision-making.</p>","PeriodicalId":20887,"journal":{"name":"Recenti progressi in medicina","volume":"116 10","pages":"599-600"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Valutazione del ragionamento clinico dei reasoning large language models su casi clinici complessi.\",\"authors\":\"Vittorio De Vita, Bianca Destro Castaniti, Mariapia Vassalli, Lorenzo De Mori, Doriana Lacalaprice, Emanuele Arcà, Antonio Cristiano, Chiara Battipaglia, Pietro Eric Risuleo, Tommaso Dionisi, Francesco Andrea Causio\",\"doi\":\"10.1701/4573.45794\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Large language models (LLMs) show promise in explicit reasoning for complex medical fields like psychiatry. This study assessed the clinical validity of Gemini's chain-of-thought (CoT) reasoning in 10 complex psychiatric cases, evaluated by specialists using six metrics. Results indicate high performance (average score ≥4.26/5), especially in step sufficiency and factual accuracy, suggesting that CoT reasoning by LLMs can support transparent and detailed clinical decision-making.</p>\",\"PeriodicalId\":20887,\"journal\":{\"name\":\"Recenti progressi in medicina\",\"volume\":\"116 10\",\"pages\":\"599-600\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Recenti progressi in medicina\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1701/4573.45794\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Recenti progressi in medicina","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1701/4573.45794","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
Valutazione del ragionamento clinico dei reasoning large language models su casi clinici complessi.
Large language models (LLMs) show promise in explicit reasoning for complex medical fields like psychiatry. This study assessed the clinical validity of Gemini's chain-of-thought (CoT) reasoning in 10 complex psychiatric cases, evaluated by specialists using six metrics. Results indicate high performance (average score ≥4.26/5), especially in step sufficiency and factual accuracy, suggesting that CoT reasoning by LLMs can support transparent and detailed clinical decision-making.
期刊介绍:
Giunta ormai al sessantesimo anno, Recenti Progressi in Medicina continua a costituire un sicuro punto di riferimento ed uno strumento di lavoro fondamentale per l"ampliamento dell"orizzonte culturale del medico italiano. Recenti Progressi in Medicina è una rivista di medicina interna. Ciò significa il recupero di un"ottica globale e integrata, idonea ad evitare sia i particolarismi della informazione specialistica sia la frammentazione di quella generalista.