D. Chytas , S. Kanakaris , M. Piagkou , I. Chryssanthou , A.V. Vasiliadis , K. Natsis
{"title":"人工智能能否准确检测和总结解剖学教育文献?ChatGPT与ScholarGPT的比较分析","authors":"D. Chytas , S. Kanakaris , M. Piagkou , I. Chryssanthou , A.V. Vasiliadis , K. Natsis","doi":"10.1016/j.morpho.2025.101061","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Artificial intelligence platforms have been suggested as tools that can facilitate anatomy teachers’ work and students’ learning process. We aimed to investigate the ability of ChatGPT to detect and summarize studies of the anatomy education literature compared to ScholarGPT, a version of ChatGPT specified in academic research. Secondly, we aimed to explore if the ability of each platform is influenced by the level of queries complexity.</div></div><div><h3>Methods</h3><div>We asked the two platforms to list five studies about each of the following three topics: (1) use of virtual reality in anatomy education, (2) use of stereoscopic virtual reality in anatomy education, (3) use of stereoscopic virtual reality in anatomy education, involving user's interaction with the virtual environment. We assessed if the retrieved studies fulfilled the search criteria, and if their summaries were accurate (if they contained true information about all the educational results of the article's abstract).</div></div><div><h3>Results</h3><div>The ChatGPT's percentages of successful detection were 100%, 60% and 0% respectively for the three queries. The percentages of accurate summaries were 60%, 20% and 0% respectively. ScholarGPT performed better, with a percentage of successful detection 100%, 60% and 40% respectively. The percentages of accurate summaries were 80%, 60% and 40% respectively. Both platforms showed bias in favor of the educational intervention.</div></div><div><h3>Conclusions</h3><div>ChatGPT and ScholarGPT are not currently at an adequate level to essentially aid researchers to detect and summarize studies of the anatomy education literature. Ongoing research may increase the ability of those platforms to provide more reliable information.</div></div>","PeriodicalId":39316,"journal":{"name":"Morphologie","volume":"109 367","pages":"Article 101061"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Can artificial intelligence accurately detect and summarize anatomy education literature? A comparative analysis of ChatGPT and ScholarGPT\",\"authors\":\"D. Chytas , S. Kanakaris , M. Piagkou , I. Chryssanthou , A.V. Vasiliadis , K. Natsis\",\"doi\":\"10.1016/j.morpho.2025.101061\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Purpose</h3><div>Artificial intelligence platforms have been suggested as tools that can facilitate anatomy teachers’ work and students’ learning process. We aimed to investigate the ability of ChatGPT to detect and summarize studies of the anatomy education literature compared to ScholarGPT, a version of ChatGPT specified in academic research. Secondly, we aimed to explore if the ability of each platform is influenced by the level of queries complexity.</div></div><div><h3>Methods</h3><div>We asked the two platforms to list five studies about each of the following three topics: (1) use of virtual reality in anatomy education, (2) use of stereoscopic virtual reality in anatomy education, (3) use of stereoscopic virtual reality in anatomy education, involving user's interaction with the virtual environment. We assessed if the retrieved studies fulfilled the search criteria, and if their summaries were accurate (if they contained true information about all the educational results of the article's abstract).</div></div><div><h3>Results</h3><div>The ChatGPT's percentages of successful detection were 100%, 60% and 0% respectively for the three queries. The percentages of accurate summaries were 60%, 20% and 0% respectively. ScholarGPT performed better, with a percentage of successful detection 100%, 60% and 40% respectively. The percentages of accurate summaries were 80%, 60% and 40% respectively. Both platforms showed bias in favor of the educational intervention.</div></div><div><h3>Conclusions</h3><div>ChatGPT and ScholarGPT are not currently at an adequate level to essentially aid researchers to detect and summarize studies of the anatomy education literature. Ongoing research may increase the ability of those platforms to provide more reliable information.</div></div>\",\"PeriodicalId\":39316,\"journal\":{\"name\":\"Morphologie\",\"volume\":\"109 367\",\"pages\":\"Article 101061\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Morphologie\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1286011525001134\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Morphologie","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1286011525001134","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
Can artificial intelligence accurately detect and summarize anatomy education literature? A comparative analysis of ChatGPT and ScholarGPT
Purpose
Artificial intelligence platforms have been suggested as tools that can facilitate anatomy teachers’ work and students’ learning process. We aimed to investigate the ability of ChatGPT to detect and summarize studies of the anatomy education literature compared to ScholarGPT, a version of ChatGPT specified in academic research. Secondly, we aimed to explore if the ability of each platform is influenced by the level of queries complexity.
Methods
We asked the two platforms to list five studies about each of the following three topics: (1) use of virtual reality in anatomy education, (2) use of stereoscopic virtual reality in anatomy education, (3) use of stereoscopic virtual reality in anatomy education, involving user's interaction with the virtual environment. We assessed if the retrieved studies fulfilled the search criteria, and if their summaries were accurate (if they contained true information about all the educational results of the article's abstract).
Results
The ChatGPT's percentages of successful detection were 100%, 60% and 0% respectively for the three queries. The percentages of accurate summaries were 60%, 20% and 0% respectively. ScholarGPT performed better, with a percentage of successful detection 100%, 60% and 40% respectively. The percentages of accurate summaries were 80%, 60% and 40% respectively. Both platforms showed bias in favor of the educational intervention.
Conclusions
ChatGPT and ScholarGPT are not currently at an adequate level to essentially aid researchers to detect and summarize studies of the anatomy education literature. Ongoing research may increase the ability of those platforms to provide more reliable information.
期刊介绍:
Morphologie est une revue universitaire avec une ouverture médicale qui sa adresse aux enseignants, aux étudiants, aux chercheurs et aux cliniciens en anatomie et en morphologie. Vous y trouverez les développements les plus actuels de votre spécialité, en France comme a international. Le objectif de Morphologie est d?offrir des lectures privilégiées sous forme de revues générales, d?articles originaux, de mises au point didactiques et de revues de la littérature, qui permettront notamment aux enseignants de optimiser leurs cours et aux spécialistes d?enrichir leurs connaissances.