Serhat Gurbuz, Bulent Karslioglu, Ahmet Keskin, Niyazi Igde, Mustafa Bugra Ayaz, Yunus Imren
Journal of Knee Surgery. Published 2025-09-19. Publication type: Journal Article. DOI: 10.1055/a-2693-0756 (https://doi.org/10.1055/a-2693-0756). Impact factor: 1.6; JCR: Q3 (Orthopedics).
Comparative Efficacy of ChatGPT and Gemini in Addressing Patient Queries on Gonarthrosis and Total Knee Arthroplasty: A Randomized Controlled Trial.
The emergence of artificial intelligence (AI) in health care has created novel opportunities for enhancing patient education and alleviating anxiety. This study seeks to evaluate the effectiveness of two leading AI platforms, ChatGPT and Gemini, in delivering accurate and satisfactory responses to patients with gonarthrosis who are considering total knee arthroplasty (TKA). A prospective, randomized controlled trial was conducted involving 100 patients diagnosed with gonarthrosis and indicated for TKA. Each patient posed five questions regarding the surgery and postoperative rehabilitation to both ChatGPT and Gemini. Responses were evaluated by two blinded orthopaedic specialists on a 10-point scale for accuracy and patient satisfaction. Patients additionally evaluated their satisfaction with each response using a 10-point scale. The main outcome measures consisted of the average accuracy scores assessed by specialists and the average satisfaction scores reported by patients. Statistical analysis revealed significant differences between ChatGPT and Gemini in both accuracy and patient satisfaction (p < 0.001). ChatGPT demonstrated better performance with a mean accuracy score of 8.7 ± 0.9 compared with Gemini's 7.2 ± 1.1. Patient satisfaction scores aligned with expert evaluations, with ChatGPT achieving a mean satisfaction score of 8.9 ± 0.8 versus Gemini's 7.5 ± 1.2. Notably, ChatGPT excelled in providing comprehensive explanations of surgical procedures (mean score: 9.2 ± 0.7) and postoperative care (9.1 ± 0.8), whereas Gemini performed better in offering concise summaries of recovery timelines (8.4 ± 0.9). This study demonstrates that ChatGPT offers more accurate and satisfactory responses to patient queries regarding gonarthrosis and TKA compared with Gemini. The findings suggest that AI platforms, particularly ChatGPT, can serve as valuable tools in augmenting patient education and potentially reducing preoperative anxiety. Future studies should investigate the incorporation of AI-assisted information delivery into clinical practice and its long-term effects on patient outcomes.
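The abstract reports means and standard deviations but no effect size, and it does not name the statistical test used. As a rough reading aid, the sketch below computes a standardized mean difference (Cohen's d) from the reported summary statistics alone. It is an illustration, not the authors' analysis: it assumes the two sets of scores can be treated as independent groups of equal size, whereas each patient actually queried both platforms, so a paired analysis would need per-patient differences that the abstract does not provide. The function name `cohens_d` is a hypothetical helper.

```python
import math


def cohens_d(mean_a: float, sd_a: float, mean_b: float, sd_b: float) -> float:
    """Standardized mean difference for two equal-sized groups.

    Assumption: groups are treated as independent, so the denominator is
    the root-mean-square of the two standard deviations.
    """
    pooled_sd = math.sqrt((sd_a ** 2 + sd_b ** 2) / 2)
    return (mean_a - mean_b) / pooled_sd


# Summary statistics as reported in the abstract (ChatGPT vs. Gemini).
accuracy_d = cohens_d(8.7, 0.9, 7.2, 1.1)
satisfaction_d = cohens_d(8.9, 0.8, 7.5, 1.2)

print(f"accuracy d = {accuracy_d:.2f}")          # prints "accuracy d = 1.49"
print(f"satisfaction d = {satisfaction_d:.2f}")  # prints "satisfaction d = 1.37"
```

Under these assumptions both differences exceed one pooled standard deviation, which is conventionally described as a large effect; the true paired effect size could differ in either direction.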
About the journal:
The Journal of Knee Surgery covers a range of issues relating to the orthopaedic techniques of arthroscopy, arthroplasty, and reconstructive surgery of the knee joint. In addition to original peer-reviewed articles, this periodical provides details on emerging surgical techniques, as well as reviews and special focus sections. Topics of interest include cruciate ligament repair and reconstruction, bone grafting, cartilage regeneration, and magnetic resonance imaging.