评估人工智能对莫氏重建术术后问题的反应。

IF 1.4 4区医学 Q3 SURGERY

Facial Plastic Surgery Pub Date : 2025-09-04 DOI:10.1055/a-2689-2685

Areeb Shah, Luke Schwetschenau, Lisa Velez-Velez, Rohun Gupta, Kevin Chen, Collin Chen

{"title":"评估人工智能对莫氏重建术术后问题的反应。","authors":"Areeb Shah, Luke Schwetschenau, Lisa Velez-Velez, Rohun Gupta, Kevin Chen, Collin Chen","doi":"10.1055/a-2689-2685","DOIUrl":null,"url":null,"abstract":"Patients frequently ask questions after Mohs facial reconstruction. AI tools, particularly large language models (LLMs), may optimize this communication.We evaluated four LLMs-Claude AI, ChatGPT, Microsoft Copilot, and Google Gemini-on responses to postoperative questions, hypothesizing variation in quality, accuracy, comprehensiveness, and readability.Prospective observational study following STROBE guidelines.A total of 31 common postoperative questions were created. Each was submitted to all four LLMs using a standardized prompt. Responses were evaluated by blinded facial plastic surgeons using validated scoring tools (EQIP, Likert scales, readability formulas). IRB exemption was granted.Claude AI outperformed others in quality (EQIP: 90.3), accuracy (4.55/5), and comprehensiveness (4.60/5). All LLMs exceeded the recommended 6th-grade reading level.LLMs show potential for supporting postoperative communication, but variation in readability and content depth highlights the continued need for physician oversight.","PeriodicalId":12195,"journal":{"name":"Facial Plastic Surgery","volume":" ","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating AI Responses to Postoperative Questions in Mohs Reconstruction.\",\"authors\":\"Areeb Shah, Luke Schwetschenau, Lisa Velez-Velez, Rohun Gupta, Kevin Chen, Collin Chen\",\"doi\":\"10.1055/a-2689-2685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Patients frequently ask questions after Mohs facial reconstruction. AI tools, particularly large language models (LLMs), may optimize this communication.We evaluated four LLMs-Claude AI, ChatGPT, Microsoft Copilot, and Google Gemini-on responses to postoperative questions, hypothesizing variation in quality, accuracy, comprehensiveness, and readability.Prospective observational study following STROBE guidelines.A total of 31 common postoperative questions were created. Each was submitted to all four LLMs using a standardized prompt. Responses were evaluated by blinded facial plastic surgeons using validated scoring tools (EQIP, Likert scales, readability formulas). IRB exemption was granted.Claude AI outperformed others in quality (EQIP: 90.3), accuracy (4.55/5), and comprehensiveness (4.60/5). All LLMs exceeded the recommended 6th-grade reading level.LLMs show potential for supporting postoperative communication, but variation in readability and content depth highlights the continued need for physician oversight.\",\"PeriodicalId\":12195,\"journal\":{\"name\":\"Facial Plastic Surgery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Facial Plastic Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1055/a-2689-2685\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Facial Plastic Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/a-2689-2685","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}

引用次数: 0

摘要

简介：莫氏面部重建术后患者常见的问题。人工智能工具，特别是大型语言模型（llm），可以优化这种交流。目的与假设：我们评估了四个法学硕士（claude AI、ChatGPT、Microsoft Copilot和谷歌gemini）对术后问题的反应，假设质量、准确性、全面性和可读性存在差异。研究设计：前瞻性观察性研究，遵循STROBE指南。方法：编制31个术后常见问题。每一份报告都通过标准化的提示提交给所有四个法学硕士。由盲法面部整形外科医生使用经过验证的评分工具（EQIP、Likert量表、可读性公式）对反应进行评估。获IRB豁免。结果：Claude AI在质量（EQIP: 90.3）、准确性（4.55/5）和综合性（4.60/5）方面优于其他AI。所有法学硕士都超过了六年级推荐的阅读水平。结论：llm显示了支持术后沟通的潜力，但可读性和内容深度的差异突出了医生监督的持续需求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Evaluating AI Responses to Postoperative Questions in Mohs Reconstruction.

Patients frequently ask questions after Mohs facial reconstruction. AI tools, particularly large language models (LLMs), may optimize this communication.We evaluated four LLMs-Claude AI, ChatGPT, Microsoft Copilot, and Google Gemini-on responses to postoperative questions, hypothesizing variation in quality, accuracy, comprehensiveness, and readability.Prospective observational study following STROBE guidelines.A total of 31 common postoperative questions were created. Each was submitted to all four LLMs using a standardized prompt. Responses were evaluated by blinded facial plastic surgeons using validated scoring tools (EQIP, Likert scales, readability formulas). IRB exemption was granted.Claude AI outperformed others in quality (EQIP: 90.3), accuracy (4.55/5), and comprehensiveness (4.60/5). All LLMs exceeded the recommended 6th-grade reading level.LLMs show potential for supporting postoperative communication, but variation in readability and content depth highlights the continued need for physician oversight.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Facial Plastic Surgery 医学-外科

CiteScore

1.80

自引率

10.00%

发文量

审稿时长

6-12 weeks

期刊介绍： Facial Plastic Surgery is a journal that publishes topic-specific issues covering areas of aesthetic and reconstructive plastic surgery as it relates to the head, neck, and face. The journal''s scope includes issues devoted to scar revision, periorbital and mid-face rejuvenation, facial trauma, facial implants, rhinoplasty, neck reconstruction, cleft palate, face lifts, as well as various other emerging minimally invasive procedures. Authors provide a global perspective on each topic, critically evaluate recent works in the field, and apply it to clinical practice.