Alexander Z Fazilat, Camille Brenac, Danae Kawamoto-Duran, Charlotte E Berry, Jennifer Alyono, Michael T Chang, David T Liu, Zara M Patel, Stéphane Tringali, Derrick C Wan, Maxime Fieux
{"title":"Evaluating the quality and readability of ChatGPT-generated patient-facing medical information in rhinology.","authors":"Alexander Z Fazilat, Camille Brenac, Danae Kawamoto-Duran, Charlotte E Berry, Jennifer Alyono, Michael T Chang, David T Liu, Zara M Patel, Stéphane Tringali, Derrick C Wan, Maxime Fieux","doi":"10.1007/s00405-024-09180-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The artificial intelligence (AI) chatbot ChatGPT has become a major tool for generating responses in healthcare. This study assessed ChatGPT's ability to generate French preoperative patient-facing medical information (PFI) in rhinology at a comparable level to material provided by an academic source, the French Society of Otorhinolaryngology (Société Française d'Otorhinolaryngologie et Chirurgie Cervico-Faciale, SFORL).</p><p><strong>Methods: </strong>ChatGPT and SFORL French preoperative PFI in rhinology were compared by analyzing responses to 16 questions regarding common rhinology procedures: ethmoidectomy, sphenoidotomy, septoplasty, and endonasal dacryocystorhinostomy. Twenty rhinologists assessed the clarity, comprehensiveness, accuracy, and overall quality of the information, while 24 nonmedical individuals analyzed the clarity and overall quality. Six readability formulas were used to compare readability scores.</p><p><strong>Results: </strong>Among rhinologists, no significant difference was found between ChatGPT and SFORL regarding clarity (7.61 ± 0.36 vs. 7.53 ± 0.28; p = 0.485), comprehensiveness (7.32 ± 0.77 vs. 7.58 ± 0.50; p = 0.872), and accuracy (inaccuracies: 60% vs. 40%; p = 0.228), respectively. Non-medical individuals scored the clarity of ChatGPT significantly higher than that of the SFORL (8.16 ± 1.16 vs. 6.32 ± 1.33; p < 0.0001). The non-medical individuals chose ChatGPT as the most informative source significantly more often than rhinologists (62.8% vs. 39.7%, p < 0.001).</p><p><strong>Conclusion: </strong>ChatGPT-generated French preoperative PFI in rhinology was comparable to SFORL-provided PFI regarding clarity, comprehensiveness, accuracy, readability, and overall quality. This study highlights ChatGPT's potential to increase accessibility to high quality PFI and suggests its use by physicians as a complement to academic resources written by learned societies such as the SFORL.</p>","PeriodicalId":11952,"journal":{"name":"European Archives of Oto-Rhino-Laryngology","volume":" ","pages":"1911-1920"},"PeriodicalIF":1.9000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Archives of Oto-Rhino-Laryngology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00405-024-09180-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/26 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: The artificial intelligence (AI) chatbot ChatGPT has become a major tool for generating responses in healthcare. This study assessed ChatGPT's ability to generate French preoperative patient-facing medical information (PFI) in rhinology at a comparable level to material provided by an academic source, the French Society of Otorhinolaryngology (Société Française d'Otorhinolaryngologie et Chirurgie Cervico-Faciale, SFORL).
Methods: ChatGPT and SFORL French preoperative PFI in rhinology were compared by analyzing responses to 16 questions regarding common rhinology procedures: ethmoidectomy, sphenoidotomy, septoplasty, and endonasal dacryocystorhinostomy. Twenty rhinologists assessed the clarity, comprehensiveness, accuracy, and overall quality of the information, while 24 nonmedical individuals analyzed the clarity and overall quality. Six readability formulas were used to compare readability scores.
Results: Among rhinologists, no significant difference was found between ChatGPT and SFORL regarding clarity (7.61 ± 0.36 vs. 7.53 ± 0.28; p = 0.485), comprehensiveness (7.32 ± 0.77 vs. 7.58 ± 0.50; p = 0.872), and accuracy (inaccuracies: 60% vs. 40%; p = 0.228), respectively. Non-medical individuals scored the clarity of ChatGPT significantly higher than that of the SFORL (8.16 ± 1.16 vs. 6.32 ± 1.33; p < 0.0001). The non-medical individuals chose ChatGPT as the most informative source significantly more often than rhinologists (62.8% vs. 39.7%, p < 0.001).
Conclusion: ChatGPT-generated French preoperative PFI in rhinology was comparable to SFORL-provided PFI regarding clarity, comprehensiveness, accuracy, readability, and overall quality. This study highlights ChatGPT's potential to increase accessibility to high quality PFI and suggests its use by physicians as a complement to academic resources written by learned societies such as the SFORL.
目的:人工智能(AI)聊天机器人ChatGPT已成为医疗保健领域生成响应的主要工具。本研究评估了ChatGPT生成法国术前面向患者的鼻科医学信息(PFI)的能力,该信息与学术来源法国耳鼻喉学会(sociac franaise d’Otorhinolaryngology et Chirurgie Cervico-Faciale, SFORL)提供的材料水平相当。方法:通过对16个常见鼻科手术(筛窦切除术、蝶窦切开术、鼻中隔成形术和鼻内泪囊鼻腔造口术)的回答,比较ChatGPT和SFORL French术前鼻科PFI。20名鼻科医生评估信息的清晰度、全面性、准确性和整体质量,而24名非医疗人员分析信息的清晰度和整体质量。6个可读性公式用于比较可读性评分。结果:在鼻科医生中,ChatGPT和SFORL在清晰度方面无显著差异(7.61±0.36 vs. 7.53±0.28;p = 0.485),全面性(7.32±0.77和7.58±0.50;P = 0.872),准确性(不准确性:60% vs. 40%;P = 0.228)。非医学个体ChatGPT的清晰度评分显著高于SFORL(8.16±1.16 vs. 6.32±1.33;结论:chatgpt生成的法国鼻科术前PFI与sforl提供的PFI在清晰度、全面性、准确性、可读性和整体质量方面相当。这项研究强调了ChatGPT在提高高质量PFI可及性方面的潜力,并建议医生将其作为SFORL等学术团体撰写的学术资源的补充。
期刊介绍:
Official Journal of
European Union of Medical Specialists – ORL Section and Board
Official Journal of Confederation of European Oto-Rhino-Laryngology Head and Neck Surgery
"European Archives of Oto-Rhino-Laryngology" publishes original clinical reports and clinically relevant experimental studies, as well as short communications presenting new results of special interest. With peer review by a respected international editorial board and prompt English-language publication, the journal provides rapid dissemination of information by authors from around the world. This particular feature makes it the journal of choice for readers who want to be informed about the continuing state of the art concerning basic sciences and the diagnosis and management of diseases of the head and neck on an international level.
European Archives of Oto-Rhino-Laryngology was founded in 1864 as "Archiv für Ohrenheilkunde" by A. von Tröltsch, A. Politzer and H. Schwartze.