Assessing the Performance of Chat Generative Pretrained Transformer (ChatGPT) in Answering Andrology-Related Questions.

Ufuk Caglar, Oguzhan Yildiz, M Fırat Ozervarli, Resat Aydin, Omer Sarilar, Faruk Ozgor, Mazhar Ortac
Journal: Urology research & practice, pp. 365-369. Published: 2023-11-01. Publication type: Journal Article. Subject category: Urology & Nephrology.
DOI: 10.5152/tud.2023.23171 (https://doi.org/10.5152/tud.2023.23171)
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10765186/pdf/

Abstract

Objective: The internet and social media have become primary sources of health information, with men frequently turning to these platforms before seeking professional help. Chat generative pretrained transformer (ChatGPT), an artificial intelligence model developed by OpenAI, has gained popularity as a natural language processing program. The present study evaluated the accuracy and reproducibility of ChatGPT's responses to andrology-related questions.

Methods: The study analyzed frequently asked andrology questions collected from health forums, hospital websites, and social media platforms such as YouTube and Instagram. Questions were grouped by topic, for example male hypogonadism and erectile dysfunction. Questions based on the European Association of Urology (EAU) guideline recommendations were also included. All questions were entered into ChatGPT, and 3 experienced urologists scored each response on a scale of 1 to 4.

Results: Of the 136 questions evaluated, 108 met the criteria. Of these, 87.9% received correct and adequate answers, 9.3% received correct but insufficient answers, and 3 responses (2.8%) contained both correct and incorrect information. No question was answered completely incorrectly. The highest correct answer rates were for disorders of ejaculation, penile curvature, and male hypogonadism. The EAU guideline-based questions achieved a correctness rate of 86.3%. The reproducibility of the answers was over 90%.

Conclusion: The study found that ChatGPT provided accurate and reliable answers to over 80% of andrology-related questions. Although limitations exist, such as potentially outdated data and an inability to understand emotional aspects, ChatGPT shows promise in the health-care sector. Collaborating with health-care professionals during the development of artificial intelligence models could enhance their reliability.
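
The percentages in the Results are simple tallies over graded answers, so the arithmetic can be illustrated with a short sketch. The abstract does not say which number on the 1-to-4 scale corresponds to which outcome, how the 3 raters' scores were combined, or how reproducibility was calculated. The grade labels, the function names, the definition of reproducibility as "same grade on a repeated run", and the toy data below are therefore all assumptions made for illustration, not the authors' method or data.

```python
from collections import Counter

# ASSUMPTION: this number-to-category mapping is hypothetical. The abstract
# names the outcome categories but not which score denotes which one.
GRADE_LABELS = {
    1: "correct and adequate",
    2: "correct but insufficient",
    3: "mix of correct and incorrect",
    4: "completely incorrect",
}


def grade_distribution(grades):
    """Return {grade: percentage of graded answers}, rounded to 1 decimal."""
    if not grades:
        raise ValueError("no grades supplied")
    counts = Counter(grades)
    total = len(grades)
    return {g: round(100 * counts.get(g, 0) / total, 1) for g in GRADE_LABELS}


def reproducibility(first_run, second_run):
    """Percentage of questions that received the same grade in both runs.

    ASSUMPTION: this is one plausible definition of reproducibility; the
    abstract does not define the measure.
    """
    if not first_run or len(first_run) != len(second_run):
        raise ValueError("runs must be non-empty and of equal length")
    same = sum(a == b for a, b in zip(first_run, second_run))
    return round(100 * same / len(first_run), 1)


# Toy grades for 10 questions asked twice. NOT the study's data.
run_a = [1, 1, 1, 2, 1, 3, 1, 1, 2, 1]
run_b = [1, 1, 1, 2, 1, 1, 1, 1, 2, 1]

dist = grade_distribution(run_a)
repro = reproducibility(run_a, run_b)

for grade, pct in dist.items():
    print(f"{GRADE_LABELS[grade]}: {pct}%")
print(f"reproducibility: {repro}%")
```

With the toy data, 7 of 10 answers fall in the first category and 9 of 10 grades match across runs. The same tally applied to a real score sheet would yield figures of the kind reported in the Results.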
