{"title":"Can ChatGPT be trusted? Evaluating AI responses to oral health questions among pregnant Arabic-speaking women.","authors":"Khalid Talal Aboalshamat, Jomanh Humied Alnafei, Lojain Ahmed Alkhattabi, Ghadi Yaqoub Alhawsawi, Shrooq Majed Alahmadi, Shatha Omar Almalki, Afnan Anas Nassar","doi":"10.1186/s12903-025-06909-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>ChatGPT, an artificial intelligence (AI) chatbot developed by OpenAI, is increasingly being used in healthcare, including dentistry, for patient education; this study aimed to assess the usability and quality of ChatGPT's responses to pregnancy-related oral health queries in Saudi Arabia.</p><p><strong>Method: </strong>This two-part cross-sectional study assessed pregnant Arabic women's perceptions of ChatGPT for oral health queries and evaluated its responses using an online questionnaire. Responses from ChatGPT-4o mini were rated by 5 dental experts with regard to accuracy, clarity, relevance, and acceptance using a 5-point Likert scale.</p><p><strong>Results: </strong>Among the 300 participants, 42.0% (126) knew about ChatGPT, 33.7% (101) had previously used it, 14.3% (43) had used it to obtain medical information, 8.7% (26) had used it for dental information, and 8.3% (25) had used it for dental information during pregnancy. Attitudes regarding ChatGPT were rated from 1 to 4. Except for 1 item, the means were all above the midpoint. Attitude ratings ranged from a mean of 2.71 (SD 0.76) for ChatGPT competency to a mean of 2.34 (SD 0.92) for its ability to replace human interactions. However, ChatGPT competency (P = .028), security (P = .015), willingness to use ChatGPT for inquiries (P = .021), ability to assist in informed decision-making (P = .01), willingness to make decisions based on recommendations (P = .024), and persuasiveness (P = .049) were significantly different based on educational level. Pregnant women with higher levels of education rated these aspects significantly lower than those with a high school diploma or bachelor's degree.</p><p><strong>Conclusion: </strong>ChatGPT provided useful oral health information for pregnant individuals, but its responses required revision and supervision by health professionals. Its usage among pregnant women in Saudi Arabia remained low.</p>","PeriodicalId":9072,"journal":{"name":"BMC Oral Health","volume":"25 1","pages":"1597"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12513007/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Oral Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12903-025-06909-z","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
Citations: 0
Abstract
Background: ChatGPT, an artificial intelligence (AI) chatbot developed by OpenAI, is increasingly used in healthcare, including dentistry, for patient education. This study aimed to assess the usability and quality of ChatGPT's responses to pregnancy-related oral health queries in Saudi Arabia.
Method: This two-part cross-sectional study assessed pregnant Arabic-speaking women's perceptions of ChatGPT for oral health queries using an online questionnaire and evaluated the chatbot's responses. Responses from ChatGPT-4o mini were rated by five dental experts for accuracy, clarity, relevance, and acceptance on a 5-point Likert scale.
Results: Among the 300 participants, 42.0% (126) knew about ChatGPT, 33.7% (101) had previously used it, 14.3% (43) had used it to obtain medical information, 8.7% (26) had used it for dental information, and 8.3% (25) had used it for dental information during pregnancy. Attitudes regarding ChatGPT were rated on a scale from 1 to 4, and except for one item, all means were above the scale midpoint. The highest-rated item was ChatGPT competency (mean 2.71, SD 0.76) and the lowest was its ability to replace human interactions (mean 2.34, SD 0.92). However, ratings of ChatGPT competency (P = .028), security (P = .015), willingness to use ChatGPT for inquiries (P = .021), its ability to assist in informed decision-making (P = .01), willingness to make decisions based on its recommendations (P = .024), and its persuasiveness (P = .049) differed significantly by educational level: pregnant women with higher levels of education rated these aspects significantly lower than those with a high school diploma or bachelor's degree.
Conclusion: ChatGPT provided useful oral health information for pregnant individuals, but its responses required revision and supervision by health professionals. Its usage among pregnant women in Saudi Arabia remained low.
About the journal:
BMC Oral Health is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of disorders of the mouth, teeth and gums, as well as related molecular genetics, pathophysiology, and epidemiology.