Erin E. Maruska, DMD, MPH; Amira Elgreatly, BDS, MS; William Madaio, DMD; Klud Razoky, BDS, NZDREX; Curt Bay, PhD; Ahmed Mahrous, BDS, MS
JADA Foundational Science, volume 4, Article 100044. Published 2025-01-01. doi: 10.1016/j.jfscie.2025.100044. https://www.sciencedirect.com/science/article/pii/S2772414X25000015
Comparing dentist and chatbot answers to dental questions for quality and empathy
Background
Integration of large language models (LLMs) into health care, particularly in patient communication, is a growing trend. This study evaluated the effectiveness of LLM chatbots in addressing dental patient queries compared with responses from human dentists on a public online forum.
Methods
In January 2024, 20 patient questions and responses were randomly sampled from Reddit’s dental advice community. Nine blinded dentists, selected from a dental faculty pool familiar with reading and assessing written communication, assessed the quality and empathy of the ChatGPT-generated responses (Version GPT-3.5, OpenAI). The evaluators rated the information quality of the responses on a 5-point Likert scale (very poor, 1; poor, 2; acceptable, 3; good, 4; very good, 5) and rated empathy on a second 5-point scale (not empathetic, 1; slightly empathetic, 2; moderately empathetic, 3; empathetic, 4; very empathetic, 5). Subsequently, they selected the better response (dentist or artificial intelligence). With each of the 9 evaluators rating the responses to all 20 inquiries, 180 evaluator responses were possible.
Results
The LLM chatbot’s responses were rated as higher in quality and empathy than the human responses. Of the 179 recorded answers (1 was missing) to the question of whether ChatGPT or the dentist gave the better response, 167 (93.3%) favored ChatGPT and 12 (6.7%) favored the dentist (P < .001).
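The counts reported above can be checked with a few lines of arithmetic. The sketch below recomputes the percentages and, as an illustration only, runs an exact two-sided binomial test against a 50% null. The abstract does not name the test the authors used, so the choice of test is an assumption here. It also treats the 179 choices as independent, which ignores clustering by evaluator and by question. The function name `two_sided_binomial_p` is hypothetical.

```python
from math import comb

# Counts taken from the abstract.
N_EVALUATORS = 9
N_QUESTIONS = 20
possible = N_EVALUATORS * N_QUESTIONS  # 180 potential responses
missing = 1
answered = possible - missing          # 179 recorded responses
chose_chatgpt = 167
chose_dentist = 12

pct_chatgpt = round(100 * chose_chatgpt / answered, 1)
pct_dentist = round(100 * chose_dentist / answered, 1)


def two_sided_binomial_p(successes: int, n: int) -> float:
    """Exact two-sided binomial p-value under H0: p = 0.5.

    Assumption: independent trials. The study's ratings are clustered,
    so this is an illustration, not the authors' analysis.
    """
    k = max(successes, n - successes)
    upper_tail = sum(comb(n, i) for i in range(k, n + 1))
    # With p = 0.5 the distribution is symmetric, so double the upper tail.
    return min(1.0, 2 * upper_tail / 2 ** n)


p_value = two_sided_binomial_p(chose_chatgpt, answered)

print(f"ChatGPT preferred: {chose_chatgpt}/{answered} ({pct_chatgpt}%)")
print(f"Dentist preferred: {chose_dentist}/{answered} ({pct_dentist}%)")
print(f"p < .001 under the assumed test: {p_value < 0.001}")
```

Under these assumptions the recomputed percentages match the reported 93.3% and 6.7%, and the p-value falls well below the reported .001 threshold.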
Conclusions
Although subjective variations in assessing quality and empathy may exist, this study suggests that LLM chatbot responses show higher quality and empathy than online dentist responses. The use of LLM chatbots by dentists can enhance patient communication in dental practice owing to their efficiency, empathy, and quality. Further research is needed to determine the full potential of artificial intelligence in dentistry.