{"title":"ChatGPT在回答肺癌及其手术患者问题中的准确性和可靠性:由胸外科医生专家小组评估。","authors":"Onur Akçay, Özgür Öztürk, Tuba Acar, Soner Gürsoy","doi":"10.1007/s13187-025-02682-3","DOIUrl":null,"url":null,"abstract":"<p><p>This study aimed to evaluate the accuracy, clarity, and scientific adequacy of ChatGPT's responses to frequently asked patient questions concerning lung cancer and its surgical treatment, through an expert panel of thoracic surgeons. A total of 36 frequently asked questions-20 related to lung cancer and 16 related to lung cancer surgery-were collected from various online sources and clinical experience. These questions were submitted to ChatGPT-4.0 in a single session, and the initial responses were assessed by four experienced thoracic surgeons. Each response was scored independently using a 5-point Likert scale for scientific adequacy, clarity, and accuracy. The mean scores, standard deviations, and word counts were calculated. Inter-group comparisons were conducted using independent-samples t-tests. ChatGPT's responses were rated generally high across all domains. For lung cancer questions, the mean scores were 4.50 ± 0.18 (scientific adequacy), 4.57 ± 0.21 (clarity), and 4.66 ± 0.21 (accuracy), with an average word count of 152.4 ± 36.86. For surgical questions, scores were slightly higher: 4.57 ± 0.31, 4.64 ± 0.26, and 4.73 ± 0.21, respectively, with an average word count of 163.68 ± 35.64. Although the differences were not statistically significant, responses to surgical questions were associated with slightly higher agreement scores. Full scores were achieved in three surgical questions. ChatGPT demonstrated a high degree of reliability and clarity in answering commonly asked patient questions about lung cancer and surgery. While the model can serve as a supportive educational tool, it should not replace personalized physician-patient communication, especially in clinical decision-making processes.</p>","PeriodicalId":50246,"journal":{"name":"Journal of Cancer Education","volume":" ","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Accuracy and Reliability of ChatGPT in Answering Patient Questions About Lung Cancer and Its Surgery: An Expert Panel Evaluation by Thoracic Surgeons.\",\"authors\":\"Onur Akçay, Özgür Öztürk, Tuba Acar, Soner Gürsoy\",\"doi\":\"10.1007/s13187-025-02682-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This study aimed to evaluate the accuracy, clarity, and scientific adequacy of ChatGPT's responses to frequently asked patient questions concerning lung cancer and its surgical treatment, through an expert panel of thoracic surgeons. A total of 36 frequently asked questions-20 related to lung cancer and 16 related to lung cancer surgery-were collected from various online sources and clinical experience. These questions were submitted to ChatGPT-4.0 in a single session, and the initial responses were assessed by four experienced thoracic surgeons. Each response was scored independently using a 5-point Likert scale for scientific adequacy, clarity, and accuracy. The mean scores, standard deviations, and word counts were calculated. Inter-group comparisons were conducted using independent-samples t-tests. ChatGPT's responses were rated generally high across all domains. 
For lung cancer questions, the mean scores were 4.50 ± 0.18 (scientific adequacy), 4.57 ± 0.21 (clarity), and 4.66 ± 0.21 (accuracy), with an average word count of 152.4 ± 36.86. For surgical questions, scores were slightly higher: 4.57 ± 0.31, 4.64 ± 0.26, and 4.73 ± 0.21, respectively, with an average word count of 163.68 ± 35.64. Although the differences were not statistically significant, responses to surgical questions were associated with slightly higher agreement scores. Full scores were achieved in three surgical questions. ChatGPT demonstrated a high degree of reliability and clarity in answering commonly asked patient questions about lung cancer and surgery. While the model can serve as a supportive educational tool, it should not replace personalized physician-patient communication, especially in clinical decision-making processes.</p>\",\"PeriodicalId\":50246,\"journal\":{\"name\":\"Journal of Cancer Education\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cancer Education\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s13187-025-02682-3\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"EDUCATION, SCIENTIFIC DISCIPLINES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cancer Education","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13187-025-02682-3","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"EDUCATION, SCIENTIFIC DISCIPLINES","Score":null,"Total":0}
Accuracy and Reliability of ChatGPT in Answering Patient Questions About Lung Cancer and Its Surgery: An Expert Panel Evaluation by Thoracic Surgeons.
This study aimed to evaluate the accuracy, clarity, and scientific adequacy of ChatGPT's responses to frequently asked patient questions concerning lung cancer and its surgical treatment, as judged by an expert panel of thoracic surgeons. A total of 36 frequently asked questions (20 related to lung cancer and 16 related to lung cancer surgery) were collected from various online sources and clinical experience. These questions were submitted to ChatGPT-4.0 in a single session, and the initial responses were assessed by four experienced thoracic surgeons. Each response was scored independently on a 5-point Likert scale for scientific adequacy, clarity, and accuracy. Mean scores, standard deviations, and word counts were calculated, and inter-group comparisons were conducted using independent-samples t-tests. ChatGPT's responses were rated highly across all domains. For lung cancer questions, the mean scores were 4.50 ± 0.18 (scientific adequacy), 4.57 ± 0.21 (clarity), and 4.66 ± 0.21 (accuracy), with an average word count of 152.4 ± 36.86. For surgical questions, scores were slightly higher: 4.57 ± 0.31, 4.64 ± 0.26, and 4.73 ± 0.21, respectively, with an average word count of 163.68 ± 35.64. Although the differences were not statistically significant, responses to surgical questions received slightly higher agreement scores, and three surgical questions achieved full scores. ChatGPT demonstrated a high degree of reliability and clarity in answering commonly asked patient questions about lung cancer and its surgery. While the model can serve as a supportive educational tool, it should not replace personalized physician-patient communication, especially in clinical decision-making.
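For readers interested in how the reported comparison could be reproduced, the following is a minimal sketch in Python of computing group means, standard deviations, and an independent-samples t-test between the two question sets. The score arrays are hypothetical placeholders rather than the study's data, and scipy.stats.ttest_ind is used here as one standard implementation of the test named in the abstract.

# Illustrative sketch only: the score arrays below are hypothetical placeholders,
# not the study's data; the steps mirror the analysis described in the abstract.
import numpy as np
from scipy import stats

# Hypothetical per-question scores (5-point Likert, averaged across four raters).
lung_cancer_scores = np.array([4.5, 4.75, 4.25, 5.0, 4.5, 4.75, 4.5, 4.25, 5.0, 4.75,
                               4.5, 4.5, 4.75, 5.0, 4.5, 4.25, 4.75, 4.5, 5.0, 4.75])  # 20 questions
surgery_scores = np.array([4.75, 5.0, 4.5, 5.0, 4.75, 4.5, 5.0, 4.75,
                           4.5, 4.75, 5.0, 4.5, 4.75, 5.0, 4.75, 4.5])  # 16 questions

# Group means and sample standard deviations (ddof=1).
for name, scores in [("lung cancer", lung_cancer_scores), ("surgery", surgery_scores)]:
    print(f"{name}: mean = {scores.mean():.2f}, SD = {scores.std(ddof=1):.2f}")

# Independent-samples t-test between the two question groups.
t_stat, p_value = stats.ttest_ind(lung_cancer_scores, surgery_scores)
print(f"t = {t_stat:.2f}, p = {p_value:.3f}")  # p > 0.05 would indicate no statistically significant difference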
Journal description:
The Journal of Cancer Education, the official journal of the American Association for Cancer Education (AACE) and the European Association for Cancer Education (EACE), is an international, quarterly journal dedicated to the publication of original contributions dealing with the varied aspects of cancer education for physicians, dentists, nurses, students, social workers and other allied health professionals, patients, the general public, and anyone interested in effective education about cancer-related issues.
Articles featured include reports of original results of educational research, as well as discussions of current problems and techniques in cancer education. Manuscripts are welcome on such subjects as educational methods, instruments, and program evaluation. Suitable topics include teaching of basic science aspects of cancer; the assessment of attitudes toward cancer patient management; the teaching of diagnostic skills relevant to cancer; the evaluation of undergraduate, postgraduate, or continuing education programs; and articles about all aspects of cancer education from prevention to palliative care.
We encourage contributions to a special column called Reflections; these articles should relate to the human aspects of dealing with cancer, cancer patients, and their families, and to finding meaning and support in these efforts.
Letters to the Editor (600 words or less) dealing with published articles or matters of current interest are also invited.
Also featured are commentary; book and media reviews; and announcements of educational programs, fellowships, and grants.
Articles should be limited to no more than ten double-spaced typed pages, and there should be no more than three tables or figures and 25 references. We also encourage brief reports of five typewritten pages or less, with no more than one figure or table and 15 references.