Exploring the possibilities and limitations of customized large language model to support and improve cervical cancer screening.

IF 3.8 3区医学 Q2 MEDICAL INFORMATICS

BMC Medical Informatics and Decision Making Pub Date : 2025-07-01 DOI:10.1186/s12911-025-03088-3

Viola Angyal, Ádám Bertalan, Péter Domján, Elek Dinya

{"title":"Exploring the possibilities and limitations of customized large language model to support and improve cervical cancer screening.","authors":"Viola Angyal, Ádám Bertalan, Péter Domján, Elek Dinya","doi":"10.1186/s12911-025-03088-3","DOIUrl":null,"url":null,"abstract":"Background: The rapid advancement of artificial intelligence, driven by Generative Pre-trained Transformers (GPT), has transformed natural language processing. Prompt engineering plays a key role in guiding model outputs effectively. Our primary objective was to explore the possibilities and limitations of a custom GPT, developed via prompt engineering, as a patient education tool, which delivers publicly available information through a user-friendly design that facilitates more effective access to cervical cancer screening knowledge.Method: The system was developed using the OpenAI GPT-4 model and Python programming language, with the interface built on Streamlit for cloud-based accessibility and testing. It initially presented questions to testers for preliminary assessment. For cervical cancer-related information, we referenced medical guidelines. Iterative testing optimized the prompts for quality and relevance; techniques like context provision, question chaining, and prompt-based constraints were used. Human-in-the-loop and two independent medical doctor evaluations were employed. Additionally, system performance metrics were measured.Result: The web application was tested 115 times over a three-week period in 2024, with 87 female (76%) and 28 male (24%) participants. A total of 112 users completed the user experience questionnaire. Statistical analysis showed a significant association between age and perceived personalization (p = 0.047) and between gender and system customization (p = 0.037). Younger participants reported higher engagement, though not significantly. Females valued guidance on screening schedules and early detection, while males highlighted the usefulness of information regarding HPV vaccination and its role in preventing HPV-related cancers. Independent evaluations by medical doctors demonstrated consistent assessments of the system's responses in terms of accuracy, clarity, and usefulness.Discussion: While the system demonstrates potential to enhance public health awareness and promote preventive behaviors, encouraging individuals to seek information on cervical cancer screening and HPV vaccination, its conversational capabilities remain constrained by the inherent limitations of current language model technology.Conclusions: Although custom GPTs can not substitute a healthcare consultations, these tools can streamline workflows, expedite information access, and support personalized care. Further research should focus on conducting well-designed randomized controlled trials to establish definitive conclusions regarding its impact and reliability.Clinical trial number: Not applicable.","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"242"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220158/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03088-3","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The rapid advancement of artificial intelligence, driven by Generative Pre-trained Transformers (GPT), has transformed natural language processing. Prompt engineering plays a key role in guiding model outputs effectively. Our primary objective was to explore the possibilities and limitations of a custom GPT, developed via prompt engineering, as a patient education tool, which delivers publicly available information through a user-friendly design that facilitates more effective access to cervical cancer screening knowledge.

Method: The system was developed using the OpenAI GPT-4 model and Python programming language, with the interface built on Streamlit for cloud-based accessibility and testing. It initially presented questions to testers for preliminary assessment. For cervical cancer-related information, we referenced medical guidelines. Iterative testing optimized the prompts for quality and relevance; techniques like context provision, question chaining, and prompt-based constraints were used. Human-in-the-loop and two independent medical doctor evaluations were employed. Additionally, system performance metrics were measured.

Result: The web application was tested 115 times over a three-week period in 2024, with 87 female (76%) and 28 male (24%) participants. A total of 112 users completed the user experience questionnaire. Statistical analysis showed a significant association between age and perceived personalization (p = 0.047) and between gender and system customization (p = 0.037). Younger participants reported higher engagement, though not significantly. Females valued guidance on screening schedules and early detection, while males highlighted the usefulness of information regarding HPV vaccination and its role in preventing HPV-related cancers. Independent evaluations by medical doctors demonstrated consistent assessments of the system's responses in terms of accuracy, clarity, and usefulness.

Discussion: While the system demonstrates potential to enhance public health awareness and promote preventive behaviors, encouraging individuals to seek information on cervical cancer screening and HPV vaccination, its conversational capabilities remain constrained by the inherent limitations of current language model technology.

Conclusions: Although custom GPTs can not substitute a healthcare consultations, these tools can streamline workflows, expedite information access, and support personalized care. Further research should focus on conducting well-designed randomized controlled trials to establish definitive conclusions regarding its impact and reliability.

Clinical trial number: Not applicable.

查看原文本刊更多论文

探讨定制化大语言模型支持和改进宫颈癌筛查的可能性和局限性。

背景：在生成式预训练变形器（GPT）的推动下，人工智能的快速发展已经改变了自然语言处理。提示工程是有效引导模型输出的关键。我们的主要目标是探索通过快速工程开发的定制GPT作为患者教育工具的可能性和局限性，该工具通过用户友好的设计提供公开可用的信息，从而促进更有效地获取宫颈癌筛查知识。方法：采用OpenAI GPT-4模型和Python编程语言开发系统，基于Streamlit构建界面，进行基于云的访问和测试。它首先向测试人员提出问题进行初步评估。关于子宫颈癌的相关信息，我们参考了医学指南。迭代测试优化了提示的质量和相关性；使用了上下文提供、问题链和基于提示的约束等技术。采用了人在循环和两个独立的医生评价。此外，还测量了系统性能指标。结果：该网络应用程序在2024年的三周内测试了115次，参与者中有87名女性（76%）和28名男性（24%）。共有112名用户完成了用户体验问卷。统计分析显示，年龄与感知个性化（p = 0.047）、性别与系统定制（p = 0.037）之间存在显著相关性。年轻的参与者报告了更高的参与度，尽管不是很明显。女性重视关于筛查计划和早期发现的指导，而男性则强调有关HPV疫苗接种信息的有用性及其在预防HPV相关癌症中的作用。医生的独立评估证明了系统在准确性、清晰度和实用性方面的一致评价。讨论：虽然该系统显示出提高公众健康意识和促进预防行为的潜力，鼓励个人寻求宫颈癌筛查和HPV疫苗接种的信息，但其会话能力仍然受到当前语言模型技术固有局限性的限制。结论：尽管定制gpt不能替代医疗咨询，但这些工具可以简化工作流程，加快信息访问，并支持个性化护理。进一步的研究应侧重于进行设计良好的随机对照试验，以确定其影响和可靠性的明确结论。临床试验号：不适用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Informatics and Decision Making 医学-医学：信息

CiteScore

7.20

自引率

5.70%

发文量

297

审稿时长

1 months

期刊介绍： BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.