Mini Han Wang, Xudong Jiang, Peijin Zeng, Xinyue Li, Kelvin Kam-Lung Chong, Guanghui Hou, Xiaoxiao Fang, Yang Yu, Xiangrong Yu, Junbin Fang, Yi Pan
{"title":"平衡准确性和用户满意度:即时工程在人工智能驱动的医疗保健解决方案中的作用。","authors":"Mini Han Wang, Xudong Jiang, Peijin Zeng, Xinyue Li, Kelvin Kam-Lung Chong, Guanghui Hou, Xiaoxiao Fang, Yang Yu, Xiangrong Yu, Junbin Fang, Yi Pan","doi":"10.3389/frai.2025.1517918","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>The rapid evolution of the Internet of Things (IoT) and Artificial Intelligence (AI) has opened new possibilities for public healthcare. Effective integration of these technologies is essential to ensure precise and efficient healthcare delivery. This study explores the application of IoT-enabled, AI-driven systems for detecting and managing Dry Eye Disease (DED), emphasizing the use of prompt engineering to enhance system performance.</p><p><strong>Methods: </strong>A specialized prompt mechanism was developed utilizing OpenAI GPT-4.0 and ERNIE Bot-4.0 APIs to assess the urgency of medical attention based on 5,747 simulated patient complaints. A Bidirectional Encoder Representations from Transformers (BERT) machine learning model was employed for text classification to differentiate urgent from non-urgent cases. User satisfaction was evaluated through composite scores derived from Service Experiences (SE) and Medical Quality (MQ) assessments.</p><p><strong>Results: </strong>The comparison between prompted and non-prompted queries revealed a significant accuracy increase from 80.1% to 99.6%. However, this improvement was accompanied by a notable rise in response time, resulting in a decrease in SE scores (95.5 to 84.7) but a substantial increase in MQ satisfaction (73.4 to 96.7). These findings indicate a trade-off between accuracy and user satisfaction.</p><p><strong>Discussion: </strong>The study highlights the critical role of prompt engineering in improving AI-based healthcare services. While enhanced accuracy is achievable, careful attention must be given to balancing response time and user satisfaction. Future research should optimize prompt structures, explore dynamic prompting approaches, and prioritize real-time evaluations to address the identified challenges and maximize the potential of IoT-integrated AI systems in medical applications.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1517918"},"PeriodicalIF":3.0000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11865202/pdf/","citationCount":"0","resultStr":"{\"title\":\"Balancing accuracy and user satisfaction: the role of prompt engineering in AI-driven healthcare solutions.\",\"authors\":\"Mini Han Wang, Xudong Jiang, Peijin Zeng, Xinyue Li, Kelvin Kam-Lung Chong, Guanghui Hou, Xiaoxiao Fang, Yang Yu, Xiangrong Yu, Junbin Fang, Yi Pan\",\"doi\":\"10.3389/frai.2025.1517918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>The rapid evolution of the Internet of Things (IoT) and Artificial Intelligence (AI) has opened new possibilities for public healthcare. Effective integration of these technologies is essential to ensure precise and efficient healthcare delivery. This study explores the application of IoT-enabled, AI-driven systems for detecting and managing Dry Eye Disease (DED), emphasizing the use of prompt engineering to enhance system performance.</p><p><strong>Methods: </strong>A specialized prompt mechanism was developed utilizing OpenAI GPT-4.0 and ERNIE Bot-4.0 APIs to assess the urgency of medical attention based on 5,747 simulated patient complaints. A Bidirectional Encoder Representations from Transformers (BERT) machine learning model was employed for text classification to differentiate urgent from non-urgent cases. User satisfaction was evaluated through composite scores derived from Service Experiences (SE) and Medical Quality (MQ) assessments.</p><p><strong>Results: </strong>The comparison between prompted and non-prompted queries revealed a significant accuracy increase from 80.1% to 99.6%. However, this improvement was accompanied by a notable rise in response time, resulting in a decrease in SE scores (95.5 to 84.7) but a substantial increase in MQ satisfaction (73.4 to 96.7). These findings indicate a trade-off between accuracy and user satisfaction.</p><p><strong>Discussion: </strong>The study highlights the critical role of prompt engineering in improving AI-based healthcare services. While enhanced accuracy is achievable, careful attention must be given to balancing response time and user satisfaction. Future research should optimize prompt structures, explore dynamic prompting approaches, and prioritize real-time evaluations to address the identified challenges and maximize the potential of IoT-integrated AI systems in medical applications.</p>\",\"PeriodicalId\":33315,\"journal\":{\"name\":\"Frontiers in Artificial Intelligence\",\"volume\":\"8 \",\"pages\":\"1517918\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-02-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11865202/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/frai.2025.1517918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frai.2025.1517918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
导读:物联网(IoT)和人工智能(AI)的快速发展为公共医疗保健开辟了新的可能性。这些技术的有效集成对于确保精确和高效的医疗保健服务至关重要。本研究探讨了基于物联网、人工智能驱动的干眼病(DED)检测和管理系统的应用,强调使用即时工程来提高系统性能。方法:利用OpenAI GPT-4.0和ERNIE Bot-4.0 api,基于5,747例模拟患者投诉,开发专门的提示机制,对医疗服务的紧急程度进行评估。采用BERT (Bidirectional Encoder Representations from Transformers)机器学习模型进行文本分类,区分紧急和非紧急情况。通过服务体验(SE)和医疗质量(MQ)评估得出的综合分数来评估用户满意度。结果:提示查询和非提示查询的比较显示准确率从80.1%显著提高到99.6%。然而,这种改进伴随着响应时间的显着增加,导致SE分数下降(95.5到84.7),但MQ满意度大幅增加(73.4到96.7)。这些发现表明了准确性和用户满意度之间的权衡。讨论:该研究强调了即时工程在改善基于人工智能的医疗保健服务中的关键作用。虽然提高准确性是可以实现的,但必须仔细注意平衡响应时间和用户满意度。未来的研究应优化提示结构,探索动态提示方法,并优先考虑实时评估,以解决已识别的挑战,并最大限度地发挥物联网集成人工智能系统在医疗应用中的潜力。
Balancing accuracy and user satisfaction: the role of prompt engineering in AI-driven healthcare solutions.
Introduction: The rapid evolution of the Internet of Things (IoT) and Artificial Intelligence (AI) has opened new possibilities for public healthcare. Effective integration of these technologies is essential to ensure precise and efficient healthcare delivery. This study explores the application of IoT-enabled, AI-driven systems for detecting and managing Dry Eye Disease (DED), emphasizing the use of prompt engineering to enhance system performance.
Methods: A specialized prompt mechanism was developed utilizing OpenAI GPT-4.0 and ERNIE Bot-4.0 APIs to assess the urgency of medical attention based on 5,747 simulated patient complaints. A Bidirectional Encoder Representations from Transformers (BERT) machine learning model was employed for text classification to differentiate urgent from non-urgent cases. User satisfaction was evaluated through composite scores derived from Service Experiences (SE) and Medical Quality (MQ) assessments.
Results: The comparison between prompted and non-prompted queries revealed a significant accuracy increase from 80.1% to 99.6%. However, this improvement was accompanied by a notable rise in response time, resulting in a decrease in SE scores (95.5 to 84.7) but a substantial increase in MQ satisfaction (73.4 to 96.7). These findings indicate a trade-off between accuracy and user satisfaction.
Discussion: The study highlights the critical role of prompt engineering in improving AI-based healthcare services. While enhanced accuracy is achievable, careful attention must be given to balancing response time and user satisfaction. Future research should optimize prompt structures, explore dynamic prompting approaches, and prioritize real-time evaluations to address the identified challenges and maximize the potential of IoT-integrated AI systems in medical applications.