Healthcare Informatics Research最新文献

筛选
英文 中文
Large Language Models for Pre-mediation Counseling in Medical Disputes: A Comparative Evaluation against Human Experts. 医疗纠纷调解前咨询的大型语言模型:与人类专家的比较评价。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.200
Min Seo Kim, Jung Su Lee, Hyuna Bae
{"title":"Large Language Models for Pre-mediation Counseling in Medical Disputes: A Comparative Evaluation against Human Experts.","authors":"Min Seo Kim, Jung Su Lee, Hyuna Bae","doi":"10.4258/hir.2025.31.2.200","DOIUrl":"10.4258/hir.2025.31.2.200","url":null,"abstract":"<p><strong>Objectives: </strong>Assessing medical disputes requires both medical and legal expertise, presenting challenges for patients seeking clarity regarding potential malpractice claims. This study aimed to develop and evaluate a chatbot based on a chain-of-thought pipeline using a large language model (LLM) for providing medical dispute counseling and compare its performance with responses from human experts.</p><p><strong>Methods: </strong>Retrospective counseling cases (n = 279) were collected from the Korea Medical Dispute Mediation and Arbitration Agency's website, from which 50 cases were randomly selected as a validation dataset. The Claude 3.5 Sonnet model processed each counseling request through a five-step chain-of-thought pipeline. Thirty-eight experts evaluated the chatbot's responses against the original human expert responses, rating them across four dimensions on a 5-point Likert scale. Statistical analyses were conducted using Wilcoxon signed-rank tests.</p><p><strong>Results: </strong>The chatbot significantly outperformed human experts in quality of information (p < 0.001), understanding and reasoning (p < 0.001), and overall satisfaction (p < 0.001). It also demonstrated a stronger tendency to produce opinion-driven content (p < 0.001). Despite generally high scores, evaluators noted specific instances where the chatbot encountered difficulties.</p><p><strong>Conclusions: </strong>A chain-of-thought-based LLM chatbot shows promise for enhancing the quality of medical dispute counseling, outperforming human experts across key evaluation metrics. Future research should address inaccuracies resulting from legal and contextual variability, investigate patient acceptance, and further refine the chatbot's performance in domain-specific applications.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"200-208"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086436/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Agent Approach for Sepsis Management. 脓毒症管理的多药物方法。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.209
Victor Iapascurta, Ion Fiodorov, Adrian Belii, Viorel Bostan
{"title":"Multi-Agent Approach for Sepsis Management.","authors":"Victor Iapascurta, Ion Fiodorov, Adrian Belii, Viorel Bostan","doi":"10.4258/hir.2025.31.2.209","DOIUrl":"10.4258/hir.2025.31.2.209","url":null,"abstract":"<p><strong>Objectives: </strong>The high incidence of sepsis necessitates the development of practical decision-making tools for intensivists, especially during the early, critical phases of management. This study evaluates a multi-agent system intended to assist clinicians with antibiotic therapy and adherence to current sepsis management guidelines before diagnostic results become available.</p><p><strong>Methods: </strong>A multi-agent system incorporating three specialized agents was developed: a sepsis management agent, an antibiotic recommendation agent, and a sepsis guidelines compliance agent. A sepsis case from the MIMIC IV database, organized as a clinical vignette, was used to integrate and test these agents for generating management recommendations. The system leverages retrieval-augmented generation to improve decision-making through the integration of current literature and guidelines.</p><p><strong>Results: </strong>The application produced management recommendations for a sepsis case associated with pneumonia, including early initiation of broad-spectrum antibiotics and close monitoring for clinical deterioration. Two expert intensivists evaluated these recommendations as \"acceptable\" and reported moderate interrater agreement (Cohen's kappa = 0.622, p = 0.003) across various aspects of recommendation usefulness.</p><p><strong>Conclusions: </strong>The multi-agent system shows promise in enhancing decision-making for sepsis management by optimizing antibiotic therapy and ensuring guideline compliance. However, reliance on a single case study limits the generalizability of the findings, highlighting the need for broader validation in diverse clinical settings to improve patient outcomes.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"209-214"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086443/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Symptom and Sentiment Analysis of Older People with Cancer and Caregivers: A Text Mining Approach Using Korean Social Media Data. 老年癌症患者和护理者的症状和情绪分析:使用韩国社交媒体数据的文本挖掘方法。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.175
Kyunghwa Lee, Soomin Hong
{"title":"Symptom and Sentiment Analysis of Older People with Cancer and Caregivers: A Text Mining Approach Using Korean Social Media Data.","authors":"Kyunghwa Lee, Soomin Hong","doi":"10.4258/hir.2025.31.2.175","DOIUrl":"10.4258/hir.2025.31.2.175","url":null,"abstract":"<p><strong>Objectives: </strong>This study examined the symptoms and emotions expressed by older adults with cancer and their caregivers in South Korean online cancer communities. It aimed to identify narrative patterns and provide insights to inform personalized care strategies.</p><p><strong>Methods: </strong>We analyzed 6,908 user-generated posts collected from major online cancer communities in South Korea. Keyword frequency analysis, term frequency-inverse document frequency, 2-gram analysis, and latent Dirichlet allocation-based topic modeling were applied to explore language patterns. Sentiment analysis identified 12 emotional categories, and Pearson correlation coefficients were calculated to examine associations between symptoms and emotional expressions. All data were cleaned and standardized prior to analysis.</p><p><strong>Results: </strong>Many users expressed anxiety (20.63%) and depression (19.59%), frequently associated with chemotherapy and sleep disturbances. Among reported symptoms, sleep problems carried the highest negative sentiment (79.81%), underscoring their profound impact on well-being. Topic modeling consistently revealed seven recurring themes, including treatment decision-making, symptom management, and concerns about family, demonstrating the layered and personalized experiences of older cancer patients and their caregivers.</p><p><strong>Conclusions: </strong>This study explored treatment-related and symptom-related difficulties faced by older adults with cancer. Many reported significant emotional strain, especially anxiety, depression, and sleep disturbances. These findings highlight the necessity for supportive strategies addressing both psychological and physical aspects of care. Future research could investigate the utility of large language models in analyzing these narratives, provided the data is ethically managed and appropriate for such use.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"175-188"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086434/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144092920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Consensus on the Potential of Large Language Models in Healthcare: Insights from a Delphi Survey in Korea. 关于医疗保健中大型语言模型潜力的共识:来自韩国德尔菲调查的见解。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.146
Ah-Ram Sul, Seihee Kim
{"title":"Consensus on the Potential of Large Language Models in Healthcare: Insights from a Delphi Survey in Korea.","authors":"Ah-Ram Sul, Seihee Kim","doi":"10.4258/hir.2025.31.2.146","DOIUrl":"10.4258/hir.2025.31.2.146","url":null,"abstract":"<p><strong>Objectives: </strong>Given the rapidly growing expectations for large language models (LLMs) in healthcare, this study systematically collected perspectives from Korean experts on the potential benefits and risks of LLMs, aiming to promote their safe and effective utilization.</p><p><strong>Methods: </strong>A web-based mini-Delphi survey was conducted from August 27 to October 14, 2024, with 20 selected panelists. The expert questionnaire comprised 84 judgment items across five domains: potential applications, benefits, risks, reliability requirements, and safe usage. These items were developed through a literature review and expert consultation. Participants rated their agreement or perceived importance on a 5-point scale. Items meeting predefined thresholds (content validity ratio ≥0.49, degree of convergence ≤0.50, and degree of consensus ≥0.75) were prioritized.</p><p><strong>Results: </strong>Seventeen participants (85%) responded to the first round, and 16 participants (80%) completed the second round. Consensus was achieved on several potential applications, benefits, and reliability requirements for the use of LLMs in healthcare. However, significant heterogeneity was found regarding perceptions of associated risks and criteria for safe usage of LLMs. Of the 84 total items, 52 met the criteria for statistical validity, confirming the diversity of expert opinions.</p><p><strong>Conclusions: </strong>Experts reached a consensus on certain aspects of LLM utilization in healthcare. Nonetheless, notable differences remained concerning risks and requirements for safe implementation, highlighting the need for further investigation. This study provides foundational insights to guide future research and inform policy development for the responsible introduction of LLMs into the healthcare field.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"146-155"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086437/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative AI-Based Nursing Diagnosis and Documentation Recommendation Using Virtual Patient Electronic Nursing Record Data. 基于生成人工智能的护理诊断和基于虚拟患者电子护理记录数据的文献推荐。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.156
Hongshin Ju, Minsul Park, Hyeonsil Jeong, Youngjin Lee, Hyeoneui Kim, Mihyeon Seong, Dongkyun Lee
{"title":"Generative AI-Based Nursing Diagnosis and Documentation Recommendation Using Virtual Patient Electronic Nursing Record Data.","authors":"Hongshin Ju, Minsul Park, Hyeonsil Jeong, Youngjin Lee, Hyeoneui Kim, Mihyeon Seong, Dongkyun Lee","doi":"10.4258/hir.2025.31.2.156","DOIUrl":"10.4258/hir.2025.31.2.156","url":null,"abstract":"<p><strong>Objectives: </strong>Nursing documentation consumes approximately 30% of nurses' professional time, making improvements in efficiency essential for patient safety and workflow optimization. This study compares traditional nursing documentation methods with a generative artificial intelligence (AI)-based system, evaluating its effectiveness in reducing documentation time and ensuring the accuracy of AI-suggested entries. Furthermore, the study aims to assess the system's impact on overall documentation efficiency and quality.</p><p><strong>Methods: </strong>Forty nurses with a minimum of 6 months of clinical experience participated. In the pre-assessment phase, they documented a nursing scenario using traditional electronic nursing records (ENRs). In the post-assessment phase, they used the SmartENR AI version, developed with OpenAI's ChatGPT 4.0 API and customized for domestic nursing standards; it supports NANDA, SOAPIE, Focus DAR, and narrative formats. Documentation was evaluated on a 5-point scale for accuracy, comprehensiveness, usability, ease of use, and fluency.</p><p><strong>Results: </strong>Participants averaged 64 months of clinical experience. Traditional documentation required 467.18 ± 314.77 seconds, whereas AI-assisted documentation took 182.68 ± 99.71 seconds, reducing documentation time by approximately 40%. AI-generated documentation received scores of 3.62 ± 1.29 for accuracy, 4.13 ± 1.07 for comprehensiveness, 3.50 ± 0.93 for usability, 4.80 ± 0.61 for ease of use, and 4.50 ± 0.88 for fluency.</p><p><strong>Conclusions: </strong>Generative AI substantially reduces the nursing documentation workload and increases efficiency. Nevertheless, further refinement of AI models is necessary to improve accuracy and ensure seamless integration into clinical practice with minimal manual modifications. This study underscores AI's potential to improve nursing documentation efficiency and accuracy in future clinical settings.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"156-165"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086439/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative Pre-trained Transformer: Trends, Applications, Strengths and Challenges in Dentistry: A Systematic Review. 生成预训练变压器:趋势,应用,优势和挑战在牙科:系统回顾。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.189
Sibyl Siluvai, Vivek Narayanan, Vinoo Subramaniam Ramachandran, Victor Rakesh Lazar
{"title":"Generative Pre-trained Transformer: Trends, Applications, Strengths and Challenges in Dentistry: A Systematic Review.","authors":"Sibyl Siluvai, Vivek Narayanan, Vinoo Subramaniam Ramachandran, Victor Rakesh Lazar","doi":"10.4258/hir.2025.31.2.189","DOIUrl":"10.4258/hir.2025.31.2.189","url":null,"abstract":"<p><strong>Objectives: </strong>The integration of large language models (LLMs), particularly those based on the generative pre-trained transformer (GPT) architecture, has begun to revolutionize various fields, including dentistry. Despite these promising applications, the use of GPT in dentistry presents several challenges. Ongoing research and the development of robust ethical frameworks are essential to mitigate these issues and enhance the responsible deployment of GPT technologies in clinical settings. Hence, this systematic review aims to explore the trends, applications, strengths, and challenges associated with the use of GPT in dentistry.</p><p><strong>Methods: </strong>Articles were selected if they contained detailed information on the application of GPT in dentistry. The search strategy used in systematic reviews follows the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Our search of databases and other sources yielded a total of 704 studies. After removing duplicates and conducting a full-text screening, 16 articles were included in the review. The methodological quality of the research was evaluated using the Critical Appraisal Skills Programme (CASP) checklist.</p><p><strong>Results: </strong>Out of a total of 91 articles published on GPT in dentistry, 20 were editorials and 11 were narrative reviews; these were excluded, leaving 60 original research articles for further analysis. The articles were assessed based on the type of results they provided. Ultimately, 16 articles that reported positive findings with robust methodology were included in this review.</p><p><strong>Conclusions: </strong>The results highlight mixed responses; therefore, further research on integration into clinical workflows must be conducted with extensive methodological rigor.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"189-199"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086441/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Language Models in Medicine. 医学中的大型语言模型。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.111
Jinwook Choi
{"title":"Large Language Models in Medicine.","authors":"Jinwook Choi","doi":"10.4258/hir.2025.31.2.111","DOIUrl":"10.4258/hir.2025.31.2.111","url":null,"abstract":"","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"111-113"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086435/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Language Models in Medicine: Clinical Applications, Technical Challenges, and Ethical Considerations. 医学中的大型语言模型:临床应用、技术挑战和伦理考虑。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.114
Kyu-Hwan Jung
{"title":"Large Language Models in Medicine: Clinical Applications, Technical Challenges, and Ethical Considerations.","authors":"Kyu-Hwan Jung","doi":"10.4258/hir.2025.31.2.114","DOIUrl":"10.4258/hir.2025.31.2.114","url":null,"abstract":"<p><strong>Objectives: </strong>This study presents a comprehensive review of the clinical applications, technical challenges, and ethical considerations associated with using large language models (LLMs) in medicine.</p><p><strong>Methods: </strong>A literature survey of peer-reviewed articles, technical reports, and expert commentary from relevant medical and artificial intelligence journals was conducted. Key clinical application areas, technical limitations (e.g., accuracy, validation, transparency), and ethical issues (e.g., bias, safety, accountability, privacy) were identified and analyzed.</p><p><strong>Results: </strong>LLMs have potential in clinical documentation assistance, decision support, patient communication, and workflow optimization. The level of supporting evidence varies; documentation support applications are relatively mature, whereas autonomous diagnostics continue to face notable limitations regarding accuracy and validation. Key technical challenges include model hallucination, lack of robust clinical validation, integration issues, and limited transparency. Ethical concerns involve algorithmic bias risking health inequities, threats to patient safety from inaccuracies, unclear accountability, data privacy, and impacts on clinician-patient interactions.</p><p><strong>Conclusions: </strong>LLMs possess transformative potential for clinical medicine, particularly by augmenting clinician capabilities. However, substantial technical and ethical hurdles necessitate rigorous research, validation, clearly defined guidelines, and human oversight. Existing evidence supports an assistive rather than autonomous role, mandating careful, evidence-based integration that prioritizes patient safety and equity.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"114-124"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086438/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancing Korean Medical Large Language Models: Automated Pipeline for Korean Medical Preference Dataset Construction. 推进韩国医疗大语言模型:韩国医疗偏好数据集构建的自动化管道。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.166
Jean Seo, Sumin Park, Sungjoo Byun, Jinwook Choi, Jinho Choi, Hyopil Shin
{"title":"Advancing Korean Medical Large Language Models: Automated Pipeline for Korean Medical Preference Dataset Construction.","authors":"Jean Seo, Sumin Park, Sungjoo Byun, Jinwook Choi, Jinho Choi, Hyopil Shin","doi":"10.4258/hir.2025.31.2.166","DOIUrl":"10.4258/hir.2025.31.2.166","url":null,"abstract":"<p><strong>Objectives: </strong>Developing large language models (LLMs) in biomedicine requires access to high-quality training and alignment tuning datasets. However, publicly available Korean medical preference datasets are scarce, hindering the advancement of Korean medical LLMs. This study constructs and evaluates the efficacy of the Korean Medical Preference Dataset (KoMeP), an alignment tuning dataset constructed with an automated pipeline, minimizing the high costs of human annotation.</p><p><strong>Methods: </strong>KoMeP was generated using the DAHL score, an automated hallucination evaluation metric. Five LLMs (Dolly-v2-3B, MPT-7B, GPT-4o, Qwen-2-7B, Llama-3-8B) produced responses to 8,573 biomedical examination questions, from which 5,551 preference pairs were extracted. Each pair consisted of a \"chosen\" response and a \"rejected\" response, as determined by their DAHL scores. The dataset was evaluated when trained through two different alignment tuning methods, direct preference optimization (DPO) and odds ratio preference optimization (ORPO) respectively across five different models. The KorMedMCQA benchmark was employed to assess the effectiveness of alignment tuning.</p><p><strong>Results: </strong>Models trained with DPO consistently improved KorMedMCQA performance; notably, Llama-3.1-8B showed a 43.96% increase. In contrast, ORPO training produced inconsistent results. Additionally, English-to-Korean transfer learning proved effective, particularly for English-centric models like Gemma-2, whereas Korean-to-English transfer learning achieved limited success. Instruction tuning with KoMeP yielded mixed outcomes, which suggests challenges in dataset formatting.</p><p><strong>Conclusions: </strong>KoMeP is the first publicly available Korean medical preference dataset and significantly improves alignment tuning performance in LLMs. The DPO method outperforms ORPO in alignment tuning. Future work should focus on expanding KoMeP, developing a Korean-native dataset, and refining alignment tuning methods to produce safer and more reliable Korean medical LLMs.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"166-174"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086433/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing an Explainable Artificial Intelligence System for the Mobile-Based Diagnosis of Febrile Diseases Using Random Forest, LIME, and GPT. 利用随机森林、LIME和GPT开发可解释的温病移动诊断人工智能系统。
IF 2.3
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI: 10.4258/hir.2025.31.2.125
Kingsley F Attai, Constance Amannah, Moses Ekpenyong, Daniel E Asuquo, Oryina K Akputu, Okure U Obot, Peterben C Ajuga, Jeremiah C Obi, Omosivie Maduka, Christie Akwaowo, Faith-Michael Uzoka
{"title":"Developing an Explainable Artificial Intelligence System for the Mobile-Based Diagnosis of Febrile Diseases Using Random Forest, LIME, and GPT.","authors":"Kingsley F Attai, Constance Amannah, Moses Ekpenyong, Daniel E Asuquo, Oryina K Akputu, Okure U Obot, Peterben C Ajuga, Jeremiah C Obi, Omosivie Maduka, Christie Akwaowo, Faith-Michael Uzoka","doi":"10.4258/hir.2025.31.2.125","DOIUrl":"10.4258/hir.2025.31.2.125","url":null,"abstract":"<p><strong>Objectives: </strong>This study proposes a mobile-based explainable artificial intelligence (XAI) platform designed for diagnosing febrile illnesses.</p><p><strong>Methods: </strong>We integrated the interpretability offered by local interpretable model-agnostic explanations (LIME) and the explainability provided by generative pre-trained transformers (GPT) to bridge the gap in understanding and trust often created by machine learning models in critical healthcare decision-making. The developed system employed random forest for disease diagnosis, LIME for interpretation of the results, and GPT-3.5 for generating explanations in easy-to-understand language.</p><p><strong>Results: </strong>Our model demonstrated robust performance in detecting malaria, achieving precision, recall, and F1-scores of 85%, 91%, and 88%, respectively. It performed moderately well in detecting urinary tract and respiratory tract infections, with precision, recall, and F1-scores of 80%, 65%, and 72%, and 77%, 68%, and 72%, respectively, maintaining an effective balance between sensitivity and specificity. However, the model exhibited limitations in detecting typhoid fever and human immunodeficiency virus/acquired immune deficiency syndrome, achieving lower precision, recall, and F1-scores of 69%, 53%, and 60%, and 75%, 39%, and 51%, respectively. These results indicate missed true-positive cases, necessitating further model fine-tuning. LIME and GPT-3.5 were integrated to enhance transparency and provide natural language explanations, thereby aiding decision-making and improving user comprehension of the diagnoses.</p><p><strong>Conclusions: </strong>The LIME plots revealed key symptoms influencing the diagnoses, with bitter taste in the mouth and fever showing the highest negative influence on predictions, and GPT-3.5 provided natural language explanations that increased the reliability and trustworthiness of the system, promoting improved patient outcomes and reducing the healthcare burden.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"125-135"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086442/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信