{"title":"优化尿路感染诊断的机器学习模型:逻辑回归与随机森林的比较研究","authors":"Gregorius Airlangga","doi":"10.37034/infeb.v6i1.854","DOIUrl":null,"url":null,"abstract":"Urinary Tract Infections (UTIs) present a significant healthcare challenge due to their prevalence and diagnostic complexity. Timely and accurate diagnosis is critical for effective treatment, yet traditional methods like microbial cultures and urinalysis are often slow and inconsistent. This study introduces machine learning (ML) as a transformative solution for UTI diagnostics, particularly focusing on logistic regression and random forest models renowned for their interpretability and robustness. We conducted a meticulous hyperparameter tuning process using a rich dataset from a clinic in Northern Mindanao, Philippines, incorporating demographic, clinical, and urinalysis data. Our research outlines a detailed methodology for applying and refining these ML models to predict UTI outcomes accurately. Through comprehensive hyperparameter optimization, we enhanced the predictive performance, demonstrating a significant improvement over standard diagnostic practice. The findings reveal a clear superiority of the random forest model, achieving a top testing accuracy of 0.9814, compared to the best-performing logistic regression model's accuracy of 0.7626. This comparative analysis not only validates the efficacy of ML in medical diagnostics but also emphasizes the potential clinical impact of these models in real-world settings. The study contributes to the burgeoning literature on ML applications in healthcare by providing a blueprint for optimizing ML models for clinical use, particularly in diagnosing UTIs. It underscores the promise of ML in augmenting diagnostic precision, thereby potentially reducing the global healthcare burden associated with UTIs.","PeriodicalId":242689,"journal":{"name":"Jurnal Informatika Ekonomi Bisnis","volume":"23 14","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing Machine Learning Models for Urinary Tract Infection Diagnostics: A Comparative Study of Logistic Regression and Random Forest\",\"authors\":\"Gregorius Airlangga\",\"doi\":\"10.37034/infeb.v6i1.854\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Urinary Tract Infections (UTIs) present a significant healthcare challenge due to their prevalence and diagnostic complexity. Timely and accurate diagnosis is critical for effective treatment, yet traditional methods like microbial cultures and urinalysis are often slow and inconsistent. This study introduces machine learning (ML) as a transformative solution for UTI diagnostics, particularly focusing on logistic regression and random forest models renowned for their interpretability and robustness. We conducted a meticulous hyperparameter tuning process using a rich dataset from a clinic in Northern Mindanao, Philippines, incorporating demographic, clinical, and urinalysis data. Our research outlines a detailed methodology for applying and refining these ML models to predict UTI outcomes accurately. Through comprehensive hyperparameter optimization, we enhanced the predictive performance, demonstrating a significant improvement over standard diagnostic practice. The findings reveal a clear superiority of the random forest model, achieving a top testing accuracy of 0.9814, compared to the best-performing logistic regression model's accuracy of 0.7626. This comparative analysis not only validates the efficacy of ML in medical diagnostics but also emphasizes the potential clinical impact of these models in real-world settings. The study contributes to the burgeoning literature on ML applications in healthcare by providing a blueprint for optimizing ML models for clinical use, particularly in diagnosing UTIs. It underscores the promise of ML in augmenting diagnostic precision, thereby potentially reducing the global healthcare burden associated with UTIs.\",\"PeriodicalId\":242689,\"journal\":{\"name\":\"Jurnal Informatika Ekonomi Bisnis\",\"volume\":\"23 14\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jurnal Informatika Ekonomi Bisnis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37034/infeb.v6i1.854\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Informatika Ekonomi Bisnis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37034/infeb.v6i1.854","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
尿路感染(UTI)因其发病率高、诊断复杂而成为医疗保健领域的重大挑战。及时、准确的诊断对于有效治疗至关重要,然而微生物培养和尿液分析等传统方法往往既缓慢又不一致。本研究将机器学习(ML)作为UTI 诊断的变革性解决方案,尤其关注以可解释性和鲁棒性著称的逻辑回归和随机森林模型。我们利用菲律宾北棉兰老岛一家诊所的丰富数据集,结合人口统计学、临床和尿液分析数据,进行了细致的超参数调整过程。我们的研究概述了应用和改进这些 ML 模型以准确预测 UTI 结果的详细方法。通过全面的超参数优化,我们提高了预测性能,与标准诊断实践相比有了显著改善。研究结果表明,随机森林模型具有明显的优势,其最高测试准确率为 0.9814,而表现最好的逻辑回归模型的准确率为 0.7626。这项对比分析不仅验证了 ML 在医学诊断中的功效,还强调了这些模型在现实世界中的潜在临床影响。这项研究为优化 ML 模型的临床应用(尤其是在诊断 UTI 方面)提供了蓝图,从而为 ML 在医疗保健领域的应用提供了新的文献资料。它强调了 ML 在提高诊断精确度方面的前景,从而有可能减轻与 UTIs 相关的全球医疗负担。
Optimizing Machine Learning Models for Urinary Tract Infection Diagnostics: A Comparative Study of Logistic Regression and Random Forest
Urinary Tract Infections (UTIs) present a significant healthcare challenge due to their prevalence and diagnostic complexity. Timely and accurate diagnosis is critical for effective treatment, yet traditional methods like microbial cultures and urinalysis are often slow and inconsistent. This study introduces machine learning (ML) as a transformative solution for UTI diagnostics, particularly focusing on logistic regression and random forest models renowned for their interpretability and robustness. We conducted a meticulous hyperparameter tuning process using a rich dataset from a clinic in Northern Mindanao, Philippines, incorporating demographic, clinical, and urinalysis data. Our research outlines a detailed methodology for applying and refining these ML models to predict UTI outcomes accurately. Through comprehensive hyperparameter optimization, we enhanced the predictive performance, demonstrating a significant improvement over standard diagnostic practice. The findings reveal a clear superiority of the random forest model, achieving a top testing accuracy of 0.9814, compared to the best-performing logistic regression model's accuracy of 0.7626. This comparative analysis not only validates the efficacy of ML in medical diagnostics but also emphasizes the potential clinical impact of these models in real-world settings. The study contributes to the burgeoning literature on ML applications in healthcare by providing a blueprint for optimizing ML models for clinical use, particularly in diagnosing UTIs. It underscores the promise of ML in augmenting diagnostic precision, thereby potentially reducing the global healthcare burden associated with UTIs.