Practical guide to building machine learning-based clinical prediction models using imbalanced datasets.

IF 2.1 Q3 CRITICAL CARE MEDICINE

Trauma Surgery & Acute Care Open Pub Date : 2024-06-12 eCollection Date: 2024-01-01 DOI:10.1136/tsaco-2023-001222

Jacklyn Luu, Evgenia Borisenko, Valerie Przekop, Advait Patil, Joseph D Forrester, Jeff Choi

引用次数: 0

Abstract

Clinical prediction models often aim to predict rare, high-risk events, but building such models requires robust understanding of imbalance datasets and their unique study design considerations. This practical guide highlights foundational prediction model principles for surgeon-data scientists and readers who encounter clinical prediction models, from feature engineering and algorithm selection strategies to model evaluation and design techniques specific to imbalanced datasets. We walk through a clinical example using readable code to highlight important considerations and common pitfalls in developing machine learning-based prediction models. We hope this practical guide facilitates developing and critically appraising robust clinical prediction models for the surgical community.

查看原文本刊更多论文

使用不平衡数据集构建基于机器学习的临床预测模型实用指南。

临床预测模型通常旨在预测罕见的高风险事件，但建立此类模型需要对不平衡数据集及其独特的研究设计考虑因素有深入的了解。本实用指南为外科医生和数据科学家以及遇到临床预测模型的读者重点介绍了预测模型的基本原理，从特征工程和算法选择策略到不平衡数据集特有的模型评估和设计技术。我们通过一个临床实例，使用可读代码来强调开发基于机器学习的预测模型时的重要注意事项和常见陷阱。我们希望这本实用指南有助于外科界开发和严格评估稳健的临床预测模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Trauma Surgery & Acute Care Open Multiple-

CiteScore

3.70

自引率

5.00%

发文量

审稿时长

12 weeks