Lyse Naomi Wamba Momo , Vincent Scheltjens , Wouter Verbeke , Frank Rademakers , Bart De Moor
{"title":"Understanding deep learning models for Length of Stay prediction on critically ill patients through latent space visualization","authors":"Lyse Naomi Wamba Momo , Vincent Scheltjens , Wouter Verbeke , Frank Rademakers , Bart De Moor","doi":"10.1016/j.cmpb.2025.108832","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Continuous, real-time monitoring of Length of Stay (LoS) for critically ill patients in Intensive Care Units (ICUs) is essential for anticipating patient needs, reduce the risk of adverse events, optimize resource allocation, plan incoming patients, and improve overall care. While previous research has focused primarily on predicting LoS, less attention has been given to how these prediction systems can be used by non-machine learning experts in real hospital environments for capacity planning.</div></div><div><h3>Methods:</h3><div>In this work, we predict remaining ICU LoS using data from the Amsterdam University Medical Center dataset, which is the first freely available European critical care dataset and shows higher variability in LoS compared to U.S. datasets. We applied state-of-the-art sequence-to-sequence deep learning models – Long Short-Term Memory Networks, Gated Recurrent Units, Temporal Convolutional Networks with and without attention, and Transformer models – on 271 input features extracted from 20,481 ICU stays. Additionally, the latent spaces of these models were extracted, projected onto a 2D space and explored interactively to intuitively understand how the models learn patterns from the clinical data over time.</div></div><div><h3>Results:</h3><div>The TCN model with attention (TCN-att) returned the best performance, reducing the prediction error by 2.24 days from a MAE of 6.94 days to a MAE of 4.70 days. Latent space analysis revealed that with just 5–6 h of patient data, the models could clearly differentiate between short (<span><math><mrow><mn>0</mn><mo><</mo><mtext>LoS</mtext><mo>≤</mo><mn>3</mn></mrow></math></span> days) and long (<span><math><mrow><mn>7</mn><mo>≤</mo><mtext>LoS</mtext><mo><</mo><mo>+</mo><mi>∞</mi></mrow></math></span> days) ICU stays, identify a patient cluster at risk of in-hospital mortality, and more.</div></div><div><h3>Conclusion:</h3><div>Beyond reporting prediction error metrics, this study shows how an interactive dashboard can be used to gain insights into the inner workings of complex algorithms and how these can be integrated into clinical decision support systems. This approach allows for real-time intuitive understanding of how models learn and how patient predictions evolve, facilitating their use in real hospital environments.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"268 ","pages":"Article 108832"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725002494","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and Objective:
Continuous, real-time monitoring of Length of Stay (LoS) for critically ill patients in Intensive Care Units (ICUs) is essential for anticipating patient needs, reduce the risk of adverse events, optimize resource allocation, plan incoming patients, and improve overall care. While previous research has focused primarily on predicting LoS, less attention has been given to how these prediction systems can be used by non-machine learning experts in real hospital environments for capacity planning.
Methods:
In this work, we predict remaining ICU LoS using data from the Amsterdam University Medical Center dataset, which is the first freely available European critical care dataset and shows higher variability in LoS compared to U.S. datasets. We applied state-of-the-art sequence-to-sequence deep learning models – Long Short-Term Memory Networks, Gated Recurrent Units, Temporal Convolutional Networks with and without attention, and Transformer models – on 271 input features extracted from 20,481 ICU stays. Additionally, the latent spaces of these models were extracted, projected onto a 2D space and explored interactively to intuitively understand how the models learn patterns from the clinical data over time.
Results:
The TCN model with attention (TCN-att) returned the best performance, reducing the prediction error by 2.24 days from a MAE of 6.94 days to a MAE of 4.70 days. Latent space analysis revealed that with just 5–6 h of patient data, the models could clearly differentiate between short ( days) and long ( days) ICU stays, identify a patient cluster at risk of in-hospital mortality, and more.
Conclusion:
Beyond reporting prediction error metrics, this study shows how an interactive dashboard can be used to gain insights into the inner workings of complex algorithms and how these can be integrated into clinical decision support systems. This approach allows for real-time intuitive understanding of how models learn and how patient predictions evolve, facilitating their use in real hospital environments.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.