Zero-Inflated Count Regression Models in Solving Challenges Posed by Outlier-Prone Data; an Application to Length of Hospital Stay.

IF 2.9 Q1 EMERGENCY MEDICINE
Archives of Academic Emergency Medicine Pub Date : 2023-11-21 eCollection Date: 2024-01-01 DOI:10.22037/aaem.v12i1.2074
Saeed Shahsavari, Abbas Moghimbeigi, Rohollah Kalhor, Ali Moghadas Jafari, Mehrdad Bagherpour-Kalo, Mehdi Yaseri, Mostafa Hosseini
{"title":"Zero-Inflated Count Regression Models in Solving Challenges Posed by Outlier-Prone Data; an Application to Length of Hospital Stay.","authors":"Saeed Shahsavari, Abbas Moghimbeigi, Rohollah Kalhor, Ali Moghadas Jafari, Mehrdad Bagherpour-Kalo, Mehdi Yaseri, Mostafa Hosseini","doi":"10.22037/aaem.v12i1.2074","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Ignoring outliers in data may lead to misleading results. Length of stay (LOS) is often considered a count variable with a high frequency of outliers. This study exemplifies the potential of robust methodologies in enhancing the accuracy and reliability of analyses conducted on skewed and outlier-prone count data of LOS.</p><p><strong>Methods: </strong>The application of Zero-Inflated Poisson (ZIP) and robust Zero-Inflated Poisson (RZIP) models in solving challenges posed by outlier LOS data were evaluated. The ZIP model incorporates two components, tackling excess zeros with a zero-inflation component and modeling positive counts with a Poisson component. The RZIP model introduces the Robust Expectation-Solution (RES) algorithm to enhance parameter estimation and address the impact of outliers on the model's performance.</p><p><strong>Results: </strong>Data from 254 intensive care unit patients were analyzed (62.2% male). Patients aged 65 or older accounted for 58.3% of the sample. Notably, 38.6% of patients exhibited zero LOS. The overall mean LOS was 5.89 (± 9.81) days, and 9.45% of cases displayed outliers. Our analysis using the RZIP model revealed significant predictors of LOS, including age, underlying comorbidities (p<0.001), and insurance status (p=0.013). Model comparison demonstrated the RZIP model's superiority over ZIP, as evidenced by lower Akaike information criteria (AIC) and Bayesians information criteria (BIC) values.</p><p><strong>Conclusions: </strong>The application of the RZIP model allowed us to uncover meaningful insights into the factors influencing LOS, paving the way for more informed decision-making in hospital management.</p>","PeriodicalId":8146,"journal":{"name":"Archives of Academic Emergency Medicine","volume":"12 1","pages":"e13"},"PeriodicalIF":2.9000,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10871051/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of Academic Emergency Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22037/aaem.v12i1.2074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Ignoring outliers in data may lead to misleading results. Length of stay (LOS) is often considered a count variable with a high frequency of outliers. This study exemplifies the potential of robust methodologies in enhancing the accuracy and reliability of analyses conducted on skewed and outlier-prone count data of LOS.

Methods: The application of Zero-Inflated Poisson (ZIP) and robust Zero-Inflated Poisson (RZIP) models in solving challenges posed by outlier LOS data were evaluated. The ZIP model incorporates two components, tackling excess zeros with a zero-inflation component and modeling positive counts with a Poisson component. The RZIP model introduces the Robust Expectation-Solution (RES) algorithm to enhance parameter estimation and address the impact of outliers on the model's performance.

Results: Data from 254 intensive care unit patients were analyzed (62.2% male). Patients aged 65 or older accounted for 58.3% of the sample. Notably, 38.6% of patients exhibited zero LOS. The overall mean LOS was 5.89 (± 9.81) days, and 9.45% of cases displayed outliers. Our analysis using the RZIP model revealed significant predictors of LOS, including age, underlying comorbidities (p<0.001), and insurance status (p=0.013). Model comparison demonstrated the RZIP model's superiority over ZIP, as evidenced by lower Akaike information criteria (AIC) and Bayesians information criteria (BIC) values.

Conclusions: The application of the RZIP model allowed us to uncover meaningful insights into the factors influencing LOS, paving the way for more informed decision-making in hospital management.

零膨胀计数回归模型在解决异常值数据带来的挑战中的应用;对住院时间的应用。
导言忽略数据中的异常值可能会导致误导性结果。住院时间(LOS)通常被认为是一个计数变量,离群值出现的频率很高。本研究体现了稳健方法在提高对偏斜和离群的 LOS 计数数据进行分析的准确性和可靠性方面的潜力:评估了零膨胀泊松(ZIP)和稳健零膨胀泊松(RZIP)模型在解决离群 LOS 数据带来的挑战方面的应用。ZIP 模型包含两个部分,用零膨胀部分解决过多零的问题,用泊松部分对正计数进行建模。RZIP 模型引入了稳健期望求解(RES)算法,以加强参数估计并解决异常值对模型性能的影响:分析了 254 名重症监护室患者(62.2% 为男性)的数据。65 岁或以上的患者占样本的 58.3%。值得注意的是,38.6% 的患者的 LOS 为零。总体平均 LOS 为 5.89 (± 9.81) 天,9.45% 的病例出现异常值。我们使用 RZIP 模型进行的分析表明,年龄、基础并发症(p 结论:RZIP 模型的应用可显著预测 LOS:RZIP 模型的应用使我们能够深入了解影响住院时间的因素,为医院管理中更明智的决策铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Archives of Academic Emergency Medicine
Archives of Academic Emergency Medicine Medicine-Emergency Medicine
CiteScore
8.90
自引率
7.40%
发文量
0
审稿时长
6 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信