Saeed Shahsavari, Abbas Moghimbeigi, Rohollah Kalhor, Ali Moghadas Jafari, Mehrdad Bagherpour-Kalo, Mehdi Yaseri, Mostafa Hosseini
{"title":"零膨胀计数回归模型在解决异常值数据带来的挑战中的应用;对住院时间的应用。","authors":"Saeed Shahsavari, Abbas Moghimbeigi, Rohollah Kalhor, Ali Moghadas Jafari, Mehrdad Bagherpour-Kalo, Mehdi Yaseri, Mostafa Hosseini","doi":"10.22037/aaem.v12i1.2074","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Ignoring outliers in data may lead to misleading results. Length of stay (LOS) is often considered a count variable with a high frequency of outliers. This study exemplifies the potential of robust methodologies in enhancing the accuracy and reliability of analyses conducted on skewed and outlier-prone count data of LOS.</p><p><strong>Methods: </strong>The application of Zero-Inflated Poisson (ZIP) and robust Zero-Inflated Poisson (RZIP) models in solving challenges posed by outlier LOS data were evaluated. The ZIP model incorporates two components, tackling excess zeros with a zero-inflation component and modeling positive counts with a Poisson component. The RZIP model introduces the Robust Expectation-Solution (RES) algorithm to enhance parameter estimation and address the impact of outliers on the model's performance.</p><p><strong>Results: </strong>Data from 254 intensive care unit patients were analyzed (62.2% male). Patients aged 65 or older accounted for 58.3% of the sample. Notably, 38.6% of patients exhibited zero LOS. The overall mean LOS was 5.89 (± 9.81) days, and 9.45% of cases displayed outliers. Our analysis using the RZIP model revealed significant predictors of LOS, including age, underlying comorbidities (p<0.001), and insurance status (p=0.013). Model comparison demonstrated the RZIP model's superiority over ZIP, as evidenced by lower Akaike information criteria (AIC) and Bayesians information criteria (BIC) values.</p><p><strong>Conclusions: </strong>The application of the RZIP model allowed us to uncover meaningful insights into the factors influencing LOS, paving the way for more informed decision-making in hospital management.</p>","PeriodicalId":8146,"journal":{"name":"Archives of Academic Emergency Medicine","volume":"12 1","pages":"e13"},"PeriodicalIF":2.9000,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10871051/pdf/","citationCount":"0","resultStr":"{\"title\":\"Zero-Inflated Count Regression Models in Solving Challenges Posed by Outlier-Prone Data; an Application to Length of Hospital Stay.\",\"authors\":\"Saeed Shahsavari, Abbas Moghimbeigi, Rohollah Kalhor, Ali Moghadas Jafari, Mehrdad Bagherpour-Kalo, Mehdi Yaseri, Mostafa Hosseini\",\"doi\":\"10.22037/aaem.v12i1.2074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Ignoring outliers in data may lead to misleading results. Length of stay (LOS) is often considered a count variable with a high frequency of outliers. This study exemplifies the potential of robust methodologies in enhancing the accuracy and reliability of analyses conducted on skewed and outlier-prone count data of LOS.</p><p><strong>Methods: </strong>The application of Zero-Inflated Poisson (ZIP) and robust Zero-Inflated Poisson (RZIP) models in solving challenges posed by outlier LOS data were evaluated. The ZIP model incorporates two components, tackling excess zeros with a zero-inflation component and modeling positive counts with a Poisson component. The RZIP model introduces the Robust Expectation-Solution (RES) algorithm to enhance parameter estimation and address the impact of outliers on the model's performance.</p><p><strong>Results: </strong>Data from 254 intensive care unit patients were analyzed (62.2% male). Patients aged 65 or older accounted for 58.3% of the sample. Notably, 38.6% of patients exhibited zero LOS. The overall mean LOS was 5.89 (± 9.81) days, and 9.45% of cases displayed outliers. Our analysis using the RZIP model revealed significant predictors of LOS, including age, underlying comorbidities (p<0.001), and insurance status (p=0.013). Model comparison demonstrated the RZIP model's superiority over ZIP, as evidenced by lower Akaike information criteria (AIC) and Bayesians information criteria (BIC) values.</p><p><strong>Conclusions: </strong>The application of the RZIP model allowed us to uncover meaningful insights into the factors influencing LOS, paving the way for more informed decision-making in hospital management.</p>\",\"PeriodicalId\":8146,\"journal\":{\"name\":\"Archives of Academic Emergency Medicine\",\"volume\":\"12 1\",\"pages\":\"e13\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2023-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10871051/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Archives of Academic Emergency Medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22037/aaem.v12i1.2074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"EMERGENCY MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of Academic Emergency Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22037/aaem.v12i1.2074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}
引用次数: 0
摘要
导言忽略数据中的异常值可能会导致误导性结果。住院时间(LOS)通常被认为是一个计数变量,离群值出现的频率很高。本研究体现了稳健方法在提高对偏斜和离群的 LOS 计数数据进行分析的准确性和可靠性方面的潜力:评估了零膨胀泊松(ZIP)和稳健零膨胀泊松(RZIP)模型在解决离群 LOS 数据带来的挑战方面的应用。ZIP 模型包含两个部分,用零膨胀部分解决过多零的问题,用泊松部分对正计数进行建模。RZIP 模型引入了稳健期望求解(RES)算法,以加强参数估计并解决异常值对模型性能的影响:分析了 254 名重症监护室患者(62.2% 为男性)的数据。65 岁或以上的患者占样本的 58.3%。值得注意的是,38.6% 的患者的 LOS 为零。总体平均 LOS 为 5.89 (± 9.81) 天,9.45% 的病例出现异常值。我们使用 RZIP 模型进行的分析表明,年龄、基础并发症(p 结论:RZIP 模型的应用可显著预测 LOS:RZIP 模型的应用使我们能够深入了解影响住院时间的因素,为医院管理中更明智的决策铺平了道路。
Zero-Inflated Count Regression Models in Solving Challenges Posed by Outlier-Prone Data; an Application to Length of Hospital Stay.
Introduction: Ignoring outliers in data may lead to misleading results. Length of stay (LOS) is often considered a count variable with a high frequency of outliers. This study exemplifies the potential of robust methodologies in enhancing the accuracy and reliability of analyses conducted on skewed and outlier-prone count data of LOS.
Methods: The application of Zero-Inflated Poisson (ZIP) and robust Zero-Inflated Poisson (RZIP) models in solving challenges posed by outlier LOS data were evaluated. The ZIP model incorporates two components, tackling excess zeros with a zero-inflation component and modeling positive counts with a Poisson component. The RZIP model introduces the Robust Expectation-Solution (RES) algorithm to enhance parameter estimation and address the impact of outliers on the model's performance.
Results: Data from 254 intensive care unit patients were analyzed (62.2% male). Patients aged 65 or older accounted for 58.3% of the sample. Notably, 38.6% of patients exhibited zero LOS. The overall mean LOS was 5.89 (± 9.81) days, and 9.45% of cases displayed outliers. Our analysis using the RZIP model revealed significant predictors of LOS, including age, underlying comorbidities (p<0.001), and insurance status (p=0.013). Model comparison demonstrated the RZIP model's superiority over ZIP, as evidenced by lower Akaike information criteria (AIC) and Bayesians information criteria (BIC) values.
Conclusions: The application of the RZIP model allowed us to uncover meaningful insights into the factors influencing LOS, paving the way for more informed decision-making in hospital management.