Yao Bu, Danqi Wang, Xiaomao Fan, Jiongying Li, Lei Hua, Lin Zhang, Wenjun Ma, Liwen He, Hao Zang, Haijun Zhang, Xingyu Liu, Yufeng Gao, Li Liu
{"title":"Enhancing predictions of health insurance overspending risk through hospital departmental performance indicators.","authors":"Yao Bu, Danqi Wang, Xiaomao Fan, Jiongying Li, Lei Hua, Lin Zhang, Wenjun Ma, Liwen He, Hao Zang, Haijun Zhang, Xingyu Liu, Yufeng Gao, Li Liu","doi":"10.17305/bb.2025.12051","DOIUrl":null,"url":null,"abstract":"<p><p>The substantial rise in health insurance expenditures, combined with delayed feedback on overspending from administrative departments, highlights the urgent need for timely reporting of such data. This study analyzed a large cohort of 549,910 discharged patients' medical records from the Wuxi Health Commission, covering the period from January 2022 to November 2023. We applied four widely recognized machine learning techniques-Logistic Regression (LR), LightGBM, Random Forest (RF), and Artificial Neural Networks (ANN)-alongside departmental performance indicators (DPIs) to develop Insurance Overspending Risk Prediction (IORP) models at both regional and hospital levels. The dataset was divided into training and testing sets in a 7:3 ratio. Experimental results showed that LightGBM outperformed the other models, achieving an accuracy of 0.82 for both regional and hospital-level predictions. Its weighted F1-score reached 0.78 at the regional level and 0.82 at the hospital level, with corresponding AUC-ROC (Area Under the Receiver Operating Characteristic Curve) values of 0.91 and 0.94, demonstrating strong performance in identifying overspending risks. The model's high recall and precision further ensure reliable predictions and minimize misclassifications. Notably, four key DPIs-Total Amount of Discharged Patients (TADP), Average Inpatient Stay (AIS), Medicine Expenses Percentage (MEP), and Consumable Expenses Percentage (CEP)-were strongly correlated with overspending risks. The integration of IORP models into the Health Insurance Management System (HIMS) at the Affiliated Hospital of Jiangnan University has significantly improved departmental managers' ability to anticipate overspending. By effectively leveraging HIMS in combination with this advanced model, managers can perform timely, accurate assessments, thereby enhancing financial oversight and resource allocation.</p>","PeriodicalId":72398,"journal":{"name":"Biomolecules & biomedicine","volume":" ","pages":"2269-2280"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12451977/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomolecules & biomedicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17305/bb.2025.12051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
The substantial rise in health insurance expenditures, combined with delayed feedback on overspending from administrative departments, highlights the urgent need for timely reporting of such data. This study analyzed a large cohort of 549,910 discharged patients' medical records from the Wuxi Health Commission, covering the period from January 2022 to November 2023. We applied four widely recognized machine learning techniques-Logistic Regression (LR), LightGBM, Random Forest (RF), and Artificial Neural Networks (ANN)-alongside departmental performance indicators (DPIs) to develop Insurance Overspending Risk Prediction (IORP) models at both regional and hospital levels. The dataset was divided into training and testing sets in a 7:3 ratio. Experimental results showed that LightGBM outperformed the other models, achieving an accuracy of 0.82 for both regional and hospital-level predictions. Its weighted F1-score reached 0.78 at the regional level and 0.82 at the hospital level, with corresponding AUC-ROC (Area Under the Receiver Operating Characteristic Curve) values of 0.91 and 0.94, demonstrating strong performance in identifying overspending risks. The model's high recall and precision further ensure reliable predictions and minimize misclassifications. Notably, four key DPIs-Total Amount of Discharged Patients (TADP), Average Inpatient Stay (AIS), Medicine Expenses Percentage (MEP), and Consumable Expenses Percentage (CEP)-were strongly correlated with overspending risks. The integration of IORP models into the Health Insurance Management System (HIMS) at the Affiliated Hospital of Jiangnan University has significantly improved departmental managers' ability to anticipate overspending. By effectively leveraging HIMS in combination with this advanced model, managers can perform timely, accurate assessments, thereby enhancing financial oversight and resource allocation.
健康保险支出大幅增加,加上行政部门对超支的反馈延迟,突出表明迫切需要及时报告这类数据。本研究分析了无锡市卫生健康委员会从2022年1月至2023年11月期间549,910例出院患者的医疗记录。我们应用了四种被广泛认可的机器学习技术——逻辑回归(LR)、LightGBM、随机森林(RF)和人工神经网络(ANN)——以及部门绩效指标(dpi)来开发区域和医院层面的保险超支风险预测(IORP)模型。数据集以7:3的比例分为训练集和测试集。实验结果表明,LightGBM优于其他模型,在区域和医院级别的预测中都达到了0.82的精度。其区域一级和医院一级的加权f1得分分别为0.78和0.82,AUC-ROC (Receiver Operating Characteristic Curve Area Under the Area)分别为0.91和0.94,在识别超支风险方面表现出较强的能力。该模型的高召回率和精度进一步确保了可靠的预测,并最大限度地减少了错误分类。值得注意的是,四个关键dpis -出院总人数(TADP)、平均住院时间(AIS)、药品费用百分比(MEP)和消耗品费用百分比(CEP)-与超支风险密切相关。将IORP模型整合到江南大学附属医院医疗保险管理系统(HIMS)中,显著提高了部门管理者预测超支的能力。通过有效地利用HIMS与这一先进模型的结合,管理人员可以进行及时、准确的评估,从而加强财务监督和资源分配。