基于机器学习预测肺结核患者的住院费用。

IF 3.4 3区 医学 Q2 INFECTIOUS DISEASES
Shiyu Fan, Abudoukeyoumujiang Abulizi, Yi You, Chencui Huang, Yasen Yimit, Qiange Li, Xiaoguang Zou, Mayidili Nijiati
{"title":"基于机器学习预测肺结核患者的住院费用。","authors":"Shiyu Fan, Abudoukeyoumujiang Abulizi, Yi You, Chencui Huang, Yasen Yimit, Qiange Li, Xiaoguang Zou, Mayidili Nijiati","doi":"10.1186/s12879-024-09771-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Pulmonary tuberculosis (PTB) is a prevalent chronic disease associated with a significant economic burden on patients. Using machine learning to predict hospitalization costs can allocate medical resources effectively and optimize the cost structure rationally, so as to control the hospitalization costs of patients better.</p><p><strong>Methods: </strong>This research analyzed data (2020-2022) from a Kashgar pulmonary hospital's information system, involving 9570 eligible PTB patients. SPSS 26.0 was used for multiple regression analysis, while Python 3.7 was used for random forest regression (RFR) and MLP. The training set included data from 2020 and 2021, while the test set included data from 2022. The models predicted seven various costs related to PTB patients, including diagnostic cost, medical service cost, material cost, treatment cost, drug cost, other cost, and total hospitalization cost. The model's predictive performance was evaluated using R-square (R<sup>2</sup>), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) metrics.</p><p><strong>Results: </strong>Among the 9570 PTB patients included in the study, the median and quartile of total hospitalization cost were 13,150.45 (9891.34, 19,648.48) yuan. Nine factors, including age, marital status, admission condition, length of hospital stay, initial treatment, presence of other diseases, transfer, drug resistance, and admission department, significantly influenced hospitalization costs for PTB patients. Overall, MLP demonstrated superior performance in most cost predictions, outperforming RFR and multiple regression; The performance of RFR is between MLP and multiple regression; The predictive performance of multiple regression is the lowest, but it shows the best results for Other costs.</p><p><strong>Conclusion: </strong>The MLP can effectively leverage patient information and accurately predict various hospitalization costs, achieving a rationalized structure of hospitalization costs by adjusting higher-cost inpatient items and balancing different cost categories. The insights of this predictive model also hold relevance for research in other medical conditions.</p>","PeriodicalId":8981,"journal":{"name":"BMC Infectious Diseases","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11360310/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting hospitalization costs for pulmonary tuberculosis patients based on machine learning.\",\"authors\":\"Shiyu Fan, Abudoukeyoumujiang Abulizi, Yi You, Chencui Huang, Yasen Yimit, Qiange Li, Xiaoguang Zou, Mayidili Nijiati\",\"doi\":\"10.1186/s12879-024-09771-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Pulmonary tuberculosis (PTB) is a prevalent chronic disease associated with a significant economic burden on patients. Using machine learning to predict hospitalization costs can allocate medical resources effectively and optimize the cost structure rationally, so as to control the hospitalization costs of patients better.</p><p><strong>Methods: </strong>This research analyzed data (2020-2022) from a Kashgar pulmonary hospital's information system, involving 9570 eligible PTB patients. SPSS 26.0 was used for multiple regression analysis, while Python 3.7 was used for random forest regression (RFR) and MLP. The training set included data from 2020 and 2021, while the test set included data from 2022. The models predicted seven various costs related to PTB patients, including diagnostic cost, medical service cost, material cost, treatment cost, drug cost, other cost, and total hospitalization cost. The model's predictive performance was evaluated using R-square (R<sup>2</sup>), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) metrics.</p><p><strong>Results: </strong>Among the 9570 PTB patients included in the study, the median and quartile of total hospitalization cost were 13,150.45 (9891.34, 19,648.48) yuan. Nine factors, including age, marital status, admission condition, length of hospital stay, initial treatment, presence of other diseases, transfer, drug resistance, and admission department, significantly influenced hospitalization costs for PTB patients. Overall, MLP demonstrated superior performance in most cost predictions, outperforming RFR and multiple regression; The performance of RFR is between MLP and multiple regression; The predictive performance of multiple regression is the lowest, but it shows the best results for Other costs.</p><p><strong>Conclusion: </strong>The MLP can effectively leverage patient information and accurately predict various hospitalization costs, achieving a rationalized structure of hospitalization costs by adjusting higher-cost inpatient items and balancing different cost categories. The insights of this predictive model also hold relevance for research in other medical conditions.</p>\",\"PeriodicalId\":8981,\"journal\":{\"name\":\"BMC Infectious Diseases\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11360310/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Infectious Diseases\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12879-024-09771-6\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INFECTIOUS DISEASES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Infectious Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12879-024-09771-6","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0

摘要

背景:肺结核(PTB)是一种普遍存在的慢性疾病,给患者带来了巨大的经济负担。利用机器学习预测住院费用可以有效配置医疗资源,合理优化费用结构,从而更好地控制患者的住院费用:本研究分析了喀什某肺科医院信息系统中的数据(2020-2022 年),涉及 9570 名符合条件的肺结核患者。多元回归分析使用 SPSS 26.0,随机森林回归(RFR)和 MLP 使用 Python 3.7。训练集包括 2020 年和 2021 年的数据,测试集包括 2022 年的数据。模型预测了 PTB 患者的七种相关费用,包括诊断费用、医疗服务费用、材料费用、治疗费用、药物费用、其他费用和住院总费用。使用R2、均方根误差(RMSE)和平均绝对误差(MAE)指标对模型的预测性能进行评估:在 9570 名 PTB 患者中,住院总费用的中位数和四分位数分别为 13150.45(9891.34,19648.48)元。年龄、婚姻状况、入院条件、住院时间、初始治疗、是否患有其他疾病、转院、耐药性、入院科室等九个因素对 PTB 患者的住院费用有显著影响。总体而言,MLP 在大多数费用预测中表现优异,优于 RFR 和多元回归;RFR 的性能介于 MLP 和多元回归之间;多元回归的预测性能最低,但对其他费用的预测结果最好:MLP能有效利用患者信息,准确预测各种住院费用,通过调整费用较高的住院项目和平衡不同费用类别,实现住院费用结构的合理化。该预测模型的见解对其他医疗条件的研究也有借鉴意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting hospitalization costs for pulmonary tuberculosis patients based on machine learning.

Background: Pulmonary tuberculosis (PTB) is a prevalent chronic disease associated with a significant economic burden on patients. Using machine learning to predict hospitalization costs can allocate medical resources effectively and optimize the cost structure rationally, so as to control the hospitalization costs of patients better.

Methods: This research analyzed data (2020-2022) from a Kashgar pulmonary hospital's information system, involving 9570 eligible PTB patients. SPSS 26.0 was used for multiple regression analysis, while Python 3.7 was used for random forest regression (RFR) and MLP. The training set included data from 2020 and 2021, while the test set included data from 2022. The models predicted seven various costs related to PTB patients, including diagnostic cost, medical service cost, material cost, treatment cost, drug cost, other cost, and total hospitalization cost. The model's predictive performance was evaluated using R-square (R2), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) metrics.

Results: Among the 9570 PTB patients included in the study, the median and quartile of total hospitalization cost were 13,150.45 (9891.34, 19,648.48) yuan. Nine factors, including age, marital status, admission condition, length of hospital stay, initial treatment, presence of other diseases, transfer, drug resistance, and admission department, significantly influenced hospitalization costs for PTB patients. Overall, MLP demonstrated superior performance in most cost predictions, outperforming RFR and multiple regression; The performance of RFR is between MLP and multiple regression; The predictive performance of multiple regression is the lowest, but it shows the best results for Other costs.

Conclusion: The MLP can effectively leverage patient information and accurately predict various hospitalization costs, achieving a rationalized structure of hospitalization costs by adjusting higher-cost inpatient items and balancing different cost categories. The insights of this predictive model also hold relevance for research in other medical conditions.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Infectious Diseases
BMC Infectious Diseases 医学-传染病学
CiteScore
6.50
自引率
0.00%
发文量
860
审稿时长
3.3 months
期刊介绍: BMC Infectious Diseases is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of infectious and sexually transmitted diseases in humans, as well as related molecular genetics, pathophysiology, and epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信