{"title":"Hybrid time series and machine learning models for forecasting cardiovascular mortality in India: an age specific analysis.","authors":"M Darshan Teja, G Mokesh Rayalu","doi":"10.1186/s12889-025-23318-7","DOIUrl":null,"url":null,"abstract":"<p><p>Cardiovascular disease (CVD) is a primary cause of death in India, accounting for a significant portion of the global CVD burden. This study looks at statistics on heart disease mortality from the Institute for Health Metrics and Evaluation (IHME) from 1990 to 2021, divided into five age groups: 0-5, 6-15, 16-49, 50-69, and 70 + . We used both classic ARIMA and hybrid models that combined ARIMA with machine learning techniques such as Random Forest, Support Vector Machine (SVM), XGBoost, and GARCH to anticipate mortality trends. Model performance was assessed using the Root Mean Square Error (RMSE) and Mean Absolute Percentage Error (MAPE). Across several age groups, the ARIMA + SVM model outperformed standalone ARIMA in terms of accuracy, with RMSE improvements of up to 15.6%. The 70 + population has the greatest mortality rates, highlighting the urgent need for focused healthcare treatments. These hybrid models are valuable tools for healthcare legislators in developing preventative programs, allocating resources effectively, and prioritizing treatment for high-risk age groups, especially the elderly, since they improve forecasting accuracy and offer interpretive insights. Given India's growing cardiovascular disease load, our results highlight how predictive analytics may support data-driven public health planning.</p>","PeriodicalId":9039,"journal":{"name":"BMC Public Health","volume":"25 1","pages":"2150"},"PeriodicalIF":3.6000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150534/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Public Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12889-025-23318-7","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Cardiovascular disease (CVD) is a primary cause of death in India, accounting for a significant portion of the global CVD burden. This study looks at statistics on heart disease mortality from the Institute for Health Metrics and Evaluation (IHME) from 1990 to 2021, divided into five age groups: 0-5, 6-15, 16-49, 50-69, and 70 + . We used both classic ARIMA and hybrid models that combined ARIMA with machine learning techniques such as Random Forest, Support Vector Machine (SVM), XGBoost, and GARCH to anticipate mortality trends. Model performance was assessed using the Root Mean Square Error (RMSE) and Mean Absolute Percentage Error (MAPE). Across several age groups, the ARIMA + SVM model outperformed standalone ARIMA in terms of accuracy, with RMSE improvements of up to 15.6%. The 70 + population has the greatest mortality rates, highlighting the urgent need for focused healthcare treatments. These hybrid models are valuable tools for healthcare legislators in developing preventative programs, allocating resources effectively, and prioritizing treatment for high-risk age groups, especially the elderly, since they improve forecasting accuracy and offer interpretive insights. Given India's growing cardiovascular disease load, our results highlight how predictive analytics may support data-driven public health planning.
期刊介绍:
BMC Public Health is an open access, peer-reviewed journal that considers articles on the epidemiology of disease and the understanding of all aspects of public health. The journal has a special focus on the social determinants of health, the environmental, behavioral, and occupational correlates of health and disease, and the impact of health policies, practices and interventions on the community.