Tianlong Yang , Xunbo Du , Junfan Li , Tin Zhang , Yao Wang , Liang Wang
{"title":"中国成都猩红热传播动态和社会经济决定因素建模:综合SEIAR和机器学习方法","authors":"Tianlong Yang , Xunbo Du , Junfan Li , Tin Zhang , Yao Wang , Liang Wang","doi":"10.1016/j.epidem.2025.100844","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Scarlet fever (SF) is an acute infectious disease that poses a significant public health threat; however, its transmission dynamics, particularly the impact of asymptomatic carriers and socioeconomic determinants, remain unclear.</div></div><div><h3>Methods</h3><div>We developed a susceptible–exposed–infectious–asymptomatic–recovered (SEIAR) model that incorporates asymptomatic infections to estimate the time-varying reproduction number (<em>R</em><sub><em>t</em></sub>) for SF in Chengdu (2005–2019) using local epidemiological data. The model was evaluated using the coefficient of determination (<em>R</em>²), and sensitivity analysis confirmed its robustness. We further integrated Boruta, Extreme Gradient Boosting (XGBoost), and SHapley Additive exPlanations (SHAP) to systematically assess the influence of socioeconomic variables on <em>R</em><sub><em>t</em></sub>.</div></div><div><h3>Results</h3><div>Between 2005 and 2019, Chengdu reported 11,499 cases of SF, with an average incidence of 4.87 per 100,000. Two distinct seasonal peaks occurred in April–May and November–December, and incidence rates were notably lower during school holidays. The majority of cases affected children aged 3–7, with a male-to-female ratio of 1.59:1. In addition, core districts such as Wuhou and Xindu exhibited the highest incidence. The SEIAR model demonstrated strong predictive performance (overall <em>R</em>² = 0.831, <em>P</em> < 0.001) and estimated a median <em>R</em><sub><em>t</em></sub> of 0.963; however, several regions exceeded this threshold, with <em>R</em><sub><em>t</em></sub> peaking approximately two months prior to incidence surges. Spatial analyses revealed significant clustering in central urban areas, and integrated socioeconomic analysis identified the one-child rate as the primary driver of <em>R</em><sub><em>t</em></sub>, followed by population density and healthcare facility density (<em>P</em> < 0.01).</div></div><div><h3>Conclusion</h3><div>By integrating epidemiological data with socioeconomic factors, this study quantitatively elucidates the transmission characteristics of SF in Chengdu, providing data-driven support for monitoring and targeted intervention strategies in the absence of vaccination.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100844"},"PeriodicalIF":3.0000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Modeling transmission dynamics and socio-economic determinants of scarlet fever in Chengdu, China: An integrated SEIAR and machine learning approach\",\"authors\":\"Tianlong Yang , Xunbo Du , Junfan Li , Tin Zhang , Yao Wang , Liang Wang\",\"doi\":\"10.1016/j.epidem.2025.100844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Scarlet fever (SF) is an acute infectious disease that poses a significant public health threat; however, its transmission dynamics, particularly the impact of asymptomatic carriers and socioeconomic determinants, remain unclear.</div></div><div><h3>Methods</h3><div>We developed a susceptible–exposed–infectious–asymptomatic–recovered (SEIAR) model that incorporates asymptomatic infections to estimate the time-varying reproduction number (<em>R</em><sub><em>t</em></sub>) for SF in Chengdu (2005–2019) using local epidemiological data. The model was evaluated using the coefficient of determination (<em>R</em>²), and sensitivity analysis confirmed its robustness. We further integrated Boruta, Extreme Gradient Boosting (XGBoost), and SHapley Additive exPlanations (SHAP) to systematically assess the influence of socioeconomic variables on <em>R</em><sub><em>t</em></sub>.</div></div><div><h3>Results</h3><div>Between 2005 and 2019, Chengdu reported 11,499 cases of SF, with an average incidence of 4.87 per 100,000. Two distinct seasonal peaks occurred in April–May and November–December, and incidence rates were notably lower during school holidays. The majority of cases affected children aged 3–7, with a male-to-female ratio of 1.59:1. In addition, core districts such as Wuhou and Xindu exhibited the highest incidence. The SEIAR model demonstrated strong predictive performance (overall <em>R</em>² = 0.831, <em>P</em> < 0.001) and estimated a median <em>R</em><sub><em>t</em></sub> of 0.963; however, several regions exceeded this threshold, with <em>R</em><sub><em>t</em></sub> peaking approximately two months prior to incidence surges. Spatial analyses revealed significant clustering in central urban areas, and integrated socioeconomic analysis identified the one-child rate as the primary driver of <em>R</em><sub><em>t</em></sub>, followed by population density and healthcare facility density (<em>P</em> < 0.01).</div></div><div><h3>Conclusion</h3><div>By integrating epidemiological data with socioeconomic factors, this study quantitatively elucidates the transmission characteristics of SF in Chengdu, providing data-driven support for monitoring and targeted intervention strategies in the absence of vaccination.</div></div>\",\"PeriodicalId\":49206,\"journal\":{\"name\":\"Epidemics\",\"volume\":\"52 \",\"pages\":\"Article 100844\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Epidemics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1755436525000325\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INFECTIOUS DISEASES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epidemics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1755436525000325","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
Modeling transmission dynamics and socio-economic determinants of scarlet fever in Chengdu, China: An integrated SEIAR and machine learning approach
Background
Scarlet fever (SF) is an acute infectious disease that poses a significant public health threat; however, its transmission dynamics, particularly the impact of asymptomatic carriers and socioeconomic determinants, remain unclear.
Methods
We developed a susceptible–exposed–infectious–asymptomatic–recovered (SEIAR) model that incorporates asymptomatic infections to estimate the time-varying reproduction number (Rt) for SF in Chengdu (2005–2019) using local epidemiological data. The model was evaluated using the coefficient of determination (R²), and sensitivity analysis confirmed its robustness. We further integrated Boruta, Extreme Gradient Boosting (XGBoost), and SHapley Additive exPlanations (SHAP) to systematically assess the influence of socioeconomic variables on Rt.
Results
Between 2005 and 2019, Chengdu reported 11,499 cases of SF, with an average incidence of 4.87 per 100,000. Two distinct seasonal peaks occurred in April–May and November–December, and incidence rates were notably lower during school holidays. The majority of cases affected children aged 3–7, with a male-to-female ratio of 1.59:1. In addition, core districts such as Wuhou and Xindu exhibited the highest incidence. The SEIAR model demonstrated strong predictive performance (overall R² = 0.831, P < 0.001) and estimated a median Rt of 0.963; however, several regions exceeded this threshold, with Rt peaking approximately two months prior to incidence surges. Spatial analyses revealed significant clustering in central urban areas, and integrated socioeconomic analysis identified the one-child rate as the primary driver of Rt, followed by population density and healthcare facility density (P < 0.01).
Conclusion
By integrating epidemiological data with socioeconomic factors, this study quantitatively elucidates the transmission characteristics of SF in Chengdu, providing data-driven support for monitoring and targeted intervention strategies in the absence of vaccination.
期刊介绍:
Epidemics publishes papers on infectious disease dynamics in the broadest sense. Its scope covers both within-host dynamics of infectious agents and dynamics at the population level, particularly the interaction between the two. Areas of emphasis include: spread, transmission, persistence, implications and population dynamics of infectious diseases; population and public health as well as policy aspects of control and prevention; dynamics at the individual level; interaction with the environment, ecology and evolution of infectious diseases, as well as population genetics of infectious agents.