{"title":"儿童喘息危险因素的两阶段人工神经网络回归模型——以肯尼亚Gatundu医院为例","authors":"Thomas Mageto, J. Onyango","doi":"10.12691/AJAMS-9-2-1","DOIUrl":null,"url":null,"abstract":"In Kenya wheezing that leads to asthma development in most cases remain under-diagnosed and under-treated. Currently there is no public supported wheezing and asthma care programmes to optimize care for patients with asthma which greatly compounds diagnosis and treatment of the disease. The aim of this study is therefore to consider and analyse the covariates of childhood wheezing among children below 10 years of age in Kenya, a case study of Gatundu hospital in order to improve the provision of wheezing and asthma care services in medical facilities. The possible risk factors in the study are selected from three major groups of demographic, socioeconomic and geographical location factors related to childhood wheezing. The longitudinal secondary data obtained from Gatundu hospital in Kenya were collected and a total of 584 complete cases were recorded. The predictor variables considered in the study include age of children in months, gender, exclusive breastfeeding, exposure to tobacco smoking, difficult living conditions, residence, atopy, maternal age and preterm births. Due to the binary nature of response variable in which data is recorded as presence or absence of wheezing, the risk factors were modelled using multiple logistic regression and Artificial Neural Network Models. Simple random samples of sizes n = 385 without replacement were selected and p-values at 5% level of significance for the variables were recorded. In multiple logistic regression, the five variables identified as possible risk factors for modelling with p-value less than or equal to 0.05 were selected that includes age of children, exclusive breastfeeding, exposure to tobacco smoking, difficult living conditions and residence that recorded p-values of 0.0151, 0.0000, 0.0071, 0.0274 and 0.0410. The best multiple logistic linear regression model selected was based on Akaike Information Criterion (AIC) criterion that recorded null deviance, residual deviance and AIC of 502.44, 179.57 and 191.57 respectively. The precision and accuracy of the multiple logistic regression model were recorded as 89.2% and 93.3% respectively. The Artificial Neural Network was considered for modelling as well, the model with one-hidden layer with four neurons in the hidden layer recorded precision of 97.1% and accuracy of 39.4% while the rest of the models with one hidden layer recorded precision and accuracy of 0.0% and 65.1% respectively. The Artificial Neural Network model with two-hidden layers were also considered and the Network with one neuron in both layers was selected as better performing model with precision and accuracy of 88.2% and 93.3%. The developed two-stage logistic Artificial Neural Network was found to have better performance compared to multiple linear logistic regression and Artificial Neural Networks since it recorded precision and accuracy of 97.1% and 99.0% respectively and hence recommended for consideration in modelling the risk factor of wheezing among children in Kenya.","PeriodicalId":91196,"journal":{"name":"American journal of applied mathematics and statistics","volume":"9 1","pages":"38-47"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Two-Stage Artificial Neural Network Regression Modelling for Wheezing Risk Factors Among Children - A Case Study of Gatundu Hospital, Kenya\",\"authors\":\"Thomas Mageto, J. Onyango\",\"doi\":\"10.12691/AJAMS-9-2-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In Kenya wheezing that leads to asthma development in most cases remain under-diagnosed and under-treated. Currently there is no public supported wheezing and asthma care programmes to optimize care for patients with asthma which greatly compounds diagnosis and treatment of the disease. The aim of this study is therefore to consider and analyse the covariates of childhood wheezing among children below 10 years of age in Kenya, a case study of Gatundu hospital in order to improve the provision of wheezing and asthma care services in medical facilities. The possible risk factors in the study are selected from three major groups of demographic, socioeconomic and geographical location factors related to childhood wheezing. The longitudinal secondary data obtained from Gatundu hospital in Kenya were collected and a total of 584 complete cases were recorded. The predictor variables considered in the study include age of children in months, gender, exclusive breastfeeding, exposure to tobacco smoking, difficult living conditions, residence, atopy, maternal age and preterm births. Due to the binary nature of response variable in which data is recorded as presence or absence of wheezing, the risk factors were modelled using multiple logistic regression and Artificial Neural Network Models. Simple random samples of sizes n = 385 without replacement were selected and p-values at 5% level of significance for the variables were recorded. In multiple logistic regression, the five variables identified as possible risk factors for modelling with p-value less than or equal to 0.05 were selected that includes age of children, exclusive breastfeeding, exposure to tobacco smoking, difficult living conditions and residence that recorded p-values of 0.0151, 0.0000, 0.0071, 0.0274 and 0.0410. The best multiple logistic linear regression model selected was based on Akaike Information Criterion (AIC) criterion that recorded null deviance, residual deviance and AIC of 502.44, 179.57 and 191.57 respectively. The precision and accuracy of the multiple logistic regression model were recorded as 89.2% and 93.3% respectively. The Artificial Neural Network was considered for modelling as well, the model with one-hidden layer with four neurons in the hidden layer recorded precision of 97.1% and accuracy of 39.4% while the rest of the models with one hidden layer recorded precision and accuracy of 0.0% and 65.1% respectively. The Artificial Neural Network model with two-hidden layers were also considered and the Network with one neuron in both layers was selected as better performing model with precision and accuracy of 88.2% and 93.3%. The developed two-stage logistic Artificial Neural Network was found to have better performance compared to multiple linear logistic regression and Artificial Neural Networks since it recorded precision and accuracy of 97.1% and 99.0% respectively and hence recommended for consideration in modelling the risk factor of wheezing among children in Kenya.\",\"PeriodicalId\":91196,\"journal\":{\"name\":\"American journal of applied mathematics and statistics\",\"volume\":\"9 1\",\"pages\":\"38-47\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American journal of applied mathematics and statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12691/AJAMS-9-2-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of applied mathematics and statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12691/AJAMS-9-2-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在肯尼亚,大多数情况下导致哮喘发展的喘息仍未得到充分诊断和治疗。目前,没有公众支持的喘息和哮喘护理规划来优化对哮喘患者的护理,这大大复杂化了该病的诊断和治疗。因此,本研究的目的是考虑和分析肯尼亚10岁以下儿童喘息的协变量,以Gatundu医院为例进行研究,以改善医疗设施中喘息和哮喘护理服务的提供。研究中可能的危险因素是从与儿童喘息有关的人口、社会经济和地理位置因素三大类中选择的。收集了从肯尼亚Gatundu医院获得的纵向二级数据,共记录了584例完整病例。研究中考虑的预测变量包括儿童的月龄、性别、纯母乳喂养、吸烟、困难的生活条件、住所、特应性反应、母亲年龄和早产。由于响应变量的二元性质,其中数据记录为存在或不存在喘息,因此使用多元逻辑回归和人工神经网络模型对风险因素进行建模。选择大小为n = 385的无替换的简单随机样本,记录变量在5%显著性水平下的p值。在多元logistic回归中,选取了5个p值小于或等于0.05的可能危险因素,包括儿童年龄、纯母乳喂养、吸烟暴露、生活条件困难和居住地,p值分别为0.0151、0.0000、0.0071、0.0274和0.0410。选择的最佳多元logistic线性回归模型以赤池信息准则(Akaike Information Criterion, AIC)为准则,零偏差、残差和AIC分别为502.44、179.57和191.57。多元logistic回归模型的精密度和准确度分别为89.2%和93.3%。采用人工神经网络进行建模,单隐层4个神经元的模型的精度为97.1%,准确率为39.4%,其余单隐层模型的精度为0.0%,准确率为65.1%。考虑了两层隐含层的人工神经网络模型,选择了两层都有一个神经元的网络作为表现较好的模型,其精度和准确度分别为88.2%和93.3%。研究发现,与多元线性逻辑回归和人工神经网络相比,开发的两阶段逻辑人工神经网络具有更好的性能,因为它分别记录了97.1%和99.0%的精度和准确性,因此建议在肯尼亚儿童喘息风险因素建模时考虑。
Two-Stage Artificial Neural Network Regression Modelling for Wheezing Risk Factors Among Children - A Case Study of Gatundu Hospital, Kenya
In Kenya wheezing that leads to asthma development in most cases remain under-diagnosed and under-treated. Currently there is no public supported wheezing and asthma care programmes to optimize care for patients with asthma which greatly compounds diagnosis and treatment of the disease. The aim of this study is therefore to consider and analyse the covariates of childhood wheezing among children below 10 years of age in Kenya, a case study of Gatundu hospital in order to improve the provision of wheezing and asthma care services in medical facilities. The possible risk factors in the study are selected from three major groups of demographic, socioeconomic and geographical location factors related to childhood wheezing. The longitudinal secondary data obtained from Gatundu hospital in Kenya were collected and a total of 584 complete cases were recorded. The predictor variables considered in the study include age of children in months, gender, exclusive breastfeeding, exposure to tobacco smoking, difficult living conditions, residence, atopy, maternal age and preterm births. Due to the binary nature of response variable in which data is recorded as presence or absence of wheezing, the risk factors were modelled using multiple logistic regression and Artificial Neural Network Models. Simple random samples of sizes n = 385 without replacement were selected and p-values at 5% level of significance for the variables were recorded. In multiple logistic regression, the five variables identified as possible risk factors for modelling with p-value less than or equal to 0.05 were selected that includes age of children, exclusive breastfeeding, exposure to tobacco smoking, difficult living conditions and residence that recorded p-values of 0.0151, 0.0000, 0.0071, 0.0274 and 0.0410. The best multiple logistic linear regression model selected was based on Akaike Information Criterion (AIC) criterion that recorded null deviance, residual deviance and AIC of 502.44, 179.57 and 191.57 respectively. The precision and accuracy of the multiple logistic regression model were recorded as 89.2% and 93.3% respectively. The Artificial Neural Network was considered for modelling as well, the model with one-hidden layer with four neurons in the hidden layer recorded precision of 97.1% and accuracy of 39.4% while the rest of the models with one hidden layer recorded precision and accuracy of 0.0% and 65.1% respectively. The Artificial Neural Network model with two-hidden layers were also considered and the Network with one neuron in both layers was selected as better performing model with precision and accuracy of 88.2% and 93.3%. The developed two-stage logistic Artificial Neural Network was found to have better performance compared to multiple linear logistic regression and Artificial Neural Networks since it recorded precision and accuracy of 97.1% and 99.0% respectively and hence recommended for consideration in modelling the risk factor of wheezing among children in Kenya.