PREDICTION CLASSIFICATION AND MODELLING USING DECISION TREE WITH ORDERED REGRESSION AND ITS APPLICATION TO SOCIO-BEHAVIORAL FACTORS ASSOCIATED WITH TOOTHBRUSHING FREQUENCY IN CHILDREN
Hazik Bin Shahzad, Wan Muhammad Amir W Ahmad, Mohamad Nasarudin Adnan, Anas Imran Arshad
{"title":"PREDICTION CLASSIFICATION AND MODELLING USING DECISION TREE WITH ORDERED REGRESSION AND ITS APPLICATION TO SOCIO-BEHAVIORAL FACTORS ASSOCIATED WITH TOOTHBRUSHING FREQUENCY IN CHILDREN","authors":"Hazik Bin Shahzad, Wan Muhammad Amir W Ahmad, Mohamad Nasarudin Adnan, Anas Imran Arshad","doi":"10.37268/mjphm/vol.23/no.2/art.2053","DOIUrl":null,"url":null,"abstract":"Toothbrushing is considered the best self-care behavior for the prevention of oral diseases. Brushing teeth twice a day is considered the social norm, but the development of such habits is dependent on psychosocial, economic, and environmental factors. Recognizing the significance of statistical modeling in medical sciences, this study will use decision trees and ordinal regression to predict frequency of toothbrushing in children. The methodology will be harmonized in the R syntax. The study illustrated the development of the method using 527 observations from WHO oral health questionnaire for children. Before regression analysis, the clinical relevance and significance of each of the 28 variables will be assessed using decision tree analysis and tested for accuracy. The classification obtained will be used as an input for the ordinal regression modeling. According to decision tree analysis, smoking, maternal education, dietary habits, history of toothache and self-rated tooth health contributed significantly to the children’s overall toothbrushing frequency. These six variables were used as input for ordinal regression analysis and the developed syntax was used to assess the goodness of fit for the model. Our proposed method achieves the highest level of forecasting precision possible. The process is an alternate to ordinal regression modelling as the selection of appropriate variables is based on computational analysis, forecasting the importance of the independent variables chosen for the final model. This process demonstrates the possibility of developing prediction models which can then be used to formulate clinical hypothesis and inform future researchers.","PeriodicalId":38537,"journal":{"name":"Malaysian Journal of Public Health Medicine","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Malaysian Journal of Public Health Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37268/mjphm/vol.23/no.2/art.2053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Toothbrushing is considered the best self-care behavior for the prevention of oral diseases. Brushing teeth twice a day is considered the social norm, but the development of such habits is dependent on psychosocial, economic, and environmental factors. Recognizing the significance of statistical modeling in medical sciences, this study will use decision trees and ordinal regression to predict frequency of toothbrushing in children. The methodology will be harmonized in the R syntax. The study illustrated the development of the method using 527 observations from WHO oral health questionnaire for children. Before regression analysis, the clinical relevance and significance of each of the 28 variables will be assessed using decision tree analysis and tested for accuracy. The classification obtained will be used as an input for the ordinal regression modeling. According to decision tree analysis, smoking, maternal education, dietary habits, history of toothache and self-rated tooth health contributed significantly to the children’s overall toothbrushing frequency. These six variables were used as input for ordinal regression analysis and the developed syntax was used to assess the goodness of fit for the model. Our proposed method achieves the highest level of forecasting precision possible. The process is an alternate to ordinal regression modelling as the selection of appropriate variables is based on computational analysis, forecasting the importance of the independent variables chosen for the final model. This process demonstrates the possibility of developing prediction models which can then be used to formulate clinical hypothesis and inform future researchers.
期刊介绍:
Malaysian Journal of Public Health Medicine (MJPHM) is the official Journal of Malaysian Public Health Physicians’ Association. This is an Open-Access and peer-reviewed Journal founded in 2001 with the main objective of providing a platform for publication of scientific articles in the areas of public health medicine. . The Journal is published in two volumes per year. Contributors are welcome to send their articles in all sub-discipline of public health including epidemiology, biostatistics, nutrition, family health, infectious diseases, health services research, gerontology, child health, adolescent health, behavioral medicine, rural health, chronic diseases, health promotion, public health policy and management, health economics, occupational health and environmental health.