Bradley R Lewis, Dipankar Bandyopadhyay, Stacia M DeSantis, Mike T John
{"title":"Augmenting beta regression for periodontal proportion data via the SAS NLMIXED procedure.","authors":"Bradley R Lewis, Dipankar Bandyopadhyay, Stacia M DeSantis, Mike T John","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Often in clinical dental research, clinical attachment level (CAL) is recorded at several sites throughout the mouth to assess the extent of periodontal disease (PD). One might be interested to quantify PD at the tooth-level via the proportion of diseased sites per tooth type (say, incisors, canines, pre-molars and molars) per subject. However, these studies might consist of relatively disease-free and highly diseased subjects leading to the proportion responses distributed in the interval [0, 1]. While beta regression (BR) is often the model of choice to assess covariate effects for proportion data, the presence (and/or abundance) of zeros and/or ones makes it inapplicable here because the beta support is defined in the interval (0, 1). Avoiding ad hoc data transformation, we explore the potential of the augmented BR framework which augments the beta density with non-zero masses at zero and one while accounting for the clustering induced. Our classical estimation framework using maximum likelihood utilizes the potential of the SAS® Proc NLMIXED procedure. We explore our methodology via simulation studies and application to a real cross-sectional dataset on PD, and we assess the gain in model fit and parameter estimation over other ad hoc alternatives. This reveals newer insights into risk quantification on clustered proportion responses. Our methods can be implemented using standard SAS software routines. The augmented BR model results in a better fit to clustered periodontal proportion data over the standard beta model. We recommend using it as a parametric alternative for fitting proportion data, and avoid ad hoc data transformation.</p>","PeriodicalId":38394,"journal":{"name":"Journal of Applied Probability and Statistics","volume":"12 1","pages":"49-66"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6191203/pdf/nihms882113.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Probability and Statistics","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0
Abstract
Often in clinical dental research, clinical attachment level (CAL) is recorded at several sites throughout the mouth to assess the extent of periodontal disease (PD). One might be interested to quantify PD at the tooth-level via the proportion of diseased sites per tooth type (say, incisors, canines, pre-molars and molars) per subject. However, these studies might consist of relatively disease-free and highly diseased subjects leading to the proportion responses distributed in the interval [0, 1]. While beta regression (BR) is often the model of choice to assess covariate effects for proportion data, the presence (and/or abundance) of zeros and/or ones makes it inapplicable here because the beta support is defined in the interval (0, 1). Avoiding ad hoc data transformation, we explore the potential of the augmented BR framework which augments the beta density with non-zero masses at zero and one while accounting for the clustering induced. Our classical estimation framework using maximum likelihood utilizes the potential of the SAS® Proc NLMIXED procedure. We explore our methodology via simulation studies and application to a real cross-sectional dataset on PD, and we assess the gain in model fit and parameter estimation over other ad hoc alternatives. This reveals newer insights into risk quantification on clustered proportion responses. Our methods can be implemented using standard SAS software routines. The augmented BR model results in a better fit to clustered periodontal proportion data over the standard beta model. We recommend using it as a parametric alternative for fitting proportion data, and avoid ad hoc data transformation.