{"title":"Smoothing spline for the AUC estimate: Simulation studies in Gaussian data","authors":"K. Promjiraprawat, W. Wongseree","doi":"10.1109/MITICON.2016.8025224","DOIUrl":null,"url":null,"abstract":"Receiver operating characteristic (ROC) curve has been employed in classification problems along with the area under the curve (AUC) as the performance indicator of classifiers. Both parametric and non-parametric methods have been widely used to estimate the ROC curve as well as the AUC. In this study, a smoothing spline is proposed in order to provide an alternative of the ROC curve and AUC estimate. A logistic regression is selected as a base classifier for simulation cases of Gaussian and mixture of Gaussian data. The smoothing spline, bi-normal model and empirical method are compared in terms of root mean square error (RMSE) from the true ROC curve and the bias from the true AUC. The results indicate that the ROC curve and its AUC obtained from smoothing spline can provide a trade-off between the parametric bi-normal model and non-parametric empirical method, with 1.4% of bias and 7.75 of RMSE, on average for a dichotomous classification.","PeriodicalId":127868,"journal":{"name":"2016 Management and Innovation Technology International Conference (MITicon)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Management and Innovation Technology International Conference (MITicon)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MITICON.2016.8025224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Receiver operating characteristic (ROC) curve has been employed in classification problems along with the area under the curve (AUC) as the performance indicator of classifiers. Both parametric and non-parametric methods have been widely used to estimate the ROC curve as well as the AUC. In this study, a smoothing spline is proposed in order to provide an alternative of the ROC curve and AUC estimate. A logistic regression is selected as a base classifier for simulation cases of Gaussian and mixture of Gaussian data. The smoothing spline, bi-normal model and empirical method are compared in terms of root mean square error (RMSE) from the true ROC curve and the bias from the true AUC. The results indicate that the ROC curve and its AUC obtained from smoothing spline can provide a trade-off between the parametric bi-normal model and non-parametric empirical method, with 1.4% of bias and 7.75 of RMSE, on average for a dichotomous classification.
在分类问题中采用了受试者工作特征曲线(Receiver operating characteristic, ROC),曲线下面积(area under the curve, AUC)作为分类器的性能指标。参数和非参数方法都被广泛用于估计ROC曲线和AUC。在本研究中,为了提供ROC曲线和AUC估计的替代方法,提出了平滑样条。对于高斯和混合高斯数据的模拟情况,选择逻辑回归作为基分类器。比较了平滑样条、双正态模型和经验方法与真实ROC曲线的均方根误差(RMSE)和真实AUC的偏差。结果表明,由平滑样条获得的ROC曲线及其AUC可以在参数双正态模型和非参数经验方法之间进行权衡,对于二分类,平均偏差为1.4%,RMSE为7.75。