Berkay Akturk, Ufuk Beyaztas, Han Lin Shang, Abhijit Mandal
{"title":"Robust functional logistic regression","authors":"Berkay Akturk, Ufuk Beyaztas, Han Lin Shang, Abhijit Mandal","doi":"10.1007/s11634-023-00577-z","DOIUrl":null,"url":null,"abstract":"<p>Functional logistic regression is a popular model to capture a linear relationship between binary response and functional predictor variables. However, many methods used for parameter estimation in functional logistic regression are sensitive to outliers, which may lead to inaccurate parameter estimates and inferior classification accuracy. We propose a robust estimation procedure for functional logistic regression, in which the observations of the functional predictor are projected onto a set of finite-dimensional subspaces via robust functional principal component analysis. This dimension-reduction step reduces the outlying effects in the functional predictor. The logistic regression coefficient is estimated using an M-type estimator based on binary response and robust principal component scores. In doing so, we provide robust estimates by minimizing the effects of outliers in the binary response and functional predictor variables. Via a series of Monte-Carlo simulations and using hand radiograph data, we examine the parameter estimation and classification accuracy for the response variable. We find that the robust procedure outperforms some existing robust and non-robust methods when outliers are present, while producing competitive results when outliers are absent. In addition, the proposed method is computationally more efficient than some existing robust alternatives.\n</p>","PeriodicalId":49270,"journal":{"name":"Advances in Data Analysis and Classification","volume":"2018 1","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Data Analysis and Classification","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11634-023-00577-z","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
Functional logistic regression is a popular model to capture a linear relationship between binary response and functional predictor variables. However, many methods used for parameter estimation in functional logistic regression are sensitive to outliers, which may lead to inaccurate parameter estimates and inferior classification accuracy. We propose a robust estimation procedure for functional logistic regression, in which the observations of the functional predictor are projected onto a set of finite-dimensional subspaces via robust functional principal component analysis. This dimension-reduction step reduces the outlying effects in the functional predictor. The logistic regression coefficient is estimated using an M-type estimator based on binary response and robust principal component scores. In doing so, we provide robust estimates by minimizing the effects of outliers in the binary response and functional predictor variables. Via a series of Monte-Carlo simulations and using hand radiograph data, we examine the parameter estimation and classification accuracy for the response variable. We find that the robust procedure outperforms some existing robust and non-robust methods when outliers are present, while producing competitive results when outliers are absent. In addition, the proposed method is computationally more efficient than some existing robust alternatives.
功能逻辑回归是一种常用的模型,用于捕捉二元响应与功能预测变量之间的线性关系。然而,用于函数逻辑回归参数估计的许多方法对异常值都很敏感,这可能导致参数估计不准确和分类准确性降低。我们提出了一种稳健的函数逻辑回归估计程序,通过稳健的函数主成分分析,将函数预测变量的观测值投影到一组有限维子空间上。这一降维步骤减少了功能预测因子中的离群效应。使用基于二元响应和稳健主成分得分的 M 型估计器来估计逻辑回归系数。在此过程中,我们将二元响应和功能预测变量中离群值的影响降至最低,从而提供稳健的估计值。通过一系列蒙特卡罗模拟并使用手部 X 射线照片数据,我们检验了响应变量的参数估计和分类准确性。我们发现,当出现异常值时,稳健程序优于一些现有的稳健和非稳健方法,而当没有异常值时,稳健程序也能产生有竞争力的结果。此外,与现有的一些稳健替代方法相比,所提出的方法在计算上更加高效。
期刊介绍:
The international journal Advances in Data Analysis and Classification (ADAC) is designed as a forum for high standard publications on research and applications concerning the extraction of knowable aspects from many types of data. It publishes articles on such topics as structural, quantitative, or statistical approaches for the analysis of data; advances in classification, clustering, and pattern recognition methods; strategies for modeling complex data and mining large data sets; methods for the extraction of knowledge from data, and applications of advanced methods in specific domains of practice. Articles illustrate how new domain-specific knowledge can be made available from data by skillful use of data analysis methods. The journal also publishes survey papers that outline, and illuminate the basic ideas and techniques of special approaches.