{"title":"直接使用共轭梯度的逻辑回归的有效优化","authors":"Kenji Watanabe, Takumi Kobayashi, N. Otsu","doi":"10.1109/ICMLA.2011.63","DOIUrl":null,"url":null,"abstract":"In classification problems, logistic regression (LR) is used to estimate posterior probabilities. The objective function of LR is usually minimized by Newton-Raphson method such as using iterative reweighted least squares (IRLS). There, the inverse Hessian matrix must be calculated in each iteration step. Thus, a computational cost in the optimization of LR significantly increases as input data becomes large. To reduce the computational cost, we propose a novel optimization method of LR by directly using the non-linear conjugate gradient (CG) method. The proposed method iteratively minimizes the objective function of LR without calculation of the Hessian matrix. Furthermore, to reduce the number of iteration efficiently, the step size in the non-linear CG iteration is optimized avoiding ad hock line search, and initial values are set by ordinary linear regression analysis. In the experimental results, our method performs about 200 times faster than the other methods for a large scale dataset.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Efficient Optimization of Logistic Regression by Direct Use of Conjugate Gradient\",\"authors\":\"Kenji Watanabe, Takumi Kobayashi, N. Otsu\",\"doi\":\"10.1109/ICMLA.2011.63\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In classification problems, logistic regression (LR) is used to estimate posterior probabilities. The objective function of LR is usually minimized by Newton-Raphson method such as using iterative reweighted least squares (IRLS). There, the inverse Hessian matrix must be calculated in each iteration step. Thus, a computational cost in the optimization of LR significantly increases as input data becomes large. To reduce the computational cost, we propose a novel optimization method of LR by directly using the non-linear conjugate gradient (CG) method. The proposed method iteratively minimizes the objective function of LR without calculation of the Hessian matrix. Furthermore, to reduce the number of iteration efficiently, the step size in the non-linear CG iteration is optimized avoiding ad hock line search, and initial values are set by ordinary linear regression analysis. 
In the experimental results, our method performs about 200 times faster than the other methods for a large scale dataset.\",\"PeriodicalId\":439926,\"journal\":{\"name\":\"2011 10th International Conference on Machine Learning and Applications and Workshops\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 10th International Conference on Machine Learning and Applications and Workshops\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2011.63\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 10th International Conference on Machine Learning and Applications and Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2011.63","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Efficient Optimization of Logistic Regression by Direct Use of Conjugate Gradient
In classification problems, logistic regression (LR) is used to estimate posterior probabilities. The objective function of LR is usually minimized by the Newton-Raphson method, e.g., via iteratively reweighted least squares (IRLS). In that case, the inverse of the Hessian matrix must be computed at each iteration, so the computational cost of optimizing LR increases significantly as the input data become large. To reduce this cost, we propose a novel optimization method for LR that directly uses the non-linear conjugate gradient (CG) method. The proposed method iteratively minimizes the LR objective function without computing the Hessian matrix. Furthermore, to reduce the number of iterations, the step size in the non-linear CG iteration is optimized analytically, avoiding an ad hoc line search, and the initial values are set by ordinary linear regression. In our experiments, the proposed method runs about 200 times faster than the other methods on a large-scale dataset.
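For reference, the baseline the abstract contrasts against can be written in the standard IRLS form (a textbook formulation, not taken verbatim from this paper). For an n x d design matrix X, labels y in {0,1}^n, and predicted probabilities p = sigma(X w), the Newton-Raphson update is

\[
w^{(t+1)} = w^{(t)} - H^{-1} g, \qquad
g = X^{\top}(p - y), \quad
H = X^{\top} R X, \quad
R = \operatorname{diag}\bigl(p_i (1 - p_i)\bigr).
\]

Forming H costs O(n d^2) and solving the linear system costs O(d^3) per iteration, whereas a gradient evaluation alone costs O(n d); this gap is what a direct CG approach can exploit on large data.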
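The abstract does not give the exact step-size rule or direction update, so the following is only a minimal sketch of the idea: a Polak-Ribiere (PR+) non-linear CG loop, a closed-form step size obtained from one Newton step along the search direction (this needs only the scalar directional curvature d^T H d, never the full Hessian or its inverse), and least-squares initialization as the abstract suggests. All names here (fit_lr_cg, etc.) are our own, and the paper's actual rules may differ.

```python
import numpy as np

def sigmoid(z):
    # Logistic function; clipping keeps np.exp from overflowing.
    return 1.0 / (1.0 + np.exp(-np.clip(z, -500.0, 500.0)))

def fit_lr_cg(X, y, n_iter=200, tol=1e-6):
    """Minimize the LR negative log-likelihood with non-linear CG.

    X: (n, d) design matrix (append a column of ones for a bias term).
    y: (n,) labels in {0, 1}.
    """
    # Initialize by ordinary linear regression, as the abstract suggests
    # (one simple choice: regress the 0/1 labels directly onto X).
    w, *_ = np.linalg.lstsq(X, y, rcond=None)

    g = X.T @ (sigmoid(X @ w) - y)  # gradient of the objective
    d = -g                          # initial search direction
    for _ in range(n_iter):
        # Closed-form step size: one Newton step on phi(a) = E(w + a d).
        # phi''(0) = d^T H d is a single scalar, computed in O(n d)
        # without ever forming the Hessian.
        p = sigmoid(X @ w)
        Xd = X @ d
        curvature = np.sum(p * (1.0 - p) * Xd * Xd)
        if curvature <= 1e-12:
            break
        alpha = -(g @ d) / curvature

        w = w + alpha * d
        g_new = X.T @ (sigmoid(X @ w) - y)
        if np.linalg.norm(g_new) < tol:
            break
        # Polak-Ribiere update, clipped at zero so the method restarts
        # with steepest descent when beta would go negative.
        beta = max(0.0, g_new @ (g_new - g) / (g @ g))
        d = -g_new + beta * d
        g = g_new
    return w

if __name__ == "__main__":
    # Quick check on synthetic data.
    rng = np.random.default_rng(0)
    X = np.hstack([rng.normal(size=(1000, 5)), np.ones((1000, 1))])
    true_w = rng.normal(size=6)
    y = (sigmoid(X @ true_w) > rng.uniform(size=1000)).astype(float)
    w_hat = fit_lr_cg(X, y)
    acc = np.mean((sigmoid(X @ w_hat) > 0.5) == (y > 0.5))
    print(f"training accuracy: {acc:.3f}")
```

Note the design point this sketch illustrates: the directional curvature d^T H d = sum_i p_i (1 - p_i) (x_i^T d)^2 makes each CG step cost the same order as a gradient evaluation, O(n d), which is consistent with the abstract's claim of avoiding the per-iteration Hessian inversion of IRLS.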