Soft regression trees: A model variant and a decomposition training algorithm

Impact Factor: 6.0 · CAS Zone 2 (Management Science) · JCR Q1 (Operations Research & Management Science)
Antonio Consolo, Edoardo Amaldi, Andrea Manno
{"title":"Soft regression trees: A model variant and a decomposition training algorithm","authors":"Antonio Consolo ,&nbsp;Edoardo Amaldi ,&nbsp;Andrea Manno","doi":"10.1016/j.ejor.2025.08.050","DOIUrl":null,"url":null,"abstract":"<div><div>Decision trees are widely used for classification and regression tasks in a variety of application fields due to their interpretability and good accuracy. During the past decade, growing attention has been devoted to globally optimized decision trees with deterministic or soft splitting rules at branch nodes, which are trained by optimizing the error function over all the tree parameters. In this work, we propose a new variant of soft multivariate regression trees (SRTs) where, for every input vector, the prediction is defined as the linear regression associated to a single leaf node, namely, the leaf node obtained by routing the input vector from the root along the branches with higher probability. SRTs exhibit the conditional computational property, i.e., each prediction depends on a small number of nodes (parameters), and our nonlinear optimization formulation for training them is amenable to decomposition. After showing a universal approximation result for SRTs, we present a decomposition training algorithm including a clustering-based initialization procedure and a heuristic for rerouting the input vectors along the tree. Under mild assumptions, we establish asymptotic convergence guarantees. Experiments on 15 well-known datasets indicate that our SRTs and decomposition algorithm yield higher accuracy and robustness compared with traditional soft regression trees trained using the nonlinear optimization formulation of Blanquero et al. (2021), and a significant reduction in training times as well as a slightly better average accuracy compared with the mixed-integer optimization approach of Bertsimas and Dunn (2019). We also report a comparison with the Random Forest ensemble method.</div></div>","PeriodicalId":55161,"journal":{"name":"European Journal of Operational Research","volume":"328 2","pages":"Pages 607-619"},"PeriodicalIF":6.0000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Operational Research","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0377221725006873","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}
Citations: 0

Abstract

Decision trees are widely used for classification and regression tasks in a variety of application fields due to their interpretability and good accuracy. During the past decade, growing attention has been devoted to globally optimized decision trees with deterministic or soft splitting rules at branch nodes, which are trained by optimizing the error function over all the tree parameters. In this work, we propose a new variant of soft multivariate regression trees (SRTs) where, for every input vector, the prediction is defined as the linear regression associated with a single leaf node, namely, the leaf node obtained by routing the input vector from the root along the branches with the higher probability. SRTs exhibit the conditional computation property, i.e., each prediction depends on a small number of nodes (parameters), and our nonlinear optimization formulation for training them is amenable to decomposition. After showing a universal approximation result for SRTs, we present a decomposition training algorithm that includes a clustering-based initialization procedure and a heuristic for rerouting the input vectors along the tree. Under mild assumptions, we establish asymptotic convergence guarantees. Experiments on 15 well-known datasets indicate that our SRTs and decomposition algorithm yield higher accuracy and robustness than traditional soft regression trees trained with the nonlinear optimization formulation of Blanquero et al. (2021), as well as a significant reduction in training times and slightly better average accuracy compared with the mixed-integer optimization approach of Bertsimas and Dunn (2019). We also report a comparison with the Random Forest ensemble method.
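The single-leaf prediction rule described in the abstract lends itself to a short illustration. The sketch below is a minimal Python rendering of that idea, assuming sigmoid soft splits on multivariate hyperplanes; the dictionary-based tree representation, the attribute names (`a`, `b`, `w`, `c`), and the 0.5 routing threshold are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def srt_predict(x, node):
    """Route x from the root along the higher-probability branch at each
    soft split, then apply the linear regression of the leaf reached.
    Only the nodes on one root-to-leaf path are evaluated, which is the
    conditional computation property mentioned in the abstract."""
    while not node["is_leaf"]:
        # Soft split: left-branch probability as a sigmoid of a
        # multivariate hyperplane a^T x + b (an assumed functional form).
        p_left = 1.0 / (1.0 + np.exp(-(node["a"] @ x + node["b"])))
        node = node["left"] if p_left >= 0.5 else node["right"]
    # The prediction is the linear model attached to the selected leaf.
    return node["w"] @ x + node["c"]

# Minimal depth-1 tree for illustration.
leaf_l = {"is_leaf": True, "w": np.array([1.0, 0.0]), "c": 0.0}
leaf_r = {"is_leaf": True, "w": np.array([0.0, 1.0]), "c": 1.0}
root = {"is_leaf": False, "a": np.array([1.0, -1.0]), "b": 0.0,
        "left": leaf_l, "right": leaf_r}
print(srt_predict(np.array([2.0, 0.5]), root))  # sigmoid(1.5) > 0.5 -> left leaf -> 2.0
```

Note that, per the abstract, the soft probabilities are used during training (which is what makes the formulation smooth and amenable to decomposition), while each prediction follows only the single most probable root-to-leaf path.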
Source journal: European Journal of Operational Research
CiteScore: 11.90
Self-citation rate: 9.40%
Annual publications: 786
Review time: 8.2 months
Journal description: The European Journal of Operational Research (EJOR) publishes high-quality, original papers that contribute to the methodology of operational research (OR) and to the practice of decision making.