基于决策空间Pareto前分析的集成模型性能曲线优化

IF 2.3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems Pub Date : 2025-05-29 DOI:10.1111/exsy.70075

Alberto Gutierrez-Gallego, Oscar Garnica, Daniel Parra, J. Manuel Velasco, J. Ignacio Hidalgo

{"title":"基于决策空间Pareto前分析的集成模型性能曲线优化","authors":"Alberto Gutierrez-Gallego, Oscar Garnica, Daniel Parra, J. Manuel Velasco, J. Ignacio Hidalgo","doi":"10.1111/exsy.70075","DOIUrl":null,"url":null,"abstract":"Receiver operating characteristic curves are commonly used to evaluate the performance of machine learning ensemble classification models that combine multiple classifiers through a voting procedure. Although these models have many parameters, standard ROC analyses typically vary only the voting threshold, limiting their potential for improvement. In this paper, we propose Performance Curve Mapping, a new method that redefines the ROC curve as the Pareto front of a multi-objective optimisation problem. The method maps the multidimensional space of all ensemble parameters (Decision space) into a two-dimensional Objective space defined by classification performance metrics. We employ an algorithm based on NSGA-II to explore the Decision space and validate the proposal on two different classification problems: (1) predicting car insurance claims in a highly imbalanced dataset (Insurance dataset), and (2) predicting obesity risk in a balanced clinical dataset (GenObIA dataset). We compare our method with alternative ensemble optimisation approaches, using visual assessment, the area under the curve and the Youden index as performance measures. In the Insurance dataset, Performance Curve Mapping achieves an average improvement of 46.4% in AUC-ROC and 26.1% in the Youden index. In the GenObIA dataset, it achieves an average improvement of 29.7% in AUC-ROC and 11.9% in the Youden index. All improvements are calculated relative to the maximum achievable improvement.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 7","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/exsy.70075","citationCount":"0","resultStr":"{\"title\":\"Optimising Performance Curves for Ensemble Models through Pareto Front Analysis of the Decision Space\",\"authors\":\"Alberto Gutierrez-Gallego, Oscar Garnica, Daniel Parra, J. Manuel Velasco, J. Ignacio Hidalgo\",\"doi\":\"10.1111/exsy.70075\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Receiver operating characteristic curves are commonly used to evaluate the performance of machine learning ensemble classification models that combine multiple classifiers through a voting procedure. Although these models have many parameters, standard ROC analyses typically vary only the voting threshold, limiting their potential for improvement. In this paper, we propose Performance Curve Mapping, a new method that redefines the ROC curve as the Pareto front of a multi-objective optimisation problem. The method maps the multidimensional space of all ensemble parameters (Decision space) into a two-dimensional Objective space defined by classification performance metrics. We employ an algorithm based on NSGA-II to explore the Decision space and validate the proposal on two different classification problems: (1) predicting car insurance claims in a highly imbalanced dataset (Insurance dataset), and (2) predicting obesity risk in a balanced clinical dataset (GenObIA dataset). We compare our method with alternative ensemble optimisation approaches, using visual assessment, the area under the curve and the Youden index as performance measures. In the Insurance dataset, Performance Curve Mapping achieves an average improvement of 46.4% in AUC-ROC and 26.1% in the Youden index. In the GenObIA dataset, it achieves an average improvement of 29.7% in AUC-ROC and 11.9% in the Youden index. All improvements are calculated relative to the maximum achievable improvement.\",\"PeriodicalId\":51053,\"journal\":{\"name\":\"Expert Systems\",\"volume\":\"42 7\",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/exsy.70075\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70075\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70075","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

接受者工作特征曲线通常用于评估通过投票程序组合多个分类器的机器学习集成分类模型的性能。虽然这些模型有许多参数，但标准的ROC分析通常只改变投票阈值，限制了它们的改进潜力。在本文中，我们提出了性能曲线映射，这是一种将ROC曲线重新定义为多目标优化问题的帕累托前沿的新方法。该方法将所有集成参数的多维空间（决策空间）映射到由分类性能指标定义的二维客观空间。我们使用基于NSGA-II的算法来探索决策空间，并在两个不同的分类问题上验证该建议：(1)在高度不平衡的数据集（insurance数据集）中预测汽车保险索赔，(2)在平衡的临床数据集（GenObIA数据集）中预测肥胖风险。我们将我们的方法与其他集成优化方法进行比较，使用视觉评估，曲线下面积和约登指数作为性能衡量标准。在Insurance数据集中，Performance Curve Mapping在AUC-ROC上平均提高了46.4%，在Youden指数上平均提高了26.1%。在GenObIA数据集中，AUC-ROC平均提高29.7%，Youden指数平均提高11.9%。所有改进都是相对于可实现的最大改进来计算的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Optimising Performance Curves for Ensemble Models through Pareto Front Analysis of the Decision Space

查看原文本刊更多论文

Optimising Performance Curves for Ensemble Models through Pareto Front Analysis of the Decision Space

Receiver operating characteristic curves are commonly used to evaluate the performance of machine learning ensemble classification models that combine multiple classifiers through a voting procedure. Although these models have many parameters, standard ROC analyses typically vary only the voting threshold, limiting their potential for improvement. In this paper, we propose Performance Curve Mapping, a new method that redefines the ROC curve as the Pareto front of a multi-objective optimisation problem. The method maps the multidimensional space of all ensemble parameters (Decision space) into a two-dimensional Objective space defined by classification performance metrics. We employ an algorithm based on NSGA-II to explore the Decision space and validate the proposal on two different classification problems: (1) predicting car insurance claims in a highly imbalanced dataset (Insurance dataset), and (2) predicting obesity risk in a balanced clinical dataset (GenObIA dataset). We compare our method with alternative ensemble optimisation approaches, using visual assessment, the area under the curve and the Youden index as performance measures. In the Insurance dataset, Performance Curve Mapping achieves an average improvement of 46.4% in AUC-ROC and 26.1% in the Youden index. In the GenObIA dataset, it achieves an average improvement of 29.7% in AUC-ROC and 11.9% in the Youden index. All improvements are calculated relative to the maximum achievable improvement.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Expert Systems 工程技术-计算机：理论方法

CiteScore

7.40

自引率

6.10%

发文量

266

审稿时长

24 months

期刊介绍： Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper. As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.