Alberto Gutierrez-Gallego, Oscar Garnica, Daniel Parra, J. Manuel Velasco, J. Ignacio Hidalgo
{"title":"基于决策空间Pareto前分析的集成模型性能曲线优化","authors":"Alberto Gutierrez-Gallego, Oscar Garnica, Daniel Parra, J. Manuel Velasco, J. Ignacio Hidalgo","doi":"10.1111/exsy.70075","DOIUrl":null,"url":null,"abstract":"<p>Receiver operating characteristic curves are commonly used to evaluate the performance of machine learning ensemble classification models that combine multiple classifiers through a voting procedure. Although these models have many parameters, standard ROC analyses typically vary only the voting threshold, limiting their potential for improvement. In this paper, we propose <span>Performance Curve Mapping</span>, a new method that redefines the ROC curve as the Pareto front of a multi-objective optimisation problem. The method maps the multidimensional space of all ensemble parameters (Decision space) into a two-dimensional Objective space defined by classification performance metrics. We employ an algorithm based on NSGA-II to explore the Decision space and validate the proposal on two different classification problems: (1) predicting car insurance claims in a highly imbalanced dataset (<span>Insurance</span> dataset), and (2) predicting obesity risk in a balanced clinical dataset (<span>GenObIA</span> dataset). We compare our method with alternative ensemble optimisation approaches, using visual assessment, the area under the curve and the Youden index as performance measures. In the <span>Insurance</span> dataset, <span>Performance Curve Mapping</span> achieves an average improvement of 46.4% in AUC-ROC and 26.1% in the Youden index. In the <span>GenObIA</span> dataset, it achieves an average improvement of 29.7% in AUC-ROC and 11.9% in the Youden index. All improvements are calculated relative to the maximum achievable improvement.</p>","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 7","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/exsy.70075","citationCount":"0","resultStr":"{\"title\":\"Optimising Performance Curves for Ensemble Models through Pareto Front Analysis of the Decision Space\",\"authors\":\"Alberto Gutierrez-Gallego, Oscar Garnica, Daniel Parra, J. Manuel Velasco, J. Ignacio Hidalgo\",\"doi\":\"10.1111/exsy.70075\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Receiver operating characteristic curves are commonly used to evaluate the performance of machine learning ensemble classification models that combine multiple classifiers through a voting procedure. Although these models have many parameters, standard ROC analyses typically vary only the voting threshold, limiting their potential for improvement. In this paper, we propose <span>Performance Curve Mapping</span>, a new method that redefines the ROC curve as the Pareto front of a multi-objective optimisation problem. The method maps the multidimensional space of all ensemble parameters (Decision space) into a two-dimensional Objective space defined by classification performance metrics. We employ an algorithm based on NSGA-II to explore the Decision space and validate the proposal on two different classification problems: (1) predicting car insurance claims in a highly imbalanced dataset (<span>Insurance</span> dataset), and (2) predicting obesity risk in a balanced clinical dataset (<span>GenObIA</span> dataset). We compare our method with alternative ensemble optimisation approaches, using visual assessment, the area under the curve and the Youden index as performance measures. In the <span>Insurance</span> dataset, <span>Performance Curve Mapping</span> achieves an average improvement of 46.4% in AUC-ROC and 26.1% in the Youden index. In the <span>GenObIA</span> dataset, it achieves an average improvement of 29.7% in AUC-ROC and 11.9% in the Youden index. All improvements are calculated relative to the maximum achievable improvement.</p>\",\"PeriodicalId\":51053,\"journal\":{\"name\":\"Expert Systems\",\"volume\":\"42 7\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/exsy.70075\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70075\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70075","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Optimising Performance Curves for Ensemble Models through Pareto Front Analysis of the Decision Space
Receiver operating characteristic curves are commonly used to evaluate the performance of machine learning ensemble classification models that combine multiple classifiers through a voting procedure. Although these models have many parameters, standard ROC analyses typically vary only the voting threshold, limiting their potential for improvement. In this paper, we propose Performance Curve Mapping, a new method that redefines the ROC curve as the Pareto front of a multi-objective optimisation problem. The method maps the multidimensional space of all ensemble parameters (Decision space) into a two-dimensional Objective space defined by classification performance metrics. We employ an algorithm based on NSGA-II to explore the Decision space and validate the proposal on two different classification problems: (1) predicting car insurance claims in a highly imbalanced dataset (Insurance dataset), and (2) predicting obesity risk in a balanced clinical dataset (GenObIA dataset). We compare our method with alternative ensemble optimisation approaches, using visual assessment, the area under the curve and the Youden index as performance measures. In the Insurance dataset, Performance Curve Mapping achieves an average improvement of 46.4% in AUC-ROC and 26.1% in the Youden index. In the GenObIA dataset, it achieves an average improvement of 29.7% in AUC-ROC and 11.9% in the Youden index. All improvements are calculated relative to the maximum achievable improvement.
期刊介绍:
Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper.
As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.