Minghui Bai , Yuan Gao , Xiaoying Gao , Jianbin Ma
{"title":"Multi-objective genetic programming for binary classification with adaptive thresholds and a generalization-optimizing fitness function","authors":"Minghui Bai , Yuan Gao , Xiaoying Gao , Jianbin Ma","doi":"10.1016/j.asoc.2025.113956","DOIUrl":null,"url":null,"abstract":"<div><div>Genetic programming (GP) has been widely applied to classifier construction due to its flexible representation and powerful feature construction capabilities. Existing studies have proposed various fitness functions to improve GP-based classifiers, but most of them rely on a fixed decision threshold. However, when dealing with imbalanced classification problems, a fixed threshold often biases the model toward the majority class, thereby compromising overall performance. To address this issue, in this paper, we propose a novel multi-objective GP framework for constructing binary classifiers with adaptive threshold adjustment. During evolution, the method employs Youden’s Index to dynamically adjust the threshold of each individual, enabling the classifiers to better fit the underlying data distribution. In addition, we introduce a new class separation metric, <em>dist</em><sub>t</sub>, to quantify the clarity of class boundaries and enhance the generalization ability of the evolved models. The framework jointly optimizes three objectives: minority class accuracy, majority class accuracy, and the proposed <em>dist</em><sub>t</sub> metric. Experiments on 14 imbalanced datasets demonstrate that our method significantly outperforms conventional single-objective GP with fixed thresholds. Further results also confirm the positive impact of the proposed <em>dist</em><sub>t</sub> metric on classification performance. Compared to seven existing GP algorithms and five traditional machine learning classifiers, our approach achieves superior overall performance and better generalization ability.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"185 ","pages":"Article 113956"},"PeriodicalIF":6.6000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625012694","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Genetic programming (GP) has been widely applied to classifier construction due to its flexible representation and powerful feature construction capabilities. Existing studies have proposed various fitness functions to improve GP-based classifiers, but most of them rely on a fixed decision threshold. However, when dealing with imbalanced classification problems, a fixed threshold often biases the model toward the majority class, thereby compromising overall performance. To address this issue, in this paper, we propose a novel multi-objective GP framework for constructing binary classifiers with adaptive threshold adjustment. During evolution, the method employs Youden’s Index to dynamically adjust the threshold of each individual, enabling the classifiers to better fit the underlying data distribution. In addition, we introduce a new class separation metric, distt, to quantify the clarity of class boundaries and enhance the generalization ability of the evolved models. The framework jointly optimizes three objectives: minority class accuracy, majority class accuracy, and the proposed distt metric. Experiments on 14 imbalanced datasets demonstrate that our method significantly outperforms conventional single-objective GP with fixed thresholds. Further results also confirm the positive impact of the proposed distt metric on classification performance. Compared to seven existing GP algorithms and five traditional machine learning classifiers, our approach achieves superior overall performance and better generalization ability.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.