{"title":"Multi-objective multi-optima ensemble binary optimization algorithm for identifying optimal set of features for ECG-based identification","authors":"Mamata Pandey, Anup Kumar Keshri","doi":"10.1016/j.asoc.2025.113556","DOIUrl":null,"url":null,"abstract":"<div><div>Reducing the number of input features for a machine learning model decreases its complexity and computation time. However, it is crucial to choose the best set of features without compromising the model's performance. There could be several subsets of features with optimal behavior. Evolutionary algorithms are great for feature optimization. However, different evolutionary algorithms may produce different solutions, and their performance is influenced by the size of the data and the types of features. To address these issues, three popular algorithms, Genetic Algorithm (GA), Particle Swarm Optimization (PSO), and Binary Differential Evolution (BDE) have been adapted to accommodate multiple populations for achieving multiple optima. The BDE algorithm applied here is a novel variant with modified mutation and crossover operators. Then they are combined to create a novel 'Multi-Objective Multi-Optima Ensemble Binary Optimization Algorithm. The algorithm has been tested on 71 fiducial ECG features including temporal, amplitude, distance, slope, angular, and HRV features for ECG-based identification. These 71 features can identify individuals using the SVM classifier with 98 % accuracy. With 71 features, there could be a maximum of <span><math><msup><mrow><mn>2</mn></mrow><mrow><mn>71</mn></mrow></msup></math></span> subsets. The optimization objective is to find all feature subsets that maximize classifier accuracy while minimizing the number of features. The ensemble optimizer has found 190 unique optimized subsets. These subsets have been analyzed to identify critical features for identification. The most optimal subset with the minimum number of features and maximum accuracy has been identified. The practical implementation of an ECG-based identification system requires an efficient system that can process incoming signals, extract features from the signal, and identify individuals in the shortest time possible. To speed up the processing of the input signal, a novel DFA-based algorithm has been proposed to identify fiducial points P, Q, R, S, and T from an ECG signal. The proposed algorithm applies to both recorded and live ECG signals.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"182 ","pages":"Article 113556"},"PeriodicalIF":7.2000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625008671","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Reducing the number of input features for a machine learning model decreases its complexity and computation time. However, it is crucial to choose the best set of features without compromising the model's performance. There could be several subsets of features with optimal behavior. Evolutionary algorithms are great for feature optimization. However, different evolutionary algorithms may produce different solutions, and their performance is influenced by the size of the data and the types of features. To address these issues, three popular algorithms, Genetic Algorithm (GA), Particle Swarm Optimization (PSO), and Binary Differential Evolution (BDE) have been adapted to accommodate multiple populations for achieving multiple optima. The BDE algorithm applied here is a novel variant with modified mutation and crossover operators. Then they are combined to create a novel 'Multi-Objective Multi-Optima Ensemble Binary Optimization Algorithm. The algorithm has been tested on 71 fiducial ECG features including temporal, amplitude, distance, slope, angular, and HRV features for ECG-based identification. These 71 features can identify individuals using the SVM classifier with 98 % accuracy. With 71 features, there could be a maximum of subsets. The optimization objective is to find all feature subsets that maximize classifier accuracy while minimizing the number of features. The ensemble optimizer has found 190 unique optimized subsets. These subsets have been analyzed to identify critical features for identification. The most optimal subset with the minimum number of features and maximum accuracy has been identified. The practical implementation of an ECG-based identification system requires an efficient system that can process incoming signals, extract features from the signal, and identify individuals in the shortest time possible. To speed up the processing of the input signal, a novel DFA-based algorithm has been proposed to identify fiducial points P, Q, R, S, and T from an ECG signal. The proposed algorithm applies to both recorded and live ECG signals.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.