{"title":"Robust feature selection for high-dimensional datasets using a GPU-accelerated ensemble of cooperative coevolutionary optimizers","authors":"Marjan Firouznia, Pietro Ruiu, G. Trunfio","doi":"10.1109/PDP59025.2023.00052","DOIUrl":null,"url":null,"abstract":"Feature selection is an increasingly important step in the application of machine learning and knowledge discovery techniques to high-dimensional datasets. However, the growing complexity and size of datasets have made feature selection increasingly challenging, as selecting an optimal subset of features can be computationally very expensive, especially when a robust solution is required. To address this issue, we present an approach based on ensembles of cooperative coevolutionary optimisers and its parallelisation for hybrid multi-core CPU and GPU computation. The application of the developed algorithm to some typical high-dimensional datasets is discussed in the paper. According to the preliminary results, the proposed framework represents a valuable tool for addressing the computational challenges faced in feature selection, and it can be potentially applied to a wide range of machine learning and knowledge discovery tasks.","PeriodicalId":153500,"journal":{"name":"2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)","volume":"294 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDP59025.2023.00052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Feature selection is an increasingly important step in the application of machine learning and knowledge discovery techniques to high-dimensional datasets. However, the growing complexity and size of datasets have made feature selection increasingly challenging, as selecting an optimal subset of features can be computationally very expensive, especially when a robust solution is required. To address this issue, we present an approach based on ensembles of cooperative coevolutionary optimisers and its parallelisation for hybrid multi-core CPU and GPU computation. The application of the developed algorithm to some typical high-dimensional datasets is discussed in the paper. According to the preliminary results, the proposed framework represents a valuable tool for addressing the computational challenges faced in feature selection, and it can be potentially applied to a wide range of machine learning and knowledge discovery tasks.