Automatic Feature Engineering Using Self-Organizing Maps
Ericks da Silva Rodrigues, D. Martins, F. B. L. Neto
Latin American Conference on Computational Intelligence. DOI: https://doi.org/10.1109/LA-CCI48322.2021.9769788
Abstract
Feature Engineering (FE) consists of generating new, better features to improve the results obtained by Machine Learning models. Very often, FE is performed in a series of trial-and-error steps conducted manually by data scientists. Moreover, FE requires data-specific and domain knowledge, both of which are rarely easy to acquire. To alleviate these problems, we propose an automatic FE approach based on Self-Organizing Maps (SOM), in which new features are generated via pattern recognition. Using the SOM algorithm for variable generation can identify data elements that help Machine Learning models obtain better results, and it points to a broad direction for future research.
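As an illustration of the general idea, the following is a minimal sketch of SOM-based feature generation, not the authors' implementation. The abstract does not say which quantities are derived from the map, so this sketch assumes three new features per sample: the grid row and column of the best-matching unit (BMU) and the distance from the sample to that unit's weight vector. The function names (`train_som`, `best_matching_unit`, `som_features`), the 3x3 grid, the linear decay schedules, and the toy two-cluster data are all hypothetical choices made for the example.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the grid position of the unit whose weight vector is closest to x."""
    return min(weights, key=lambda pos: math.dist(weights[pos], x))


def train_som(data, rows=3, cols=3, epochs=50, lr0=0.5, sigma0=1.5, seed=0):
    """Train a small rectangular SOM; returns {(row, col): weight_vector}."""
    rng = random.Random(seed)
    dim = len(data[0])
    # Assumption: initialise each unit from a randomly chosen training sample.
    weights = {(r, c): list(rng.choice(data))
               for r in range(rows) for c in range(cols)}
    total_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        order = list(data)
        rng.shuffle(order)
        for x in order:
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)                   # linearly decaying learning rate
            sigma = max(sigma0 * (1.0 - frac), 0.3)   # shrinking neighbourhood radius
            bmu = best_matching_unit(weights, x)
            for pos, w in weights.items():
                grid_d2 = (pos[0] - bmu[0]) ** 2 + (pos[1] - bmu[1]) ** 2
                h = math.exp(-grid_d2 / (2.0 * sigma ** 2))  # Gaussian neighbourhood
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
            step += 1
    return weights


def som_features(weights, x):
    """Assumed feature set: BMU row, BMU column, distance to the BMU's weights."""
    bmu = best_matching_unit(weights, x)
    return [float(bmu[0]), float(bmu[1]), math.dist(weights[bmu], x)]


# Toy data: two Gaussian clusters in 2-D (illustrative only).
rng = random.Random(1)
cluster_a = [[rng.gauss(0.0, 0.1), rng.gauss(0.0, 0.1)] for _ in range(30)]
cluster_b = [[rng.gauss(1.0, 0.1), rng.gauss(1.0, 0.1)] for _ in range(30)]
data = cluster_a + cluster_b

som = train_som(data)
# Append the SOM-derived features to each original sample.
augmented = [x + som_features(som, x) for x in data]
```

The `augmented` rows could then be passed to any downstream model in place of the raw samples. Whether such features actually improve results depends on the dataset and the model, and would need to be checked empirically.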