MACFE: A Meta-learning and Causality Based Feature Engineering Framework

Mexican International Conference on Artificial Intelligence. Pub Date: 2022-07-08. DOI: 10.48550/arXiv.2207.04010

Iván Reyes-Amezcua, Daniel Flores-Araiza, G. Ochoa-Ruiz, Andres Mendez-Vazquez, E. Rodriguez-Tello

Abstract

Feature engineering has become one of the most important steps for improving model prediction performance and producing quality datasets. However, it requires non-trivial domain knowledge and is time-consuming. Automating this process has therefore become an active area of research and of interest in industrial applications. This paper proposes a novel method called Meta-learning and Causality Based Feature Engineering (MACFE), which is based on meta-learning, feature distribution encoding, and causality feature selection. In MACFE, meta-learning is used to find the best transformations, and the search is accelerated by pre-selecting “original” features according to their causal relevance. Experimental evaluations on popular classification datasets show that MACFE can improve prediction performance across eight classifiers, outperforms the current state-of-the-art methods on average by at least 6.54%, and obtains an improvement of 2.71% over the best previous works.
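
The abstract names three ingredients (causal pre-selection, feature distribution encoding, and a meta-learned choice of transformation) without giving their details. The sketch below is only an illustration of how such pieces could fit together, not the authors' implementation. Everything in it is an assumption: the histogram encoding, the nearest-neighbour lookup, the three candidate transformations, the toy `meta_base`, and the hand-written `relevance` scores that stand in for the output of a causal feature selection step.

```python
import math

# Hypothetical candidate transformations; the abstract does not list MACFE's set.
TRANSFORMS = {
    "identity": lambda x: x,
    "log1p_abs": lambda x: math.log1p(abs(x)),
    "square": lambda x: x * x,
}

def preselect(relevance, k):
    """Keep the k feature names with the highest externally supplied score."""
    return sorted(relevance, key=relevance.get, reverse=True)[:k]

def encode_distribution(values, bins=5):
    """Encode a feature as a normalized histogram over its min-max range."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # constant feature: avoid division by zero
    counts = [0] * bins
    for v in values:
        idx = min(int((v - lo) / width), bins - 1)  # clamp the maximum value
        counts[idx] += 1
    return [c / len(values) for c in counts]

def recommend_transform(values, meta_base, bins=5):
    """Return the transformation name stored with the nearest encoding."""
    enc = encode_distribution(values, bins)
    nearest = min(meta_base, key=lambda entry: math.dist(enc, entry[0]))
    return nearest[1]

# Toy meta-knowledge base (assumed): encoding -> transformation that helped before.
meta_base = [
    ([0.8, 0.1, 0.05, 0.03, 0.02], "log1p_abs"),  # right-skewed feature
    ([0.2, 0.2, 0.2, 0.2, 0.2], "identity"),      # roughly uniform feature
]

features = {
    "income": [1, 2, 2, 3, 3, 4, 5, 8, 40, 100],
    "age": [20, 30, 40, 50, 60, 25, 35, 45, 55, 60],
    "noise": [5, 1, 4, 2, 3, 5, 1, 4, 2, 3],
}
# Stand-in scores; a real pipeline would obtain these from a causal method.
relevance = {"income": 0.9, "age": 0.6, "noise": 0.05}

# Only pre-selected features enter the transformation search.
plan = {name: recommend_transform(features[name], meta_base)
        for name in preselect(relevance, k=2)}
engineered = {name: [TRANSFORMS[t](v) for v in features[name]]
              for name, t in plan.items()}
```

Restricting the lookup to pre-selected features is what shortens the search in this sketch: the low-relevance `noise` column is never encoded or transformed.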
