MACFE:一个基于元学习和因果关系的特征工程框架

Iván Reyes-Amezcua, Daniel Flores-Araiza, G. Ochoa-Ruiz, Andres Mendez-Vazquez, E. Rodriguez-Tello
{"title":"MACFE:一个基于元学习和因果关系的特征工程框架","authors":"Iván Reyes-Amezcua, Daniel Flores-Araiza, G. Ochoa-Ruiz, Andres Mendez-Vazquez, E. Rodriguez-Tello","doi":"10.48550/arXiv.2207.04010","DOIUrl":null,"url":null,"abstract":". Feature engineering has become one of the most important steps to improve model prediction performance, and to produce quality datasets. However, this process requires non-trivial domain-knowledge which involves a time-consuming process. Thereby, automating such process has become an active area of research and of interest in industrial applications. In this paper, a novel method, called Meta-learning and Causality Based Feature Engineering (MACFE), is proposed; our method is based on the use of meta-learning, feature distribution encoding, and causality feature selection. In MACFE, meta-learning is used to find the best transformations, then the search is accelerated by pre-selecting “original” features given their causal relevance. Experimental evaluations on popular classification datasets show that MACFE can improve the prediction performance across eight classifiers, outperforms the cur-rent state-of-the-art methods in average by at least 6.54%, and obtains an improvement of 2.71% over the best previous works.","PeriodicalId":166595,"journal":{"name":"Mexican International Conference on Artificial Intelligence","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MACFE: A Meta-learning and Causality Based Feature Engineering Framework\",\"authors\":\"Iván Reyes-Amezcua, Daniel Flores-Araiza, G. Ochoa-Ruiz, Andres Mendez-Vazquez, E. Rodriguez-Tello\",\"doi\":\"10.48550/arXiv.2207.04010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\". Feature engineering has become one of the most important steps to improve model prediction performance, and to produce quality datasets. However, this process requires non-trivial domain-knowledge which involves a time-consuming process. Thereby, automating such process has become an active area of research and of interest in industrial applications. In this paper, a novel method, called Meta-learning and Causality Based Feature Engineering (MACFE), is proposed; our method is based on the use of meta-learning, feature distribution encoding, and causality feature selection. In MACFE, meta-learning is used to find the best transformations, then the search is accelerated by pre-selecting “original” features given their causal relevance. Experimental evaluations on popular classification datasets show that MACFE can improve the prediction performance across eight classifiers, outperforms the cur-rent state-of-the-art methods in average by at least 6.54%, and obtains an improvement of 2.71% over the best previous works.\",\"PeriodicalId\":166595,\"journal\":{\"name\":\"Mexican International Conference on Artificial Intelligence\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mexican International Conference on Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2207.04010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mexican International Conference on Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2207.04010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

. 特征工程已成为提高模型预测性能和生成高质量数据集的重要步骤之一。然而,这一过程需要大量的领域知识,这是一个耗时的过程。因此,自动化这一过程已成为一个活跃的研究领域和工业应用的兴趣。本文提出了一种新的方法,称为元学习和基于因果关系的特征工程(MACFE);我们的方法是基于元学习、特征分布编码和因果关系特征选择的使用。在MACFE中,元学习用于寻找最佳转换,然后通过预先选择“原始”特征来加速搜索,因为它们具有因果关系。在常用分类数据集上的实验评估表明,MACFE可以提高8个分类器的预测性能,平均比目前最先进的方法提高至少6.54%,比以前最好的方法提高2.71%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MACFE: A Meta-learning and Causality Based Feature Engineering Framework
. Feature engineering has become one of the most important steps to improve model prediction performance, and to produce quality datasets. However, this process requires non-trivial domain-knowledge which involves a time-consuming process. Thereby, automating such process has become an active area of research and of interest in industrial applications. In this paper, a novel method, called Meta-learning and Causality Based Feature Engineering (MACFE), is proposed; our method is based on the use of meta-learning, feature distribution encoding, and causality feature selection. In MACFE, meta-learning is used to find the best transformations, then the search is accelerated by pre-selecting “original” features given their causal relevance. Experimental evaluations on popular classification datasets show that MACFE can improve the prediction performance across eight classifiers, outperforms the cur-rent state-of-the-art methods in average by at least 6.54%, and obtains an improvement of 2.71% over the best previous works.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信