A Comprehensive Feature Selection Approach for Machine Learning

International Journal of Distributed Artificial Intelligence Pub Date : 2021-07-01 DOI:10.4018/ijdai.2021070102

S. Das, M. Sanyal, Debamoy Datta

引用次数: 0

Abstract

In machine learning, it is required that the underlying important input variables are known or else the value of the predicted outcome variable would never match the value of the target outcome variable. Machine learning tools are used in many applications where the underlying scientific model is inadequate. Unfortunately, making any kind of mathematical relationship is difficult, and as a result, incorporation of variables during the training becomes a big issue as it affects the accuracy of results. Another important issue is to find the cause behind the phenomena and the major factor that affects the outcome variable. The aim of this article is to focus on developing an approach that is not particular-tool specific, but it gives accurate results under all circumstances. This paper proposes a model that filters out the irrelevant variables irrespective of the type of dataset that the researcher can use. This approach provides parameters for determining the quality of the data used for mining purposes.

查看原文本刊更多论文

面向机器学习的综合特征选择方法

在机器学习中，需要知道潜在的重要输入变量，否则预测结果变量的值永远不会与目标结果变量的值匹配。机器学习工具被用于许多基础科学模型不充分的应用中。不幸的是，建立任何类型的数学关系都是困难的，因此，在训练过程中合并变量成为一个大问题，因为它会影响结果的准确性。另一个重要的问题是找到现象背后的原因和影响结果变量的主要因素。本文的目的是专注于开发一种方法，这种方法不是特定于特定工具，而是在所有情况下都能给出准确的结果。本文提出了一个模型，可以过滤掉无关变量，而不考虑研究人员可以使用的数据集类型。这种方法为确定用于挖掘目的的数据的质量提供了参数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Distributed Artificial Intelligence

自引率

0.00%

发文量