A Comparative Study on Recent Automatic Data Fusion Methods

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computers Pub Date : 2023-12-30 DOI:10.3390/computers13010013

Luis Manuel Pereira, A. Salazar, L. Vergara

{"title":"A Comparative Study on Recent Automatic Data Fusion Methods","authors":"Luis Manuel Pereira, A. Salazar, L. Vergara","doi":"10.3390/computers13010013","DOIUrl":null,"url":null,"abstract":"Automatic data fusion is an important field of machine learning that has been increasingly studied. The objective is to improve the classification performance from several individual classifiers in terms of accuracy and stability of the results. This paper presents a comparative study on recent data fusion methods. The fusion step can be applied at early and/or late stages of the classification procedure. Early fusion consists of combining features from different sources or domains to form the observation vector before the training of the individual classifiers. On the contrary, late fusion consists of combining the results from the individual classifiers after the testing stage. Late fusion has two setups, combination of the posterior probabilities (scores), which is called soft fusion, and combination of the decisions, which is called hard fusion. A theoretical analysis of the conditions for applying the three kinds of fusion (early, late, and late hard) is introduced. Thus, we propose a comparative analysis with different schemes of fusion, including weaknesses and strengths of the state-of-the-art methods studied from the following perspectives: sensors, features, scores, and decisions.","PeriodicalId":46292,"journal":{"name":"Computers","volume":" 43","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/computers13010013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Automatic data fusion is an important field of machine learning that has been increasingly studied. The objective is to improve the classification performance from several individual classifiers in terms of accuracy and stability of the results. This paper presents a comparative study on recent data fusion methods. The fusion step can be applied at early and/or late stages of the classification procedure. Early fusion consists of combining features from different sources or domains to form the observation vector before the training of the individual classifiers. On the contrary, late fusion consists of combining the results from the individual classifiers after the testing stage. Late fusion has two setups, combination of the posterior probabilities (scores), which is called soft fusion, and combination of the decisions, which is called hard fusion. A theoretical analysis of the conditions for applying the three kinds of fusion (early, late, and late hard) is introduced. Thus, we propose a comparative analysis with different schemes of fusion, including weaknesses and strengths of the state-of-the-art methods studied from the following perspectives: sensors, features, scores, and decisions.

查看原文本刊更多论文

最新自动数据融合方法比较研究

自动数据融合是机器学习的一个重要领域，对它的研究越来越多。其目的是从结果的准确性和稳定性方面提高多个单独分类器的分类性能。本文对最新的数据融合方法进行了比较研究。融合步骤可应用于分类程序的早期和/或后期阶段。早期融合包括在训练单个分类器之前，将不同来源或领域的特征结合起来，形成观察向量。相反，后期融合则是在测试阶段结束后，将各个分类器的结果结合起来。后期融合有两种设置，一种是后验概率（分数）的组合，称为软融合；另一种是判定结果的组合，称为硬融合。我们对三种融合（早期融合、晚期融合和晚期硬融合）的应用条件进行了理论分析。因此，我们提出了不同融合方案的比较分析，包括从传感器、特征、分数和决策等角度研究最先进方法的优缺点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊