Code2vect: An efficient heterogenous data classifier and nonlinear regression technique

IF 1 4区 工程技术 Q4 MECHANICS
Clara Argerich Martín , Ruben Ibáñez Pinillo , Anais Barasinski , Francisco Chinesta
{"title":"Code2vect: An efficient heterogenous data classifier and nonlinear regression technique","authors":"Clara Argerich Martín ,&nbsp;Ruben Ibáñez Pinillo ,&nbsp;Anais Barasinski ,&nbsp;Francisco Chinesta","doi":"10.1016/j.crme.2019.11.002","DOIUrl":null,"url":null,"abstract":"<div><p>The aim of this paper is to present a new classification and regression algorithm based on Artificial Intelligence. The main feature of this algorithm, which will be called Code2Vect, is the nature of the data to treat: qualitative or quantitative and continuous or discrete. Contrary to other artificial intelligence techniques based on the “Big-Data,” this new approach will enable working with a reduced amount of data, within the so-called “Smart Data” paradigm. Moreover, the main purpose of this algorithm is to enable the representation of high-dimensional data and more specifically grouping and visualizing this data according to a given target. For that purpose, the data will be projected into a vectorial space equipped with an appropriate metric, able to group data according to their affinity (with respect to a given output of interest). Furthermore, another application of this algorithm lies on its prediction capability. As it occurs with most common data-mining techniques such as regression trees, by giving an input the output will be inferred, in this case considering the nature of the data formerly described. In order to illustrate its potentialities, two different applications will be addressed, one concerning the representation of high-dimensional and categorical data and another featuring the prediction capabilities of the algorithm.</p></div>","PeriodicalId":50997,"journal":{"name":"Comptes Rendus Mecanique","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.crme.2019.11.002","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Comptes Rendus Mecanique","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1631072119301731","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MECHANICS","Score":null,"Total":0}
引用次数: 19

Abstract

The aim of this paper is to present a new classification and regression algorithm based on Artificial Intelligence. The main feature of this algorithm, which will be called Code2Vect, is the nature of the data to treat: qualitative or quantitative and continuous or discrete. Contrary to other artificial intelligence techniques based on the “Big-Data,” this new approach will enable working with a reduced amount of data, within the so-called “Smart Data” paradigm. Moreover, the main purpose of this algorithm is to enable the representation of high-dimensional data and more specifically grouping and visualizing this data according to a given target. For that purpose, the data will be projected into a vectorial space equipped with an appropriate metric, able to group data according to their affinity (with respect to a given output of interest). Furthermore, another application of this algorithm lies on its prediction capability. As it occurs with most common data-mining techniques such as regression trees, by giving an input the output will be inferred, in this case considering the nature of the data formerly described. In order to illustrate its potentialities, two different applications will be addressed, one concerning the representation of high-dimensional and categorical data and another featuring the prediction capabilities of the algorithm.

Code2vect:一个高效的异构数据分类器和非线性回归技术
提出了一种新的基于人工智能的分类回归算法。该算法将被称为Code2Vect,其主要特征是要处理的数据的性质:定性或定量,连续或离散。与其他基于“大数据”的人工智能技术相反,这种新方法将在所谓的“智能数据”范式下使用更少的数据。此外,该算法的主要目的是支持高维数据的表示,更具体地说,是根据给定的目标对这些数据进行分组和可视化。为此,数据将被投影到具有适当度量的向量空间中,能够根据数据的亲缘性(相对于给定的感兴趣的输出)对数据进行分组。此外,该算法的另一个应用在于其预测能力。正如大多数常见的数据挖掘技术(如回归树)所发生的那样,通过给出输入,将推断出输出,在这种情况下,考虑到先前描述的数据的性质。为了说明其潜力,将讨论两种不同的应用,一种涉及高维和分类数据的表示,另一种涉及算法的预测能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Comptes Rendus Mecanique
Comptes Rendus Mecanique 物理-力学
CiteScore
1.40
自引率
0.00%
发文量
0
审稿时长
12 months
期刊介绍: The Comptes rendus - Mécanique cover all fields of the discipline: Logic, Combinatorics, Number Theory, Group Theory, Mathematical Analysis, (Partial) Differential Equations, Geometry, Topology, Dynamical systems, Mathematical Physics, Mathematical Problems in Mechanics, Signal Theory, Mathematical Economics, … The journal publishes original and high-quality research articles. These can be in either in English or in French, with an abstract in both languages. An abridged version of the main text in the second language may also be included.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信