{"title":"On The Irrelevance of Machine Learning Algorithms and the Importance of Relativity","authors":"Carlos Huertas, Qi Zhao","doi":"10.1109/ICMEW59549.2023.00009","DOIUrl":null,"url":null,"abstract":"Information explosion has brought us a wide range of data formats and machine learning keeps in constant evolution to develop mechanisms to extract knowledge from them. Modern models in the Deep Learning space have proven to be very successful in multiple applications, yet in the tabular space they fail to provide consistent competitive performance. However, in this work we claim model selection can become irrelevant as the key tends to lie in data processing. In this paper we introduce the concept of relativity in feature engineering, a powerful methodology to boost any classifier performance and we provide over 30 different configurations of models and feature engineering designs to prove we can bias any result to help an arbitrary model score best. Our results attribute 600% more value to feature engineering than model selection. In order to validate the effectiveness of our approach, we submitted our work to a live machine learning competition with outstanding results regardless of our model of choice.","PeriodicalId":111482,"journal":{"name":"2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW59549.2023.00009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Information explosion has brought us a wide range of data formats and machine learning keeps in constant evolution to develop mechanisms to extract knowledge from them. Modern models in the Deep Learning space have proven to be very successful in multiple applications, yet in the tabular space they fail to provide consistent competitive performance. However, in this work we claim model selection can become irrelevant as the key tends to lie in data processing. In this paper we introduce the concept of relativity in feature engineering, a powerful methodology to boost any classifier performance and we provide over 30 different configurations of models and feature engineering designs to prove we can bias any result to help an arbitrary model score best. Our results attribute 600% more value to feature engineering than model selection. In order to validate the effectiveness of our approach, we submitted our work to a live machine learning competition with outstanding results regardless of our model of choice.