An Enhanced Machine Learning Approach to Identify Noise and Detect Relevant Structures for Predictive Modeling

2023 9th International Conference on Information Technology Trends (ITT) Pub Date : 2023-05-24 DOI:10.1109/ITT59889.2023.10184237

M. Uddin

{"title":"An Enhanced Machine Learning Approach to Identify Noise and Detect Relevant Structures for Predictive Modeling","authors":"M. Uddin","doi":"10.1109/ITT59889.2023.10184237","DOIUrl":null,"url":null,"abstract":"The era of big data and social networking platforms have provided great repositories of the data for mining useful information for the real-world industry. However, along with this benefit comes the noise in the data. Generally, noise is the data-set that are redundant, false, bad, and/or outliers. Data cleaning, outlier identification, feature engineering, data slicing, etc. are few of many techniques used traditionally. End goal remains ensuring good data (signal) is not lost in bad data (noise) and less processing cost are incurred to extract useful knowledge out of given big data. This paper presents a follow up progress on existing work of the author in relevance of machine learning algorithms, academic and career data predictions and personality computing. All of that have been initially inspired by potential of useful relationships and data points in unstructured data and thus Noise becomes very relevant and may appear Signal in other contexts and predictors in goal. This proposed model is collectively titled as ‘Noise Removal and Structured Data Detection’ based on inherited parallel processing and unique n-Dimensional training approach. Personality features can be quantified into talent traits, matrix indicating the max/min for relevance factors in the academics/career of nD. The engine internals examine and train the algorithm that it minimizes the x,y co-ordinates and maximizes the z co-ordinate. It records and compares the engine internal metrics and reports it back to engine to further optimize the machine learning process until the optimum results are obtained or do not improve any further.","PeriodicalId":223578,"journal":{"name":"2023 9th International Conference on Information Technology Trends (ITT)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 9th International Conference on Information Technology Trends (ITT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITT59889.2023.10184237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The era of big data and social networking platforms have provided great repositories of the data for mining useful information for the real-world industry. However, along with this benefit comes the noise in the data. Generally, noise is the data-set that are redundant, false, bad, and/or outliers. Data cleaning, outlier identification, feature engineering, data slicing, etc. are few of many techniques used traditionally. End goal remains ensuring good data (signal) is not lost in bad data (noise) and less processing cost are incurred to extract useful knowledge out of given big data. This paper presents a follow up progress on existing work of the author in relevance of machine learning algorithms, academic and career data predictions and personality computing. All of that have been initially inspired by potential of useful relationships and data points in unstructured data and thus Noise becomes very relevant and may appear Signal in other contexts and predictors in goal. This proposed model is collectively titled as ‘Noise Removal and Structured Data Detection’ based on inherited parallel processing and unique n-Dimensional training approach. Personality features can be quantified into talent traits, matrix indicating the max/min for relevance factors in the academics/career of nD. The engine internals examine and train the algorithm that it minimizes the x,y co-ordinates and maximizes the z co-ordinate. It records and compares the engine internal metrics and reports it back to engine to further optimize the machine learning process until the optimum results are obtained or do not improve any further.

查看原文本刊更多论文

一种用于预测建模的增强机器学习方法识别噪声和检测相关结构

大数据和社交网络平台的时代为现实世界的行业挖掘有用的信息提供了巨大的数据存储库。然而，伴随着这些好处而来的是数据中的噪音。通常，噪声是冗余的、错误的、坏的和/或异常值的数据集。数据清洗、离群点识别、特征工程、数据切片等是传统技术中的一小部分。最终目标仍然是确保好的数据(信号)不会在坏数据(噪声)中丢失，并减少从给定的大数据中提取有用知识的处理成本。本文介绍了作者在机器学习算法、学术和职业数据预测以及人格计算相关方面的现有工作的后续进展。所有这些最初都是受非结构化数据中有用关系和数据点的潜力的启发，因此噪声变得非常相关，并且可能在其他上下文和目标预测中出现信号。该模型基于继承的并行处理和独特的n维训练方法，被统称为“噪声去除和结构化数据检测”。人格特征可以量化为人才特征，矩阵表示nD的学业/职业相关因素的最大/最小值。引擎内部检查和训练算法，它最小化x,y坐标和最大化z坐标。它记录并比较发动机内部指标，并将其反馈给发动机，以进一步优化机器学习过程，直到获得最佳结果或不再进一步改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 9th International Conference on Information Technology Trends (ITT)

自引率

0.00%

发文量