使用监督学习技术评估预测的准确性

S. K., Sajimon Abraham
{"title":"使用监督学习技术评估预测的准确性","authors":"S. K., Sajimon Abraham","doi":"10.1145/3339311.3339337","DOIUrl":null,"url":null,"abstract":"The term Big data is used to refer the huge volume of complex and growing data generated from many distinct electronic gadgets. In the case of Big Data, the commonly used programming methods are not adequate to collect, store and analyze the data within a short period. Statistics as well as machine learning techniques are used for finding patterns and information from large data through data driven decision-making. Big Data analytics gives competitive opportunities in designing business plans for Business Analytics. For analytical purpose, traditionally we use Multiple Linear Regression (MLR) model in the statistical method, a type of Supervised Machine Learning Algorithm. We implemented Cross-Validation Resampling technique with MLR model. The performance of new MLR-Leave-One-Out (MLR-LOOCV) model evaluated using partitioning the whole data set. This technique used to validate the model developed from training data with test data to control the problem like over fitting. The accuracy of such prediction model is very poor. So we propose to build a Multilayer Perceptron Neural Network (MPNN) model with gradient descent learning method to improve the efficiency of prediction model. The new proposed model, MPNN with GD shows accuracy much greater than normal MLR. The data set from UCI machine learning repository is used for simulation methods to check the performance.","PeriodicalId":206653,"journal":{"name":"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Accuracy evaluation of prediction using supervised learning techniques\",\"authors\":\"S. K., Sajimon Abraham\",\"doi\":\"10.1145/3339311.3339337\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The term Big data is used to refer the huge volume of complex and growing data generated from many distinct electronic gadgets. In the case of Big Data, the commonly used programming methods are not adequate to collect, store and analyze the data within a short period. Statistics as well as machine learning techniques are used for finding patterns and information from large data through data driven decision-making. Big Data analytics gives competitive opportunities in designing business plans for Business Analytics. For analytical purpose, traditionally we use Multiple Linear Regression (MLR) model in the statistical method, a type of Supervised Machine Learning Algorithm. We implemented Cross-Validation Resampling technique with MLR model. The performance of new MLR-Leave-One-Out (MLR-LOOCV) model evaluated using partitioning the whole data set. This technique used to validate the model developed from training data with test data to control the problem like over fitting. The accuracy of such prediction model is very poor. So we propose to build a Multilayer Perceptron Neural Network (MPNN) model with gradient descent learning method to improve the efficiency of prediction model. The new proposed model, MPNN with GD shows accuracy much greater than normal MLR. The data set from UCI machine learning repository is used for simulation methods to check the performance.\",\"PeriodicalId\":206653,\"journal\":{\"name\":\"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3339311.3339337\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3339311.3339337","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

“大数据”一词指的是由许多不同的电子设备产生的大量复杂且不断增长的数据。在大数据的情况下,常用的编程方法不足以在短时间内收集、存储和分析数据。统计学和机器学习技术被用于通过数据驱动的决策从大数据中寻找模式和信息。大数据分析为商业分析设计商业计划提供了竞争机会。为了分析目的,传统上我们在统计方法中使用多元线性回归(MLR)模型,这是一种监督式机器学习算法。我们利用MLR模型实现了交叉验证重采样技术。通过对整个数据集进行分区,评价了MLR-Leave-One-Out (MLR-LOOCV)模型的性能。该技术用于用测试数据验证从训练数据开发的模型,以控制过度拟合等问题。这种预测模型的精度很差。为此,我们提出了一种基于梯度下降学习方法的多层感知器神经网络(MPNN)模型,以提高预测模型的效率。该模型的精度明显高于一般的MLR模型。UCI机器学习存储库中的数据集用于模拟方法来检查性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Accuracy evaluation of prediction using supervised learning techniques
The term Big data is used to refer the huge volume of complex and growing data generated from many distinct electronic gadgets. In the case of Big Data, the commonly used programming methods are not adequate to collect, store and analyze the data within a short period. Statistics as well as machine learning techniques are used for finding patterns and information from large data through data driven decision-making. Big Data analytics gives competitive opportunities in designing business plans for Business Analytics. For analytical purpose, traditionally we use Multiple Linear Regression (MLR) model in the statistical method, a type of Supervised Machine Learning Algorithm. We implemented Cross-Validation Resampling technique with MLR model. The performance of new MLR-Leave-One-Out (MLR-LOOCV) model evaluated using partitioning the whole data set. This technique used to validate the model developed from training data with test data to control the problem like over fitting. The accuracy of such prediction model is very poor. So we propose to build a Multilayer Perceptron Neural Network (MPNN) model with gradient descent learning method to improve the efficiency of prediction model. The new proposed model, MPNN with GD shows accuracy much greater than normal MLR. The data set from UCI machine learning repository is used for simulation methods to check the performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信