{"title":"使用监督学习技术评估预测的准确性","authors":"S. K., Sajimon Abraham","doi":"10.1145/3339311.3339337","DOIUrl":null,"url":null,"abstract":"The term Big data is used to refer the huge volume of complex and growing data generated from many distinct electronic gadgets. In the case of Big Data, the commonly used programming methods are not adequate to collect, store and analyze the data within a short period. Statistics as well as machine learning techniques are used for finding patterns and information from large data through data driven decision-making. Big Data analytics gives competitive opportunities in designing business plans for Business Analytics. For analytical purpose, traditionally we use Multiple Linear Regression (MLR) model in the statistical method, a type of Supervised Machine Learning Algorithm. We implemented Cross-Validation Resampling technique with MLR model. The performance of new MLR-Leave-One-Out (MLR-LOOCV) model evaluated using partitioning the whole data set. This technique used to validate the model developed from training data with test data to control the problem like over fitting. The accuracy of such prediction model is very poor. So we propose to build a Multilayer Perceptron Neural Network (MPNN) model with gradient descent learning method to improve the efficiency of prediction model. The new proposed model, MPNN with GD shows accuracy much greater than normal MLR. The data set from UCI machine learning repository is used for simulation methods to check the performance.","PeriodicalId":206653,"journal":{"name":"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Accuracy evaluation of prediction using supervised learning techniques\",\"authors\":\"S. K., Sajimon Abraham\",\"doi\":\"10.1145/3339311.3339337\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The term Big data is used to refer the huge volume of complex and growing data generated from many distinct electronic gadgets. In the case of Big Data, the commonly used programming methods are not adequate to collect, store and analyze the data within a short period. Statistics as well as machine learning techniques are used for finding patterns and information from large data through data driven decision-making. Big Data analytics gives competitive opportunities in designing business plans for Business Analytics. For analytical purpose, traditionally we use Multiple Linear Regression (MLR) model in the statistical method, a type of Supervised Machine Learning Algorithm. We implemented Cross-Validation Resampling technique with MLR model. The performance of new MLR-Leave-One-Out (MLR-LOOCV) model evaluated using partitioning the whole data set. This technique used to validate the model developed from training data with test data to control the problem like over fitting. The accuracy of such prediction model is very poor. So we propose to build a Multilayer Perceptron Neural Network (MPNN) model with gradient descent learning method to improve the efficiency of prediction model. The new proposed model, MPNN with GD shows accuracy much greater than normal MLR. The data set from UCI machine learning repository is used for simulation methods to check the performance.\",\"PeriodicalId\":206653,\"journal\":{\"name\":\"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3339311.3339337\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3339311.3339337","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Accuracy evaluation of prediction using supervised learning techniques
The term Big data is used to refer the huge volume of complex and growing data generated from many distinct electronic gadgets. In the case of Big Data, the commonly used programming methods are not adequate to collect, store and analyze the data within a short period. Statistics as well as machine learning techniques are used for finding patterns and information from large data through data driven decision-making. Big Data analytics gives competitive opportunities in designing business plans for Business Analytics. For analytical purpose, traditionally we use Multiple Linear Regression (MLR) model in the statistical method, a type of Supervised Machine Learning Algorithm. We implemented Cross-Validation Resampling technique with MLR model. The performance of new MLR-Leave-One-Out (MLR-LOOCV) model evaluated using partitioning the whole data set. This technique used to validate the model developed from training data with test data to control the problem like over fitting. The accuracy of such prediction model is very poor. So we propose to build a Multilayer Perceptron Neural Network (MPNN) model with gradient descent learning method to improve the efficiency of prediction model. The new proposed model, MPNN with GD shows accuracy much greater than normal MLR. The data set from UCI machine learning repository is used for simulation methods to check the performance.