Estimated Missing Data on Multiple Linear Regression Using Algorithm EM (Expectation-Maximization) for Prediction Revenue Company

Samsul Pahmi, Nunik Destria Arianti, Imam Fahrudin, Ana Zanatun Amalia, Erdi Ardiansyah, Sarah Rahman
{"title":"Estimated Missing Data on Multiple Linear Regression Using Algorithm EM (Expectation-Maximization) for Prediction Revenue Company","authors":"Samsul Pahmi, Nunik Destria Arianti, Imam Fahrudin, Ana Zanatun Amalia, Erdi Ardiansyah, Sarah Rahman","doi":"10.1109/ICCED.2018.00025","DOIUrl":null,"url":null,"abstract":"This study aims to identify problems in the company which in the past two years have experienced unpredictable fluctuations in revenue and do not seem to improve the overall trend in two years. This research is a type of quantitative research involving continuous data types and using 20 independent variables and 1 dependent variable. Data collection was carried out at companies in Sukabumi, Indonesia with data collection methods using interviews and documentation. In the process of data collection, there are some missing data. Based on the type of data that is retrieved, the selection method used in this research is divided into two methods, Expectation-Maximization algorithm for estimating missing data and Multiple Linear Regression to determine the effect of variables on company revenue and to predict the company revenue in the next period. The results showed that using EM Algorithm using the reference variable that has the same character can be predicted the missing data from the initial data. From the calculation of hypothesis test can be concluded that only Luggage, Cleaning Aid, and Glass Ware which have no significant effect to the company's revenue; While the calculation using multiple linear regression analysis there are 5 types of goods that have a negative influence with company revenue, the 5 types are: Hair care, Luggage, Bakery, Cleaning Aid, and Snack. Thus, the overall type of goods that are very necessary to be evaluated by the company are: Luggage, Bakery, Cleaning Aid, Snack and Glass Ware, because of the type of goods that do not have significant influence or negatively effect to the increase of the company's revenue.","PeriodicalId":166437,"journal":{"name":"2018 International Conference on Computing, Engineering, and Design (ICCED)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Computing, Engineering, and Design (ICCED)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCED.2018.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

This study aims to identify problems in the company which in the past two years have experienced unpredictable fluctuations in revenue and do not seem to improve the overall trend in two years. This research is a type of quantitative research involving continuous data types and using 20 independent variables and 1 dependent variable. Data collection was carried out at companies in Sukabumi, Indonesia with data collection methods using interviews and documentation. In the process of data collection, there are some missing data. Based on the type of data that is retrieved, the selection method used in this research is divided into two methods, Expectation-Maximization algorithm for estimating missing data and Multiple Linear Regression to determine the effect of variables on company revenue and to predict the company revenue in the next period. The results showed that using EM Algorithm using the reference variable that has the same character can be predicted the missing data from the initial data. From the calculation of hypothesis test can be concluded that only Luggage, Cleaning Aid, and Glass Ware which have no significant effect to the company's revenue; While the calculation using multiple linear regression analysis there are 5 types of goods that have a negative influence with company revenue, the 5 types are: Hair care, Luggage, Bakery, Cleaning Aid, and Snack. Thus, the overall type of goods that are very necessary to be evaluated by the company are: Luggage, Bakery, Cleaning Aid, Snack and Glass Ware, because of the type of goods that do not have significant influence or negatively effect to the increase of the company's revenue.
基于期望最大化算法的预测收益公司多元线性回归缺失数据估计
本研究旨在找出公司在过去两年中出现不可预测的收入波动,并在两年内似乎没有改善整体趋势的问题。本研究是一种涉及连续数据类型的定量研究,使用20个自变量和1个因变量。数据收集是在印度尼西亚苏卡umi的公司进行的,数据收集方法是使用访谈和文件。在数据收集的过程中,存在一些数据缺失。根据检索到的数据类型,本研究使用的选择方法分为两种方法,一种是用于估计缺失数据的期望最大化算法,另一种是用于确定变量对公司收入的影响并预测公司下一时期收入的多元线性回归。结果表明,使用具有相同特征的参考变量的EM算法可以预测初始数据中的缺失数据。从假设检验的计算可以得出,只有箱包、清洁用品和玻璃器皿对公司的收入没有显著影响;而通过多元线性回归分析的计算,有5种商品对公司收入有负面影响,这5种商品是:护发、箱包、面包、清洁用品和零食。因此,公司非常有必要评估的整体商品类型是:箱包,面包店,清洁用品,零食和玻璃器皿,因为这类商品对公司收入的增加没有显著的影响或负面影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信