{"title":"Machine Learning Models for Salary Prediction Dataset using Python","authors":"Reham Kablaoui, A. Salman","doi":"10.1109/ICECTA57148.2022.9990316","DOIUrl":null,"url":null,"abstract":"In today’s world, salary is the primary source of motivation for many regular employees, which makes salary prediction very important for both employers and employees. It helps employers and employees to make estimations of the expected salary. Fortunately, technological advancements like Data Science and Machine Learning (ML) have made salary prediction more realistic. In this paper, we exploit the benefits of data science to collect a 20,000+ dataset of salaries in the USA. We then apply three supervised ML techniques to the obtained datasets to produce salary prediction. The learning models are linear regression, random forest, and neural networks. The output of the three models is analyzed and compared to show the following; neural network outperforms the other ML models for better accuracy with accuracy level 83.2%, and linear regression has the fastest time of 0.363s for training the model.","PeriodicalId":337798,"journal":{"name":"2022 International Conference on Electrical and Computing Technologies and Applications (ICECTA)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Electrical and Computing Technologies and Applications (ICECTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECTA57148.2022.9990316","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In today’s world, salary is the primary source of motivation for many regular employees, which makes salary prediction very important for both employers and employees. It helps employers and employees to make estimations of the expected salary. Fortunately, technological advancements like Data Science and Machine Learning (ML) have made salary prediction more realistic. In this paper, we exploit the benefits of data science to collect a 20,000+ dataset of salaries in the USA. We then apply three supervised ML techniques to the obtained datasets to produce salary prediction. The learning models are linear regression, random forest, and neural networks. The output of the three models is analyzed and compared to show the following; neural network outperforms the other ML models for better accuracy with accuracy level 83.2%, and linear regression has the fastest time of 0.363s for training the model.