{"title":"Optimization house price prediction model using gradient boosted regression trees (GBRT) and xgboost algorithm","authors":"Putri Susi Sundari, Mahardika Khafidz Putra","doi":"10.52465/josre.v2i1.176","DOIUrl":null,"url":null,"abstract":"In this rapidly advancing technological era, the demand for the real estate industry has also increased, including in the field of house price prediction. House prices fluctuate every year due to several factors such as changes in land prices, location, year of construction, infrastructure developments, and other factors. Numerous studies have been conducted on this issue. However, the challenge lies in building a proven accurate and effective model for predicting house prices with the abundance of features present in the dataset. The objective of this research is to develop a predictive model that can accurately estimate house prices based on relevant features or variables. The researcher utilizes ensemble learning techniques, combining the Gradient Boosted Regression Trees (GBRT) and XGBoost algorithms. The dataset used in this article is titled \"Ames Housing dataset\" obtained from Kaggle. The predictive model is then evaluated using the Root Mean Squared Error (RMSE) method. The RMSE result from a previous study that used the combination of Lasso and XGBoost was 0.11260, while the RMSE result from this research is 0.00480. This indicates a decrease in the RMSE value, indicating a lower level of error in the model. It also means that the combination of GBRT and XGBoost algorithms successfully improves the prediction accuracy of the previous research model.","PeriodicalId":105983,"journal":{"name":"Journal of Student Research Exploration","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Student Research Exploration","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52465/josre.v2i1.176","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this rapidly advancing technological era, the demand for the real estate industry has also increased, including in the field of house price prediction. House prices fluctuate every year due to several factors such as changes in land prices, location, year of construction, infrastructure developments, and other factors. Numerous studies have been conducted on this issue. However, the challenge lies in building a proven accurate and effective model for predicting house prices with the abundance of features present in the dataset. The objective of this research is to develop a predictive model that can accurately estimate house prices based on relevant features or variables. The researcher utilizes ensemble learning techniques, combining the Gradient Boosted Regression Trees (GBRT) and XGBoost algorithms. The dataset used in this article is titled "Ames Housing dataset" obtained from Kaggle. The predictive model is then evaluated using the Root Mean Squared Error (RMSE) method. The RMSE result from a previous study that used the combination of Lasso and XGBoost was 0.11260, while the RMSE result from this research is 0.00480. This indicates a decrease in the RMSE value, indicating a lower level of error in the model. It also means that the combination of GBRT and XGBoost algorithms successfully improves the prediction accuracy of the previous research model.