{"title":"基于随机森林和多元线性回归的自行车租赁需求预测","authors":"YouLi Feng, Shanshan Wang","doi":"10.1109/ICIS.2017.7959977","DOIUrl":null,"url":null,"abstract":"Bike sharing system is a ways of renting bicycles; bike return is automated via a network of kiosk locations throughout a city. Using these systems, people are able to rent a bike from a one pick up location and combine with their as-need, customer returns bike to the place, which they would prefer to return. This paper is asked to combine historical usage patterns with weather data in order to forecast bike rental demand in the Capital Bike share program in Washington, D.C. Firstly, the multiple linear regression model was established by the conventional method, Multiple linear regression equation was obtained by using SPSS software, After comparing the data with the real value, it is indicated that the multiple linear regression model is less accurate. After analysis, we find that the data includes the dummy variables such as the time and the season. Hence this paper proposes a random forest model and a GBM packet to improve the decision tree. The results and the accuracy of multiple regression analysis are greatly improved when use of random forest model to predict the demand for bicycle rental.","PeriodicalId":301467,"journal":{"name":"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)","volume":"215 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"44","resultStr":"{\"title\":\"A forecast for bicycle rental demand based on random forests and multiple linear regression\",\"authors\":\"YouLi Feng, Shanshan Wang\",\"doi\":\"10.1109/ICIS.2017.7959977\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Bike sharing system is a ways of renting bicycles; bike return is automated via a network of kiosk locations throughout a city. Using these systems, people are able to rent a bike from a one pick up location and combine with their as-need, customer returns bike to the place, which they would prefer to return. This paper is asked to combine historical usage patterns with weather data in order to forecast bike rental demand in the Capital Bike share program in Washington, D.C. Firstly, the multiple linear regression model was established by the conventional method, Multiple linear regression equation was obtained by using SPSS software, After comparing the data with the real value, it is indicated that the multiple linear regression model is less accurate. After analysis, we find that the data includes the dummy variables such as the time and the season. Hence this paper proposes a random forest model and a GBM packet to improve the decision tree. The results and the accuracy of multiple regression analysis are greatly improved when use of random forest model to predict the demand for bicycle rental.\",\"PeriodicalId\":301467,\"journal\":{\"name\":\"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)\",\"volume\":\"215 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"44\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIS.2017.7959977\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIS.2017.7959977","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A forecast for bicycle rental demand based on random forests and multiple linear regression
Bike sharing system is a ways of renting bicycles; bike return is automated via a network of kiosk locations throughout a city. Using these systems, people are able to rent a bike from a one pick up location and combine with their as-need, customer returns bike to the place, which they would prefer to return. This paper is asked to combine historical usage patterns with weather data in order to forecast bike rental demand in the Capital Bike share program in Washington, D.C. Firstly, the multiple linear regression model was established by the conventional method, Multiple linear regression equation was obtained by using SPSS software, After comparing the data with the real value, it is indicated that the multiple linear regression model is less accurate. After analysis, we find that the data includes the dummy variables such as the time and the season. Hence this paper proposes a random forest model and a GBM packet to improve the decision tree. The results and the accuracy of multiple regression analysis are greatly improved when use of random forest model to predict the demand for bicycle rental.