{"title":"基于改进特征选择的共享单车需求随机森林预测","authors":"Pengcheng Zhao, Xiaolei Zhang, Shibao Sun","doi":"10.1145/3375998.3375999","DOIUrl":null,"url":null,"abstract":"Under the promotion of green travel, shared bicycles are developing rapidly, but urban road space is wasted due to unreasonable planning. In order to accurately place the number of bicycles, this paper predicts the demand for shared bicycles based on factors such as weather, seasonality and temperature and humidity. Faced with the complexity and collinearity of data features, a random forest prediction shared bicycle demand model with improved feature selection is proposed. First, features with collinearity are excluded by partitioning the feature saliency and correlation coefficient values. Then, the data is effectively characterized. Therefore, the upper bound of the generalization error of the algorithm is reduced. The final prediction model improves prediction accuracy. Experiments show that the random forest algorithm with improved feature selection is optimized, which is compared to other regression algorithms in terms of demand prediction accuracy and fitness. The method is effective.","PeriodicalId":395773,"journal":{"name":"Proceedings of the 2019 8th International Conference on Networks, Communication and Computing","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Random Forest Prediction with Improved Feature Selection to Shared Bicycle Demand\",\"authors\":\"Pengcheng Zhao, Xiaolei Zhang, Shibao Sun\",\"doi\":\"10.1145/3375998.3375999\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Under the promotion of green travel, shared bicycles are developing rapidly, but urban road space is wasted due to unreasonable planning. In order to accurately place the number of bicycles, this paper predicts the demand for shared bicycles based on factors such as weather, seasonality and temperature and humidity. Faced with the complexity and collinearity of data features, a random forest prediction shared bicycle demand model with improved feature selection is proposed. First, features with collinearity are excluded by partitioning the feature saliency and correlation coefficient values. Then, the data is effectively characterized. Therefore, the upper bound of the generalization error of the algorithm is reduced. The final prediction model improves prediction accuracy. Experiments show that the random forest algorithm with improved feature selection is optimized, which is compared to other regression algorithms in terms of demand prediction accuracy and fitness. The method is effective.\",\"PeriodicalId\":395773,\"journal\":{\"name\":\"Proceedings of the 2019 8th International Conference on Networks, Communication and Computing\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 8th International Conference on Networks, Communication and Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3375998.3375999\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 8th International Conference on Networks, Communication and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3375998.3375999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Random Forest Prediction with Improved Feature Selection to Shared Bicycle Demand
Under the promotion of green travel, shared bicycles are developing rapidly, but urban road space is wasted due to unreasonable planning. In order to accurately place the number of bicycles, this paper predicts the demand for shared bicycles based on factors such as weather, seasonality and temperature and humidity. Faced with the complexity and collinearity of data features, a random forest prediction shared bicycle demand model with improved feature selection is proposed. First, features with collinearity are excluded by partitioning the feature saliency and correlation coefficient values. Then, the data is effectively characterized. Therefore, the upper bound of the generalization error of the algorithm is reduced. The final prediction model improves prediction accuracy. Experiments show that the random forest algorithm with improved feature selection is optimized, which is compared to other regression algorithms in terms of demand prediction accuracy and fitness. The method is effective.