{"title":"基于spark平台的出租车轨道数据分析","authors":"Chengcheng Li, Yu Liu, H. Zhang","doi":"10.1109/IAEAC47372.2019.8998030","DOIUrl":null,"url":null,"abstract":"How to analyze a large amount of traffic data, improve the level of urban traffic management, and discover the travel rules of urban residents from complex data, and rationally dispatch vehicles. Combined with the trajectory data of the yellow and green taxis in New York City, the traffic data is processed based on the Spark distributed data processing platform, and the K-means clustering algorithm is used to analyze the passengers' access points. The visualization of taxi passenger travel characteristics based on the platform is given, including the impact of rental car passengers, the distribution of urban residents' travel time and the distribution of taxi area speed. The gradient lifting algorithm is used to predict the importance of each feature of the training set, so as to rationally dispatch the vehicle.","PeriodicalId":164163,"journal":{"name":"2019 IEEE 4th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of taxi track data based on spark platform\",\"authors\":\"Chengcheng Li, Yu Liu, H. Zhang\",\"doi\":\"10.1109/IAEAC47372.2019.8998030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"How to analyze a large amount of traffic data, improve the level of urban traffic management, and discover the travel rules of urban residents from complex data, and rationally dispatch vehicles. Combined with the trajectory data of the yellow and green taxis in New York City, the traffic data is processed based on the Spark distributed data processing platform, and the K-means clustering algorithm is used to analyze the passengers' access points. The visualization of taxi passenger travel characteristics based on the platform is given, including the impact of rental car passengers, the distribution of urban residents' travel time and the distribution of taxi area speed. The gradient lifting algorithm is used to predict the importance of each feature of the training set, so as to rationally dispatch the vehicle.\",\"PeriodicalId\":164163,\"journal\":{\"name\":\"2019 IEEE 4th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 4th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAEAC47372.2019.8998030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 4th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAEAC47372.2019.8998030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis of taxi track data based on spark platform
How to analyze a large amount of traffic data, improve the level of urban traffic management, and discover the travel rules of urban residents from complex data, and rationally dispatch vehicles. Combined with the trajectory data of the yellow and green taxis in New York City, the traffic data is processed based on the Spark distributed data processing platform, and the K-means clustering algorithm is used to analyze the passengers' access points. The visualization of taxi passenger travel characteristics based on the platform is given, including the impact of rental car passengers, the distribution of urban residents' travel time and the distribution of taxi area speed. The gradient lifting algorithm is used to predict the importance of each feature of the training set, so as to rationally dispatch the vehicle.