{"title":"时间序列、决策树和聚类的结合:以气象事件预测为例","authors":"S.B. Lajevardi, B. Minaei-Bidgoli","doi":"10.1109/ICCEE.2008.110","DOIUrl":null,"url":null,"abstract":"Predictive systems use historical and other available data to predict an event. In this paper we propose a general framework to predict the Aerology events with time series streams and events stream using combination of K-means clustering algorithm and Decision Tree C5 algorithm. Firstly, we find the closest time series record for any events; therefore, we have gathered different parameters value when an event is occurring. Using K-means we add a field to data set which determines the cluster of each record after that by using C5 algorithm we predict events. C5 Decision Tree Algorithm is one of the well-known Decision Tree Algorithms. This framework and time series model can predict future events efficiently. We gathered 1961 until 2005 data of aerology organization for Tehran Mehrabad Station. This data contains some fields such as wet bulb, relative humidity, amount of cloud, wind speed and etc. This data set includes 17 types of events. Time series models can predict next time series parameters value and by using this Framework the closest event can be predicted. The C5 method is able to predict Events with Correct 74.11 percent and Wrong 25.89 percent. But with the aims of K-means clustering algorithm the prediction increase to 85 percent and wrong to 15 percent. 90 percent of data was used for training set and 10 percent for test set. We use 10-fold cross validation to evaluate our prediction rate. This framework is the first estimation in the area of event prediction for a huge data set of aerology and can be extended in many different data sets in any other environments.","PeriodicalId":365473,"journal":{"name":"2008 International Conference on Computer and Electrical Engineering","volume":"264 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Combination of Time Series, Decision Tree and Clustering: A Case Study in Aerology Event Prediction\",\"authors\":\"S.B. Lajevardi, B. Minaei-Bidgoli\",\"doi\":\"10.1109/ICCEE.2008.110\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predictive systems use historical and other available data to predict an event. In this paper we propose a general framework to predict the Aerology events with time series streams and events stream using combination of K-means clustering algorithm and Decision Tree C5 algorithm. Firstly, we find the closest time series record for any events; therefore, we have gathered different parameters value when an event is occurring. Using K-means we add a field to data set which determines the cluster of each record after that by using C5 algorithm we predict events. C5 Decision Tree Algorithm is one of the well-known Decision Tree Algorithms. This framework and time series model can predict future events efficiently. We gathered 1961 until 2005 data of aerology organization for Tehran Mehrabad Station. This data contains some fields such as wet bulb, relative humidity, amount of cloud, wind speed and etc. This data set includes 17 types of events. Time series models can predict next time series parameters value and by using this Framework the closest event can be predicted. The C5 method is able to predict Events with Correct 74.11 percent and Wrong 25.89 percent. But with the aims of K-means clustering algorithm the prediction increase to 85 percent and wrong to 15 percent. 90 percent of data was used for training set and 10 percent for test set. We use 10-fold cross validation to evaluate our prediction rate. This framework is the first estimation in the area of event prediction for a huge data set of aerology and can be extended in many different data sets in any other environments.\",\"PeriodicalId\":365473,\"journal\":{\"name\":\"2008 International Conference on Computer and Electrical Engineering\",\"volume\":\"264 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-12-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Conference on Computer and Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCEE.2008.110\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Conference on Computer and Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEE.2008.110","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Combination of Time Series, Decision Tree and Clustering: A Case Study in Aerology Event Prediction
Predictive systems use historical and other available data to predict an event. In this paper we propose a general framework to predict the Aerology events with time series streams and events stream using combination of K-means clustering algorithm and Decision Tree C5 algorithm. Firstly, we find the closest time series record for any events; therefore, we have gathered different parameters value when an event is occurring. Using K-means we add a field to data set which determines the cluster of each record after that by using C5 algorithm we predict events. C5 Decision Tree Algorithm is one of the well-known Decision Tree Algorithms. This framework and time series model can predict future events efficiently. We gathered 1961 until 2005 data of aerology organization for Tehran Mehrabad Station. This data contains some fields such as wet bulb, relative humidity, amount of cloud, wind speed and etc. This data set includes 17 types of events. Time series models can predict next time series parameters value and by using this Framework the closest event can be predicted. The C5 method is able to predict Events with Correct 74.11 percent and Wrong 25.89 percent. But with the aims of K-means clustering algorithm the prediction increase to 85 percent and wrong to 15 percent. 90 percent of data was used for training set and 10 percent for test set. We use 10-fold cross validation to evaluate our prediction rate. This framework is the first estimation in the area of event prediction for a huge data set of aerology and can be extended in many different data sets in any other environments.