{"title":"Information Decay in Building Predictive Models Using Temporal Data","authors":"Lasheng Yu, Mukwende Placide","doi":"10.4304/JSW.7.2.479-484","DOIUrl":null,"url":null,"abstract":"Most predictive data mining methods are based on the assumption that the historical data involved in building and verifying a model are the best estimator of what will happen in the future. However, the relevance of past to the future depends on the application domain in a specific timeframe. In this paper, we present a method of predicting the future from temporal data using information decay technique that estimate the relevance of the past to the future with the help of the knowledge of the information lifetime of the temporal data. We show how to use a decision tree data mining technique to build an information-decay-based predictive model by introducing an information decay method in impurity measure functions. The relevance of the concepts is proven by comparing two decision tree-based classifiers: one built using information decay over a specific timeframe and the other built setting aside the time factor.","PeriodicalId":206833,"journal":{"name":"2010 Third International Symposium on Information Science and Engineering","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Third International Symposium on Information Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4304/JSW.7.2.479-484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Most predictive data mining methods are based on the assumption that the historical data involved in building and verifying a model are the best estimator of what will happen in the future. However, the relevance of past to the future depends on the application domain in a specific timeframe. In this paper, we present a method of predicting the future from temporal data using information decay technique that estimate the relevance of the past to the future with the help of the knowledge of the information lifetime of the temporal data. We show how to use a decision tree data mining technique to build an information-decay-based predictive model by introducing an information decay method in impurity measure functions. The relevance of the concepts is proven by comparing two decision tree-based classifiers: one built using information decay over a specific timeframe and the other built setting aside the time factor.