{"title":"挖掘数据流中出现的模式和分类","authors":"Hamad Alhammady, K. Ramamohanarao","doi":"10.1109/WI.2005.96","DOIUrl":null,"url":null,"abstract":"A data stream model has been proposed recently for those data intensive applications such as financial applications, manufacturing, and others (Babcock et al., 2002). In this model, data arrives in multiple, continuous, rapid, time-varying data streams. These characteristics make it infeasible for traditional classification and mining techniques to deal with data streams. In this paper, we propose a novel method for mining emerging patterns (EPs) in data streams. Moreover, we show how these EPs can be used to classify data streams. EPs (Dong and Li, 1999) are those itemsets whose supports in one class are significantly higher than their supports in the other classes. The experimental evaluation shows that our proposed method can achieve up to 10% increase in accuracy compared to the other methods.","PeriodicalId":213856,"journal":{"name":"The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)","volume":"213 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":"{\"title\":\"Mining emerging patterns and classification in data streams\",\"authors\":\"Hamad Alhammady, K. Ramamohanarao\",\"doi\":\"10.1109/WI.2005.96\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A data stream model has been proposed recently for those data intensive applications such as financial applications, manufacturing, and others (Babcock et al., 2002). In this model, data arrives in multiple, continuous, rapid, time-varying data streams. These characteristics make it infeasible for traditional classification and mining techniques to deal with data streams. In this paper, we propose a novel method for mining emerging patterns (EPs) in data streams. Moreover, we show how these EPs can be used to classify data streams. EPs (Dong and Li, 1999) are those itemsets whose supports in one class are significantly higher than their supports in the other classes. The experimental evaluation shows that our proposed method can achieve up to 10% increase in accuracy compared to the other methods.\",\"PeriodicalId\":213856,\"journal\":{\"name\":\"The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)\",\"volume\":\"213 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"20\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WI.2005.96\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI.2005.96","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20
摘要
最近提出了一种数据流模型,用于那些数据密集型应用,如金融应用、制造业和其他应用(Babcock et al., 2002)。在这个模型中,数据以多个连续的、快速的、时变的数据流到达。这些特点使得传统的分类和挖掘技术无法处理数据流。本文提出了一种挖掘数据流中新兴模式(EPs)的新方法。此外,我们还展示了如何使用这些ep对数据流进行分类。EPs (Dong and Li, 1999)是指某一类的支持度显著高于其他类的支持度的项目集。实验结果表明,该方法与其他方法相比,准确率提高了10%以上。
Mining emerging patterns and classification in data streams
A data stream model has been proposed recently for those data intensive applications such as financial applications, manufacturing, and others (Babcock et al., 2002). In this model, data arrives in multiple, continuous, rapid, time-varying data streams. These characteristics make it infeasible for traditional classification and mining techniques to deal with data streams. In this paper, we propose a novel method for mining emerging patterns (EPs) in data streams. Moreover, we show how these EPs can be used to classify data streams. EPs (Dong and Li, 1999) are those itemsets whose supports in one class are significantly higher than their supports in the other classes. The experimental evaluation shows that our proposed method can achieve up to 10% increase in accuracy compared to the other methods.