Investigating the Impact of Bursty Traffic on Hoeffding Tree Algorithm in Stream Mining over Internet
Authors: Yang Hang, S. Fong
Published in: 2010 2nd International Conference on Evolving Internet, 2010-09-20
DOI: 10.1109/INTERNET.2010.33 (https://doi.org/10.1109/INTERNET.2010.33)
Citations: 7
Abstract
Stream data are continuous and ubiquitous, and they arise in many Web applications operating on the Internet. Examples include web logs, online users’ click-streams, online media streaming, and Web transaction records. Stream mining was proposed as a relatively new data-analytic solution for handling such streams, and it has been widely acclaimed for its usefulness in real-time decision-support applications such as web recommenders. The Hoeffding Tree Algorithm (HTA) is one of the popular choices for implementing the Very Fast Decision Tree in stream mining. Its theoretical aspects have been studied extensively. However, the literature usually assumes that data streams are fed into HTA at a constant rate, and HTA has yet to be tested under bursty traffic such as that of the Internet environment. This paper sheds some light on the impact of bursty traffic on the performance of HTA in stream mining.
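To make the constant-rate assumption concrete, the following is a minimal sketch and not the authors' experimental setup. It uses the standard Hoeffding bound, epsilon = sqrt(R^2 ln(1/delta) / (2n)), which a Hoeffding tree uses to decide when enough examples have been seen at a leaf to split. The helper names, the arrival patterns, and the parameter values (R = 1, delta = 1e-7, epsilon = 0.1) are illustrative assumptions, not values taken from the paper.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Standard Hoeffding bound: with probability 1 - delta, the true mean
    lies within this epsilon of the mean observed over n examples."""
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def examples_needed(value_range: float, delta: float, epsilon: float) -> int:
    """Smallest n for which the Hoeffding bound drops to epsilon or below."""
    return math.ceil(value_range ** 2 * math.log(1.0 / delta) / (2.0 * epsilon ** 2))


def ticks_until(arrivals_per_tick, n_required):
    """Hypothetical helper: first time tick (1-based) at which the cumulative
    number of arrived examples reaches n_required, or None if it never does."""
    seen = 0
    for tick, count in enumerate(arrivals_per_tick, start=1):
        seen += count
        if seen >= n_required:
            return tick
    return None


# Assumed illustrative parameters (not from the paper).
n_required = examples_needed(value_range=1.0, delta=1e-7, epsilon=0.1)

# Two synthetic arrival patterns with the same total volume (1000 examples).
constant = [10] * 100                 # 10 examples every tick
bursty = ([0] * 9 + [100]) * 10       # silence, then a burst of 100

print("examples needed:", n_required)
print("constant-rate ticks:", ticks_until(constant, n_required))
print("bursty ticks:", ticks_until(bursty, n_required))
```

The bound depends only on the number of examples seen, not on when they arrive, so in this toy setting burstiness changes the wall-clock moment at which a split can be evaluated rather than the statistical guarantee itself. Effects on memory, processing backlog, or accuracy under load are outside what this sketch models.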