F. Majeed, S. Mahmood, Saqib Ubaid, Naveed Khalil, Sadaf Siddiqi, Fasiha Ashraf
Title: A burst resolution technique for data streams management in the real-time data warehouse
Venue: 2011 7th International Conference on Emerging Technologies
Published: 2011-10-20
DOI: 10.1109/ICET.2011.6048446 (https://doi.org/10.1109/ICET.2011.6048446)
Citations: 1
Abstract
Data stream sources have emerged as the traditional data warehouse evolves into the real-time data warehouse. Various solutions have been proposed to extract, transform, and load data streams, but handling bursts of incoming data streams still needs investigation. In this paper, we propose a flow regulation technique that regulates fast, time-varying bursts of data streams. For this purpose, we adapt the token bucket, a simple and flexible mechanism with little overhead. The objectives of this research are to minimize the probability of dropping data streams, to synchronize processing power, and to balance the load of arriving data streams. We propose an algorithm for the flow regulation technique to regulate the data streams efficiently. We evaluated the technique on a synthetic dataset and found that it works well in the presence of bursty data streams.
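
The abstract does not give the algorithm itself, so the following is only a minimal sketch of a generic token bucket placed in front of a bounded buffer, not the authors' method. The class name `TokenBucketRegulator`, its parameters (`rate`, `capacity`, `queue_limit`), and the one-token-per-tuple cost are all assumptions made for illustration. Time is passed in explicitly so the behavior is easy to follow.

```python
from collections import deque


class TokenBucketRegulator:
    """Illustrative token bucket in front of a bounded buffer (hypothetical names)."""

    def __init__(self, rate, capacity, queue_limit):
        self.rate = rate                # tokens added per second (assumed unit)
        self.capacity = capacity        # maximum tokens the bucket can hold
        self.queue_limit = queue_limit  # maximum number of buffered tuples
        self.tokens = capacity          # the bucket starts full
        self.last = 0.0                 # time of the last refill
        self.queue = deque()            # tuples waiting for a token
        self.dropped = 0                # tuples rejected because the buffer was full

    def _refill(self, now):
        # Add tokens for the elapsed time, capped at the bucket capacity.
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last = now

    def offer(self, item):
        """Buffer an arriving tuple; drop it only if the buffer is already full."""
        if len(self.queue) >= self.queue_limit:
            self.dropped += 1
            return False
        self.queue.append(item)
        return True

    def release(self, now):
        """Return the tuples that may be forwarded at time `now`, one token each."""
        self._refill(now)
        out = []
        while self.queue and self.tokens >= 1:
            self.tokens -= 1
            out.append(self.queue.popleft())
        return out
```

In this sketch a burst is absorbed by the buffer and drained at the token rate, so tuples are dropped only when the burst exceeds `queue_limit`. The bucket capacity bounds how many tuples can be forwarded back to back, and the rate bounds the long-run load on the downstream loader.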