{"title":"任意时间粒度数据流中Top K项的一种新方法","authors":"Shu Pingda, Chen Hua-hui","doi":"10.1109/CSSE.2008.973","DOIUrl":null,"url":null,"abstract":"Finding top K items in data streams means finding K items whose frequence are larger than other items in data streams. There are some methods to find most frequent K items in the whole data streams, but they can't be used in arbitrary time interval. This paper proposes a new method-MMF(K)_MS to find most frequent K items based on Hierarchical Synopsis. MMF(K)_MS supports query in arbitrary time interval through using HFVN framework with variable number of node in every layer and using Count Stretch data structure to maintain Synopsis in each layer. At Last, Proving MMF(K)_MS rational and available by experiment.","PeriodicalId":6460,"journal":{"name":"2017 14th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"31 1","pages":"267-270"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A New Method to Find Top K Items in Data Streams at Arbitrary Time Granularities\",\"authors\":\"Shu Pingda, Chen Hua-hui\",\"doi\":\"10.1109/CSSE.2008.973\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Finding top K items in data streams means finding K items whose frequence are larger than other items in data streams. There are some methods to find most frequent K items in the whole data streams, but they can't be used in arbitrary time interval. This paper proposes a new method-MMF(K)_MS to find most frequent K items based on Hierarchical Synopsis. MMF(K)_MS supports query in arbitrary time interval through using HFVN framework with variable number of node in every layer and using Count Stretch data structure to maintain Synopsis in each layer. At Last, Proving MMF(K)_MS rational and available by experiment.\",\"PeriodicalId\":6460,\"journal\":{\"name\":\"2017 14th International Joint Conference on Computer Science and Software Engineering (JCSSE)\",\"volume\":\"31 1\",\"pages\":\"267-270\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 14th International Joint Conference on Computer Science and Software Engineering (JCSSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSSE.2008.973\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSSE.2008.973","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A New Method to Find Top K Items in Data Streams at Arbitrary Time Granularities
Finding top K items in data streams means finding K items whose frequence are larger than other items in data streams. There are some methods to find most frequent K items in the whole data streams, but they can't be used in arbitrary time interval. This paper proposes a new method-MMF(K)_MS to find most frequent K items based on Hierarchical Synopsis. MMF(K)_MS supports query in arbitrary time interval through using HFVN framework with variable number of node in every layer and using Count Stretch data structure to maintain Synopsis in each layer. At Last, Proving MMF(K)_MS rational and available by experiment.