{"title":"基于so集的频繁项集快速挖掘新算法","authors":"Long Tan, Q. Qin","doi":"10.1109/ICEICT.2016.7879713","DOIUrl":null,"url":null,"abstract":"N-list and B-list have simply been proven to be highly effective for mining frequent itemsets. The main problem of the two novel structures is that they both need to encode each node of pre-order (or start order) and post-order (or finish order) code. This causes excessive memory consumption to mine frequent itemsets. In this paper, we propose SO-Sets based on SO-Tree, a more efficient data structure, to mine frequent itemsets. SO-Sets require only start-order (or finish-order) of each node, which makes it save lots of memory compared with N-list and B-list. Based on SO-Sets, we propose a new algorithm called FISO to mining frequent itemsets. To analyze the performance of algorithms, we conduct lots of experiments on five real datasets. Experimental results show that FISO algorithm has advantages in running time and size of main memory consumption.","PeriodicalId":224387,"journal":{"name":"2016 IEEE International Conference on Electronic Information and Communication Technology (ICEICT)","volume":"276 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A new algorithm for fast mining frequent itemsets based on SO-Sets\",\"authors\":\"Long Tan, Q. Qin\",\"doi\":\"10.1109/ICEICT.2016.7879713\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"N-list and B-list have simply been proven to be highly effective for mining frequent itemsets. The main problem of the two novel structures is that they both need to encode each node of pre-order (or start order) and post-order (or finish order) code. This causes excessive memory consumption to mine frequent itemsets. In this paper, we propose SO-Sets based on SO-Tree, a more efficient data structure, to mine frequent itemsets. SO-Sets require only start-order (or finish-order) of each node, which makes it save lots of memory compared with N-list and B-list. Based on SO-Sets, we propose a new algorithm called FISO to mining frequent itemsets. To analyze the performance of algorithms, we conduct lots of experiments on five real datasets. Experimental results show that FISO algorithm has advantages in running time and size of main memory consumption.\",\"PeriodicalId\":224387,\"journal\":{\"name\":\"2016 IEEE International Conference on Electronic Information and Communication Technology (ICEICT)\",\"volume\":\"276 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Electronic Information and Communication Technology (ICEICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEICT.2016.7879713\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Electronic Information and Communication Technology (ICEICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEICT.2016.7879713","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new algorithm for fast mining frequent itemsets based on SO-Sets
N-list and B-list have simply been proven to be highly effective for mining frequent itemsets. The main problem of the two novel structures is that they both need to encode each node of pre-order (or start order) and post-order (or finish order) code. This causes excessive memory consumption to mine frequent itemsets. In this paper, we propose SO-Sets based on SO-Tree, a more efficient data structure, to mine frequent itemsets. SO-Sets require only start-order (or finish-order) of each node, which makes it save lots of memory compared with N-list and B-list. Based on SO-Sets, we propose a new algorithm called FISO to mining frequent itemsets. To analyze the performance of algorithms, we conduct lots of experiments on five real datasets. Experimental results show that FISO algorithm has advantages in running time and size of main memory consumption.