Seventh IEEE International Conference on Data Mining (ICDM 2007)最新文献_第4页

Temporal Analysis of Semantic Graphs Using ASALSAN 使用ASALSAN的语义图时间分析

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.54

Brett W. Bader, R. Harshman, T. Kolda

引用次数: 139

Binary Matrix Factorization with Applications 二元矩阵分解及其应用

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.99

Zhongyuan Zhang, Tao Li, C. Ding, Xiang-Sun Zhang

引用次数: 159

Incremental Subspace Clustering over Multiple Data Streams 多数据流上的增量子空间聚类

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.100

Qi Zhang, Jinze Liu, Wei Wang

引用次数: 13

Optimizing Frequency Queries for Data Mining Applications 优化数据挖掘应用的频率查询

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.34

Hassan H. Malik, J. Kender

{"title":"Optimizing Frequency Queries for Data Mining Applications","authors":"Hassan H. Malik, J. Kender","doi":"10.1109/ICDM.2007.34","DOIUrl":"https://doi.org/10.1109/ICDM.2007.34","url":null,"abstract":"Data mining algorithms use various Trie and bitmap-based representations to optimize the support (i.e., frequency) counting performance. In this paper, we compare the memory requirements and support counting performance of FP Tree, and Compressed Patricia Trie against several novel variants of vertical bit vectors. First, borrowing ideas from the VLDB domain, we compress vertical bit vectors using WAH encoding. Second, we evaluate the Gray code rank- based transaction reordering scheme, and show that in practice, simple lexicographic ordering, obtained by applying LSB Radix sort, outperforms this scheme. Led by these results, we propose HDO, a novel Hamming-distance-based greedy transaction reordering scheme, and aHDO, a linear-time approximation to HDO. We present results of experiments performed on 15 common datasets with varying degrees of sparseness, and show that HDO- reordered, WAH encoded bit vectors can take as little as 5% of the uncompressed space, while aHDO achieves similar compression on sparse datasets. Finally, with results from over a billion database and data mining style frequency query executions, we show that bitmap-based approaches result in up to hundreds of times faster support counting, and HDO-WAH encoded bitmaps offer the best space-time tradeoff.","PeriodicalId":233758,"journal":{"name":"Seventh IEEE International Conference on Data Mining (ICDM 2007)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129608569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15

A Semantic Kernel for Semi-structured DocumentS 半结构化文档的语义内核

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.23

S. Aseervatham, E. Viennet, Younès Bennani

引用次数: 3

Recommendation via Query Centered Random Walk on K-Partite Graph 基于k部图查询中心随机游动的推荐算法

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.8

H. Cheng, P. Tan, J. Sticklen, W. Punch

引用次数: 53

Cross-Mining Binary and Numerical Attributes 交叉挖掘二进制和数值属性

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.32

G. C. Garriga, H. Heikinheimo, J. K. Seppänen

引用次数: 11

A Generalization of Proximity Functions for K-Means k -均值近似函数的推广

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.59

Junjie Wu, Hui Xiong, Jing Chen, Wenjun Zhou

引用次数: 30

Web Site Recommendation Using HTTP Traffic 使用HTTP流量的网站推荐

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.44

Ming Jia, Shaozhi Ye, Xing Li, J. Dickerson

引用次数: 3

Locally Constrained Support Vector Clustering 局部约束支持向量聚类

Seventh IEEE International Conference on Data Mining (ICDM 2007) Pub Date : 2007-10-28 DOI: 10.1109/ICDM.2007.58

Dragomir Yankov, Eamonn J. Keogh, K. Kan

引用次数: 16