2008 Eighth IEEE International Conference on Data Mining最新文献_第5页

Non-negative Matrix Factorization on Manifold 流形上的非负矩阵分解

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.57

Deng Cai, Xiaofei He, Xiaoyun Wu, Jiawei Han

引用次数: 391

Balancing Spectral Clustering for Segmenting Spatio-temporal Observations of Multi-agent Systems 基于平衡谱聚类的多智能体系统时空观测数据分割

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.88

B. Takács, Y. Demiris

引用次数: 14

Clustering Uncertain Data Using Voronoi Diagrams 用Voronoi图聚类不确定数据

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.31

Ben Kao, Sau-dan. Lee, David Wai-Lok Cheung, Wai-Shing Ho, K. F. Chan

引用次数: 89

Fast and Memory Efficient Mining of High Utility Itemsets in Data Streams 数据流中高实用项集的快速高效挖掘

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.107

Hua-Fu Li, Hsin-Yun Huang, Yi-Cheng Chen, Yu-Jiun Liu, Suh-Yin Lee

引用次数: 127

Support Vector Regression for Censored Data (SVRc): A Novel Tool for Survival Analysis 删节数据的支持向量回归(SVRc):一种新的生存分析工具

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.50

F. Khan, V. Zubek

引用次数: 132

A Hierarchical Algorithm for Clustering Uncertain Data via an Information-Theoretic Approach 一种基于信息论的不确定数据聚类层次算法

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.115

Francesco Gullo, Giovanni Ponti, Andrea Tagarelli, S. Greco

引用次数: 34

Isolation Forest 与世隔绝的森林

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.17

Fei Tony Liu, K. Ting, Zhi-Hua Zhou

引用次数: 3344

Why Stacked Models Perform Effective Collective Classification 为什么堆叠模型能有效地进行集体分类

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.126

A. Fast, David D. Jensen

引用次数: 33

On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking 在线LDA:挖掘文本流的自适应主题模型及其在主题检测和跟踪中的应用

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.140

Loulwah AlSumait, Daniel Barbará, C. Domeniconi

{"title":"On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking","authors":"Loulwah AlSumait, Daniel Barbará, C. Domeniconi","doi":"10.1109/ICDM.2008.140","DOIUrl":"https://doi.org/10.1109/ICDM.2008.140","url":null,"abstract":"This paper presents online topic model (OLDA), a topic model that automatically captures the thematic patterns and identifies emerging topics of text streams and their changes over time. Our approach allows the topic modeling framework, specifically the latent Dirichlet allocation (LDA) model, to work in an online fashion such that it incrementally builds an up-to-date model (mixture of topics per document and mixture of words per topic) when a new document (or a set of documents) appears. A solution based on the empirical Bayes method is proposed. The idea is to incrementally update the current model according to the information inferred from the new stream of data with no need to access previous data. The dynamics of the proposed approach also provide an efficient mean to track the topics over time and detect the emerging topics in real time. Our method is evaluated both qualitatively and quantitatively using benchmark datasets. In our experiments, the OLDA has discovered interesting patterns by just analyzing a fraction of data at a time. Our tests also prove the ability of OLDA to align the topics across the epochs with which the evolution of the topics over time is captured. The OLDA is also comparable to, and sometimes better than, the original LDA in predicting the likelihood of unseen documents.","PeriodicalId":252958,"journal":{"name":"2008 Eighth IEEE International Conference on Data Mining","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121860844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 455

A Novel Method of Combined Feature Extraction for Recognition 一种新的组合特征提取识别方法

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI: 10.1109/ICDM.2008.28

Tingkai Sun, Songcan Chen, Jing-yu Yang, P. Shi

引用次数: 127