Review on recent developments in frequent itemset based document clustering, its research trends and applications

Q4 Mathematics

International Journal of Data Analysis Techniques and Strategies Pub Date : 2019-02-01 DOI:10.1504/IJDATS.2019.10018907

D. Rajput

引用次数: 4

Abstract

The document data is growing at an exponential rate. It is heterogeneous, dynamic and highly unstructured in nature. These characteristics of document data pose new challenges and opportunities for the development of various models and approaches for documents clustering. Different methods adopted for the development of these models. But these techniques have their advantages and disadvantages. The primary focus of the study is to the analysis of existing methods and approaches for document clustering based on frequent itemsets. Subsequently, this research direction facilitates the exploration of the emerging trends for each extension with applications. In this paper, more than 90 recent (published after 1990) research papers are summarised that are published in various reputed journals like IEEE Transaction, ScienceDirect, Springer-link, ACM and few fundamental authoritative articles.

查看原文本刊更多论文

综述了基于频繁项集的文档聚类的最新进展、研究趋势和应用

文档数据正以指数速度增长。它在本质上是异构的、动态的和高度非结构化的。文档数据的这些特征为各种文档聚类模型和方法的发展提出了新的挑战和机遇。这些模型的开发采用了不同的方法。但这些技术有其优点和缺点。本文的研究重点是对现有的基于频繁项集的文档聚类方法进行分析。随后，该研究方向有助于探索每个扩展的新兴趋势和应用。本文总结了90多篇近期(1990年以后发表的)研究论文，这些论文发表在IEEE Transaction、ScienceDirect、Springer-link、ACM等各种知名期刊上，以及一些基础权威文章。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Data Analysis Techniques and Strategies Decision Sciences-Information Systems and Management

CiteScore

1.20

自引率

0.00%

发文量