Big Data Summarization Using Novel Clustering Algorithm and Semantic Feature Approach

Int. J. Rough Sets Data Anal. Pub Date : 2017-07-01 DOI:10.4018/IJRSDA.2017070108

Shilpa G. Kolte, J. Bakal

引用次数: 36

Abstract

This paper proposes a big data i.e., documents, texts summarization method using proposed clustering and semantic features. This paper proposes a novel clustering algorithm which is used for big data summarization. The proposed system works in four phases and provides a modular implementation of multiple documents summarization. The experimental results using Iris dataset show that the proposed clustering algorithm performs better than K-means and K-medodis algorithm. The performance of big data i.e., documents, texts summarization is evaluated using Australian legal cases from the Federal Court of Australia FCA database. The experimental results demonstrate that the proposed method can summarize big data document superior as compared with existing systems.

查看原文本刊更多论文

基于新型聚类算法和语义特征的大数据摘要

本文提出了一种利用所提聚类和语义特征对大数据即文档、文本进行摘要的方法。提出了一种用于大数据摘要的聚类算法。提出的系统分四个阶段工作，并提供多文档摘要的模块化实现。Iris数据集的实验结果表明，本文提出的聚类算法优于K-means和K-medodis算法。使用来自澳大利亚联邦法院FCA数据库的澳大利亚法律案例来评估大数据(即文件、文本摘要)的性能。实验结果表明，与现有系统相比，所提出的方法在大数据文档的总结方面具有优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Int. J. Rough Sets Data Anal.

自引率

0.00%

发文量