{"title":"基于新型聚类算法和语义特征的大数据摘要","authors":"Shilpa G. Kolte, J. Bakal","doi":"10.4018/IJRSDA.2017070108","DOIUrl":null,"url":null,"abstract":"This paper proposes a big data i.e., documents, texts summarization method using proposed clustering and semantic features. This paper proposes a novel clustering algorithm which is used for big data summarization. The proposed system works in four phases and provides a modular implementation of multiple documents summarization. The experimental results using Iris dataset show that the proposed clustering algorithm performs better than K-means and K-medodis algorithm. The performance of big data i.e., documents, texts summarization is evaluated using Australian legal cases from the Federal Court of Australia FCA database. The experimental results demonstrate that the proposed method can summarize big data document superior as compared with existing systems.","PeriodicalId":152357,"journal":{"name":"Int. J. Rough Sets Data Anal.","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"36","resultStr":"{\"title\":\"Big Data Summarization Using Novel Clustering Algorithm and Semantic Feature Approach\",\"authors\":\"Shilpa G. Kolte, J. Bakal\",\"doi\":\"10.4018/IJRSDA.2017070108\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a big data i.e., documents, texts summarization method using proposed clustering and semantic features. This paper proposes a novel clustering algorithm which is used for big data summarization. The proposed system works in four phases and provides a modular implementation of multiple documents summarization. The experimental results using Iris dataset show that the proposed clustering algorithm performs better than K-means and K-medodis algorithm. The performance of big data i.e., documents, texts summarization is evaluated using Australian legal cases from the Federal Court of Australia FCA database. The experimental results demonstrate that the proposed method can summarize big data document superior as compared with existing systems.\",\"PeriodicalId\":152357,\"journal\":{\"name\":\"Int. J. Rough Sets Data Anal.\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"36\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Rough Sets Data Anal.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/IJRSDA.2017070108\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Rough Sets Data Anal.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/IJRSDA.2017070108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Big Data Summarization Using Novel Clustering Algorithm and Semantic Feature Approach
This paper proposes a big data i.e., documents, texts summarization method using proposed clustering and semantic features. This paper proposes a novel clustering algorithm which is used for big data summarization. The proposed system works in four phases and provides a modular implementation of multiple documents summarization. The experimental results using Iris dataset show that the proposed clustering algorithm performs better than K-means and K-medodis algorithm. The performance of big data i.e., documents, texts summarization is evaluated using Australian legal cases from the Federal Court of Australia FCA database. The experimental results demonstrate that the proposed method can summarize big data document superior as compared with existing systems.