{"title":"Algorithms for clustering XML documents: A review","authors":"Shagun Gulati, Geetika Munjal","doi":"10.1109/ICACEA.2015.7164772","DOIUrl":null,"url":null,"abstract":"This paper provides a brief survey of various algorithms that are widely being used for the clustering of XML (Extensible Markup Language) documents. The scalable integration techniques and algorithms, like XClust algorithm, S-GRACE algorithm, XProj algorithm, XCleaner algorithm and many more, are being developed for the growing number of data sources of XML documents. These techniques have been used for reduction in many problems of clustering but still we can find the problem of clustering complexity which is being discussed here and the technique to overcome that is being thought to be taken up as the future work.","PeriodicalId":202893,"journal":{"name":"2015 International Conference on Advances in Computer Engineering and Applications","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Advances in Computer Engineering and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACEA.2015.7164772","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper provides a brief survey of various algorithms that are widely being used for the clustering of XML (Extensible Markup Language) documents. The scalable integration techniques and algorithms, like XClust algorithm, S-GRACE algorithm, XProj algorithm, XCleaner algorithm and many more, are being developed for the growing number of data sources of XML documents. These techniques have been used for reduction in many problems of clustering but still we can find the problem of clustering complexity which is being discussed here and the technique to overcome that is being thought to be taken up as the future work.