Suad Kamil Ayfan, Dhiah Al-Shammary, Ahmed M. Mahdi, Fahim Sufi
{"title":"Dynamic clustering based on Minkowski similarity for web services aggregation","authors":"Suad Kamil Ayfan, Dhiah Al-Shammary, Ahmed M. Mahdi, Fahim Sufi","doi":"10.1007/s41870-024-02174-5","DOIUrl":null,"url":null,"abstract":"<p>This research, introduces a new dynamic clustering method offering a new approach utilizing Minkowski Distance methods for calculating similarity of xml messages to effectively compress and aggregate them. The increase in Web services utilization has led to bottlenecks and congestion on network links with limited bandwidth. Furthermore, Simple Object Access Protocol (SOAP) is an eXtensible Markup Language (XML) based messaging system often utilized on the internet. It leads to interoperability by facilitating connection both users and their service providers across various platforms. The large amount and huge size of the SOAP messages being exchanged lead to congestion and bottlenecks. Aggregation tools for SOAP messages can effectively decrease the significant amount that traffic generated. This has shown a notable enhancement in performance. Enhancements can be made by using similarity methods. These techniques group together multiple SOAP messages that share a significant level of similarity. Present techniques utilizing grouping for aggregating XML messages have demonstrated efficiency and compression ratio limitations. Practically, the proposed model groups messages into clusters based on minimum distance, supporting Huffman (variable-length) and (fixed-length) encoding compressing for aggregating multiple compressed XML web messages into a single compact message. Generally, the suggested model’s performance has been evaluated through a comparison with K-Means, Principle Component Analysis (PCA) with K-Means, Hilbert, and fractal self-similarity clustering models. Minkowski distance clustering model has shown excellent performance, especially in all message sizes like small, medium, large, V.large. Technically, the model achieved superior average Compression Ratio and it has outperformed all other models.</p>","PeriodicalId":14138,"journal":{"name":"International Journal of Information Technology","volume":"32 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41870-024-02174-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This research, introduces a new dynamic clustering method offering a new approach utilizing Minkowski Distance methods for calculating similarity of xml messages to effectively compress and aggregate them. The increase in Web services utilization has led to bottlenecks and congestion on network links with limited bandwidth. Furthermore, Simple Object Access Protocol (SOAP) is an eXtensible Markup Language (XML) based messaging system often utilized on the internet. It leads to interoperability by facilitating connection both users and their service providers across various platforms. The large amount and huge size of the SOAP messages being exchanged lead to congestion and bottlenecks. Aggregation tools for SOAP messages can effectively decrease the significant amount that traffic generated. This has shown a notable enhancement in performance. Enhancements can be made by using similarity methods. These techniques group together multiple SOAP messages that share a significant level of similarity. Present techniques utilizing grouping for aggregating XML messages have demonstrated efficiency and compression ratio limitations. Practically, the proposed model groups messages into clusters based on minimum distance, supporting Huffman (variable-length) and (fixed-length) encoding compressing for aggregating multiple compressed XML web messages into a single compact message. Generally, the suggested model’s performance has been evaluated through a comparison with K-Means, Principle Component Analysis (PCA) with K-Means, Hilbert, and fractal self-similarity clustering models. Minkowski distance clustering model has shown excellent performance, especially in all message sizes like small, medium, large, V.large. Technically, the model achieved superior average Compression Ratio and it has outperformed all other models.
这项研究介绍了一种新的动态聚类方法,利用闵科夫斯基距离法计算 xml 信息的相似性,从而有效地压缩和聚合这些信息。网络服务使用量的增加导致带宽有限的网络链路出现瓶颈和拥塞。此外,简单对象访问协议(SOAP)是一种基于可扩展标记语言(XML)的信息传递系统,经常在互联网上使用。它通过促进用户与其服务提供商在不同平台上的连接,实现互操作性。交换的 SOAP 信息数量大、体积大,会导致拥塞和瓶颈。SOAP 消息聚合工具可以有效减少产生的大量流量。这显示了性能的显著提升。使用相似性方法可以提高性能。这些技术可将具有高度相似性的多条 SOAP 信息组合在一起。目前利用分组来聚合 XML 信息的技术在效率和压缩率方面都有局限性。实际上,建议的模型根据最小距离将信息分组,支持哈夫曼(可变长度)和(固定长度)压缩编码,可将多条压缩 XML 网络信息聚合成一条紧凑的信息。总体而言,通过与 K-Means、带 K-Means 的主成分分析法(PCA)、希尔伯特和分形自相似性聚类模型的比较,对所建议模型的性能进行了评估。闵科夫斯基距离聚类模型表现出色,尤其是在所有信息大小(如小型、中型、大型、V.大型)中。从技术上讲,该模型实现了出色的平均压缩比,其表现优于所有其他模型。