Laaziz Lahlou, N. Kara, Mohssine Arouch, Claes Edstrom
{"title":"Cheetah:提供下一代网络服务的快速无监督学习技术","authors":"Laaziz Lahlou, N. Kara, Mohssine Arouch, Claes Edstrom","doi":"10.1145/3429885.3429964","DOIUrl":null,"url":null,"abstract":"Recently, attributed graphs have been extensively employed in modeling, studying and analyzing complex interactions in real world systems. A myriad of techniques have been proposed to partition these graphs into clusters that exhibit small entropy with respect to both compositional attributes and the structural properties of the graph. In cloud network infrastructures, they play an important role to understand end users, compute nodes and their interactions. One of the main challenges in today's large scale cloud infrastructures is to categorize these compute nodes into clusters that share similar attributes. Existing unsupervised machine learning techniques such as k-Means and DBSCAN, are inadequate to partition large scale computer network infrastructures due to their non suitability for such contexts and their algorithmic complexities that prevent them from being scalable to such sizes in a reasonable time. In this paper, we first formulate the problem of partitioning attributed graphs in the context of cloud infrastructures as a Quadratic Assignment Problem to solve small to medium scale instances and show its NP-Hardness. We then propose Cheetah a fast and scalable multi-objective topology-aware unsupervised machine learning technique that is tailored to effectively partition large scale cloud network infrastructures. Yet, in terms of complexity, Cheetah is linear as it leverages Breadth First Search algorithm. Experimental results demonstrate its ability to quickly construct good-quality clusters (≈ 1.63 seconds) given 1000 nodes compared to K-Means (≈ 2.78 seconds) and DBSCAN (≈ 24.76 seconds), respectively, and reveal its suitability for large scale infrastructures making it an appealing solution to be integrated into orchestration systems.","PeriodicalId":205652,"journal":{"name":"Proceedings of the 2020 6th International Workshop on Container Technologies and Container Clouds","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cheetah: A fast unsupervised learning technique to provision next generation network services\",\"authors\":\"Laaziz Lahlou, N. Kara, Mohssine Arouch, Claes Edstrom\",\"doi\":\"10.1145/3429885.3429964\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, attributed graphs have been extensively employed in modeling, studying and analyzing complex interactions in real world systems. A myriad of techniques have been proposed to partition these graphs into clusters that exhibit small entropy with respect to both compositional attributes and the structural properties of the graph. In cloud network infrastructures, they play an important role to understand end users, compute nodes and their interactions. One of the main challenges in today's large scale cloud infrastructures is to categorize these compute nodes into clusters that share similar attributes. Existing unsupervised machine learning techniques such as k-Means and DBSCAN, are inadequate to partition large scale computer network infrastructures due to their non suitability for such contexts and their algorithmic complexities that prevent them from being scalable to such sizes in a reasonable time. In this paper, we first formulate the problem of partitioning attributed graphs in the context of cloud infrastructures as a Quadratic Assignment Problem to solve small to medium scale instances and show its NP-Hardness. We then propose Cheetah a fast and scalable multi-objective topology-aware unsupervised machine learning technique that is tailored to effectively partition large scale cloud network infrastructures. Yet, in terms of complexity, Cheetah is linear as it leverages Breadth First Search algorithm. Experimental results demonstrate its ability to quickly construct good-quality clusters (≈ 1.63 seconds) given 1000 nodes compared to K-Means (≈ 2.78 seconds) and DBSCAN (≈ 24.76 seconds), respectively, and reveal its suitability for large scale infrastructures making it an appealing solution to be integrated into orchestration systems.\",\"PeriodicalId\":205652,\"journal\":{\"name\":\"Proceedings of the 2020 6th International Workshop on Container Technologies and Container Clouds\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 6th International Workshop on Container Technologies and Container Clouds\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3429885.3429964\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 6th International Workshop on Container Technologies and Container Clouds","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3429885.3429964","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cheetah: A fast unsupervised learning technique to provision next generation network services
Recently, attributed graphs have been extensively employed in modeling, studying and analyzing complex interactions in real world systems. A myriad of techniques have been proposed to partition these graphs into clusters that exhibit small entropy with respect to both compositional attributes and the structural properties of the graph. In cloud network infrastructures, they play an important role to understand end users, compute nodes and their interactions. One of the main challenges in today's large scale cloud infrastructures is to categorize these compute nodes into clusters that share similar attributes. Existing unsupervised machine learning techniques such as k-Means and DBSCAN, are inadequate to partition large scale computer network infrastructures due to their non suitability for such contexts and their algorithmic complexities that prevent them from being scalable to such sizes in a reasonable time. In this paper, we first formulate the problem of partitioning attributed graphs in the context of cloud infrastructures as a Quadratic Assignment Problem to solve small to medium scale instances and show its NP-Hardness. We then propose Cheetah a fast and scalable multi-objective topology-aware unsupervised machine learning technique that is tailored to effectively partition large scale cloud network infrastructures. Yet, in terms of complexity, Cheetah is linear as it leverages Breadth First Search algorithm. Experimental results demonstrate its ability to quickly construct good-quality clusters (≈ 1.63 seconds) given 1000 nodes compared to K-Means (≈ 2.78 seconds) and DBSCAN (≈ 24.76 seconds), respectively, and reveal its suitability for large scale infrastructures making it an appealing solution to be integrated into orchestration systems.