T. Cucinotta, Luigi Pannocchi, Filippo Galli, S. Fichera, Sourav Lahiri, Antonino Artale
{"title":"Optimum VM Placement for NFV Infrastructures","authors":"T. Cucinotta, Luigi Pannocchi, Filippo Galli, S. Fichera, Sourav Lahiri, Antonino Artale","doi":"10.1109/IC2E55432.2022.00029","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00029","url":null,"abstract":"This paper constitutes an industrial experience re-port about the use of data center optimization strategies for softwarized network services within the Vodafone resource man-agement unit for the management of virtualized network infras-tructures. The problem of optimum virtual machine placement as needed in the network operator context is detailed, and different solving strategies are proposed and discussed, including heuristics based on genetic optimization. Also, experimental results are presented that compare these strategies with one another from the standpoint of optimality and execution times, using a data-set made of some of the real problems that had to be solved in the past few years by Vodafone, in order to optimize its capacity planning decisions. The presented experimental results highlight that an optimum solver leads to excessively high computation times for large problems, whereas simple heuristics may exhibit significant loss in optimality at reduced computation times. Genetic optimization, on the other hand, constitutes a very interesting trade-off between these two extremes. The data-set used for the provided results is published under an open data license, for possible reuse in future research works on the topic.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"223 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117109054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scalable Collaborative Software Visualization as a Service: Short Industry and Experience Paper","authors":"Alexander Krause-Glau, W. Hasselbring","doi":"10.1109/IC2E55432.2022.00026","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00026","url":null,"abstract":"Software visualizations are used by software developers, for instance, for program comprehension. In this context, a less researched aspect is the collaborative use of online visual-ization services. This paper presents the conceptual design and a prototype implementation of our approach for a collaborative software visualization service for program comprehension. The central idea is an online available software-as-a-service application that analyzes, persists, and visualizes software applications which users intend to explore and comprehend via software visualization. The resulting implementation is a redevelopment of our live trace visualization tool Explor Viz. In comparison to other approaches, ExplorViz utilizes WebGL and other browser technologies instead of a game engine to render its visualizations. As a result, we achieve a platform-independent interoperability that is crucial for (remote) collaboration. Here, users can explore the same visualization via our collaboratively usable desktop, virtual reality, and also augmented reality modes. Our prototype follows the Twelve-Factor App methodology to build a cloud-native application that comprises multiple scalable microservices. Thanks to horizontal scaling, our implementation is capable to analyze a large amount of visualization data; thus, allowing multiple users to simultaneously use the software visualization service. We conducted a set of preliminary performance exper-iments to benchmark our prototype's scalability. Results show that the evaluated service scales linearly with increasing load.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124147195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automated Traces-based Anomaly Detection and Root Cause Analysis in Cloud Platforms","authors":"Mbarka Soualhia, F. Wuhib","doi":"10.1109/IC2E55432.2022.00034","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00034","url":null,"abstract":"Current cloud infrastructures and their applications are increasingly complex, with confounding relationships among application elements and cloud infrastructure components. This makes timely identification of the root causes for faults that occur in such systems an important-yet-challenging task. In this paper, we propose a solution that automatically builds a correlation model and an anomaly detection model using kernel traces of cloud servers. The correlation model is used to capture the dependencies between the various elements of the cloud system while the anomaly detection model is used to identify anomalies related to specific elements of the system. Upon detection of a fault, our framework computes a dependency graph of detected anomalies using the models, which in turn is used to perform the root cause analysis. Evaluation results of our proposed framework on a Kubernetes cloud show that it can effectively find root causes of injected faults with an accuracy rate between 80% and 99.3%, with a low false negative rate.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"207 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134295809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Reddy, D. Moghe, Manik Taneja, Roger Liao, Subin Francis
{"title":"Novel Abstraction and Offload Mechanisms for High Performance Cloud-native Distributed Object Stores","authors":"A. Reddy, D. Moghe, Manik Taneja, Roger Liao, Subin Francis","doi":"10.1109/IC2E55432.2022.00024","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00024","url":null,"abstract":"Object Storage solutions are typically optimised for capacity and cost but performance has traditionally been a second thought. We make the case for a highly performant distributed object store by a) building an abstraction layer to pass immutability and data affinity hints to the underlying storage while also, b) making the Objects layer aware of the hardware configurations enabling the Object Storage controllers to maximise throughput. With these optimizations we show that object storage performance can approach 95% of the maximum possible performance from the underlying raw storage while ensuring that the abstractions are generic enough to be able to run on any general purpose off the shelf storage systems. We have observed these performance gains across more than 1000 customer environments across diverse hardware. We also extend the above optimizations with generic mecha-nisms to offload compute closer to storage that have significant benefits for a broad class of workloads. Specifically, we evaluate performance gains from a) well known constructs like S3 Select for Analytics workloads and b) generic compute offload like Objects Lambda. This ability to offload compute is critical for modern distributed workloads like AI/ML and Analytics processing with very large distributed data sets.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115132955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Organizing and Program Committee: IC2E 2022","authors":"","doi":"10.1109/ic2e55432.2022.00006","DOIUrl":"https://doi.org/10.1109/ic2e55432.2022.00006","url":null,"abstract":"","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127241806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Poster Paper: Operating System Support for Applications Performance Analysis","authors":"Riley VanDonge, Naser Ezzati-Jivan","doi":"10.1109/IC2E55432.2022.00039","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00039","url":null,"abstract":"In order to classify common performance issues of multi-core applications, used in cloud computing and distributed systems, and offer solutions to them, performance antipatterns have been introduced by researchers. Performance antipatterns help developers refactor inefficient code, and are exceptionally useful for multi-threaded applications, where problems can be difficult to diagnose. However, existing performance antipattern detection methods do not properly examine operating system-wide resources, leading to imprecise metrics and results. In this paper, a novel system-level execution tracing method is presented for detecting the One Lane Bridge performance antipattern. This method is validated through a case study performed on an open-source multi-threaded application, where we diagnosed performance issues.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130681519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Simone Bottoni, S. Braghin, Alberto Trombetta, S. Venugopal
{"title":"Adaptive Replication Strategy in Highly Distributed Data Management Systems","authors":"Simone Bottoni, S. Braghin, Alberto Trombetta, S. Venugopal","doi":"10.1109/IC2E55432.2022.00036","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00036","url":null,"abstract":"The performance of the execution of an analytical workload critically impacts the speed at which companies are able to react to market changes. In the era of Big Data, it is imperative that large, complex analytics are executed in a timely manner. In this paper, we propose a method to analyze the data access pattern of analytical workloads on large datasets to identify optimal data partitioning and replication strategies. This, in turn, helps the already existing query optimization components of modern data management systems.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133933356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"IoTreeplay: Synchronous Distributed Traffic Replay in IoT Environments","authors":"Markus Toll, Ilja Behnke, O. Kao","doi":"10.1109/IC2E55432.2022.00008","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00008","url":null,"abstract":"Use-cases in the Internet of Things (IoT) typically involve a high number of interconnected, heterogeneous devices. Due to the criticality of many IoT scenarios, systems and applications need to be tested thoroughly before rollout. Existing staging environments and testing frameworks are able to emulate network properties but fail to deliver actual network-wide traffic control to test systems application independently. To extend existing frameworks, we present the distributed traffic replaying tool IoTreeplay. The tool embeds TCPLivePlay into an environment that allows the synchronous replaying of network traffic with multiple endpoints and connections. Replaying takes place in a user-defined network or testbed containing IoT use-cases. Network traffic can be captured and compared to the original trace to evaluate accuracy and reliability. The resulting implementation is able to accurately replay connections within a maximum transmission rate but struggles with deviations from regular TCP connections, like packet loss or connection reset. An evaluation has been performed, measuring individual and aggregated delays between packets, based on the recorded timestamps.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115648029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient Transmission and Reconstruction of Dependent Data Streams via Edge Sampling","authors":"Joel Wolfrath, A. Chandra","doi":"10.1109/IC2E55432.2022.00013","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00013","url":null,"abstract":"Data stream processing is an increasingly important topic due to the prevalence of smart devices and the demand for real-time analytics. Geo-distributed streaming systems, where cloud-based queries utilize data streams from multiple distributed devices, face challenges since wide-area network (WAN) bandwidth is often scarce or expensive. Edge computing allows us to address these bandwidth costs by utilizing resources close to the devices, e.g. to perform sampling over the incoming data streams, which trades downstream query accuracy to reduce the overall transmission cost. In this paper, we leverage the fact that correlations between data streams may exist across devices located in the same geographical region. Using this insight, we develop a hybrid edge-cloud system which systematically trades off between sampling at the edge and estimation of missing values in the cloud to reduce traffic over the WAN. We present an optimization framework which computes sample sizes at the edge and systematically bounds the number of samples we can estimate in the cloud given the strength of the correlation between streams. Our evaluation with three real-world datasets shows that compared to existing sampling techniques, our system could provide comparable error rates over multiple aggregate queries while reducing WAN traffic by 27-42%.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"272 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121042597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Soeren Becker, Tobias Pfandzelter, Nils Japke, David Bermbach, O. Kao
{"title":"Network Emulation in Large-Scale Virtual Edge Testbeds: A Note of Caution and the Way Forward","authors":"Soeren Becker, Tobias Pfandzelter, Nils Japke, David Bermbach, O. Kao","doi":"10.1109/IC2E55432.2022.00007","DOIUrl":"https://doi.org/10.1109/IC2E55432.2022.00007","url":null,"abstract":"The growing research and industry interest in the Internet of Things and the edge computing paradigm has increased the need for cost-efficient virtual testbeds for large-scale distributed applications. Researchers, students, and practitioners need to test and evaluate the interplay of hundreds or thousands of real software components and services connected with a realistic edge network without access to phvsical infrastructure. While advances in virtualization technologies have enabled parts of this, network emulation as a crucial part in the development of edge testbeds is lagging behind: As we show in this paper, NetEm, the current state-of-the-art network emulation tooling included in the Linux kernel, imposes prohibitive scalability limits. We quantify these limits, investigate possible causes, and present a way forward for network emulation in large-scale virtual edge testbeds based on eBPFs.","PeriodicalId":415781,"journal":{"name":"2022 IEEE International Conference on Cloud Engineering (IC2E)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130288386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}