D. Kyriazis, C. Doulkeridis, P. Gouvas, R. Jiménez-Peris, A. J. Ferrer, L. Kallipolitis, Pavlos Kranas, George Kousiouris, C. Macdonald, R. McCreadie, Y. Moatti, Apostolos Papageorgiou, M. Patiño-Martínez, Stathis Plitsos, Dimitrios Poulopoulos, Antonio Paradell, A. Raouzaiou, Paula Ta-Shma, V. Vianello
{"title":"BigDataStack: A Holistic Data-Driven Stack for Big Data Applications and Operations","authors":"D. Kyriazis, C. Doulkeridis, P. Gouvas, R. Jiménez-Peris, A. J. Ferrer, L. Kallipolitis, Pavlos Kranas, George Kousiouris, C. Macdonald, R. McCreadie, Y. Moatti, Apostolos Papageorgiou, M. Patiño-Martínez, Stathis Plitsos, Dimitrios Poulopoulos, Antonio Paradell, A. Raouzaiou, Paula Ta-Shma, V. Vianello","doi":"10.1109/BigDataCongress.2018.00041","DOIUrl":null,"url":null,"abstract":"The new data-driven industrial revolution highlights the need for big data technologies to unlock the potential in various application domains. In this context, emerging innovative solutions exploit several underlying infrastructure and cluster management systems. However, these systems have not been designed and implemented in a \"big data context\", and they rather emphasize and address the computational needs and aspects of applications and services to be deployed. In this paper we present the architecture of a complete stack (namely BigDataStack), based on a frontrunner infrastructure management system that drives decisions according to data aspects, thus being fully scalable, runtime adaptable and high-performant to address the needs of big data operations and data-intensive applications. Furthermore, the stack goes beyond purely infrastructure elements by introducing techniques for dimensioning big data applications, modelling and analyzing of processes as well as provisioning data-as-a-service by exploiting a seamless analytics framework.","PeriodicalId":177250,"journal":{"name":"2018 IEEE International Congress on Big Data (BigData Congress)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Congress on Big Data (BigData Congress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BigDataCongress.2018.00041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
The new data-driven industrial revolution highlights the need for big data technologies to unlock the potential in various application domains. In this context, emerging innovative solutions exploit several underlying infrastructure and cluster management systems. However, these systems have not been designed and implemented in a "big data context", and they rather emphasize and address the computational needs and aspects of applications and services to be deployed. In this paper we present the architecture of a complete stack (namely BigDataStack), based on a frontrunner infrastructure management system that drives decisions according to data aspects, thus being fully scalable, runtime adaptable and high-performant to address the needs of big data operations and data-intensive applications. Furthermore, the stack goes beyond purely infrastructure elements by introducing techniques for dimensioning big data applications, modelling and analyzing of processes as well as provisioning data-as-a-service by exploiting a seamless analytics framework.