{"title":"Software Metrics for Green Parallel Computing of Big Data Systems","authors":"H. Gürbüz, B. Tekinerdogan","doi":"10.1109/BigDataCongress.2016.54","DOIUrl":null,"url":null,"abstract":"Big Data is typically organized around a distributed file system on top of which the parallel algorithms can be executed for realizing the Big Data analytics. In general, the parallel algorithms can be mapped in different alternative ways to the computing platform. Hereby each alternative will perform differently with respect to the environmentally relevant parameters such as energy and power consumption. Existing studies on deployment of parallel computing algorithms have mainly focused on addressing general computing metrics such as speedup with respect to serial computing and efficiency of the use of the computing nodes. In this paper, we report on the elicitation of green metrics for big data systems that are required when analyzing deployment alternatives. To this end we use the existing systematic literature reviews and identify, and discuss the important green computing metrics for big data systems.","PeriodicalId":407471,"journal":{"name":"2016 IEEE International Congress on Big Data (BigData Congress)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Congress on Big Data (BigData Congress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BigDataCongress.2016.54","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Big Data is typically organized around a distributed file system on top of which the parallel algorithms can be executed for realizing the Big Data analytics. In general, the parallel algorithms can be mapped in different alternative ways to the computing platform. Hereby each alternative will perform differently with respect to the environmentally relevant parameters such as energy and power consumption. Existing studies on deployment of parallel computing algorithms have mainly focused on addressing general computing metrics such as speedup with respect to serial computing and efficiency of the use of the computing nodes. In this paper, we report on the elicitation of green metrics for big data systems that are required when analyzing deployment alternatives. To this end we use the existing systematic literature reviews and identify, and discuss the important green computing metrics for big data systems.