{"title":"Applying Eco-Threading Framework to Memory-Intensive Hadoop Applications","authors":"Hiroaki Takasaki, S. Mostafa, S. Kusakabe","doi":"10.1109/ICISA.2014.6847366","DOIUrl":null,"url":null,"abstract":"Hadoop is a software framework for processing large data sets on clusters of commodity hardware. We apply our framework, which enhances performance and efficiency of memory-intensive multi-threaded applications, to Hadoop applications. The framework consists of a kernel-level thread scheduler, an application programming interface (API) for the scheduler, and a controller for the behavior of the scheduler through the API. We exploit the affinity of sibling threads, which have the same parent process and share the context, so that we can effectively exploit memory hierarchy by reducing memory-related undesirable events such as cache misses. We monitors performance metrics and automatically adjusts the behavior of the scheduler through the API to try to maximize the effectiveness of the scheduler. According to our preliminary evaluation result, our framework is promising to reduce the energy consumption of memory intensive Hadoop applications.","PeriodicalId":117185,"journal":{"name":"2014 International Conference on Information Science & Applications (ICISA)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Information Science & Applications (ICISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISA.2014.6847366","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Hadoop is a software framework for processing large data sets on clusters of commodity hardware. We apply our framework, which enhances performance and efficiency of memory-intensive multi-threaded applications, to Hadoop applications. The framework consists of a kernel-level thread scheduler, an application programming interface (API) for the scheduler, and a controller for the behavior of the scheduler through the API. We exploit the affinity of sibling threads, which have the same parent process and share the context, so that we can effectively exploit memory hierarchy by reducing memory-related undesirable events such as cache misses. We monitors performance metrics and automatically adjusts the behavior of the scheduler through the API to try to maximize the effectiveness of the scheduler. According to our preliminary evaluation result, our framework is promising to reduce the energy consumption of memory intensive Hadoop applications.