{"title":"On the applicability of simple cache models for modern processors","authors":"Rakhi Hemani, Subhasish Banerjee, Apala Guha","doi":"10.1109/ICGHPC.2016.7508062","DOIUrl":null,"url":null,"abstract":"Cache performance estimation is the first step in assuring good cache utilization and hence application performance. However, it is difficult to create good cache models as the implementation of commercial caches is complex, constantly evolving, and, protected information. As a result many practical compilers use simple cache models such as Fully Associative LRU Cache (FALC) model. In this paper we quantify the applicability of the FALC model for three modern processors. Our investigation reveals that the applicability is both application and architecture dependent. This insight is used to develop a model for an early (i.e. no profiling required) identification of applicability: Early Picking Criterion. The Early Picking Criterion is developed using synthetic benchmarks and validated with 15 memory intensive SPEC CPU2006 benchmarks. All applications identified by the Early Picking Criterion demonstrate high applicability.","PeriodicalId":268630,"journal":{"name":"2016 2nd International Conference on Green High Performance Computing (ICGHPC)","volume":"108 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Green High Performance Computing (ICGHPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICGHPC.2016.7508062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Cache performance estimation is the first step in assuring good cache utilization and hence application performance. However, it is difficult to create good cache models as the implementation of commercial caches is complex, constantly evolving, and, protected information. As a result many practical compilers use simple cache models such as Fully Associative LRU Cache (FALC) model. In this paper we quantify the applicability of the FALC model for three modern processors. Our investigation reveals that the applicability is both application and architecture dependent. This insight is used to develop a model for an early (i.e. no profiling required) identification of applicability: Early Picking Criterion. The Early Picking Criterion is developed using synthetic benchmarks and validated with 15 memory intensive SPEC CPU2006 benchmarks. All applications identified by the Early Picking Criterion demonstrate high applicability.