{"title":"基于软件轨迹的L2缓存重用距离直方图快速建模","authors":"Jiancong Ge, Ming Ling","doi":"10.1109/ISPASS.2019.00025","DOIUrl":null,"url":null,"abstract":"As the speed gap between the CPU and the main memory keeps increasing, multi-level caches are widely used in modern processors to improve the memory access latency. Therefore, modeling behaviors of the downstream caches becomes a critical part of the processor performance evaluation. In this paper, we propose a fast, yet accurate, L2 cache reuse distance histogram model without time-consuming full simulations, which can be utilized to evaluate the L2 cache miss rate with the Random and LRU replacement policies. The inputs of our model only need to be profiled once and can be reused for evaluations of different L2 cache configurations. To evaluate our model, we compare the output L2 RDH from our model and that of gem5 cycle-accurate simulations. When used to evaluate the L2 cache miss rates, the average absolute error is 3% for SPEC2006 benchmarks.","PeriodicalId":137786,"journal":{"name":"2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Fast Modeling of the L2 Cache Reuse Distance Histograms from Software Traces\",\"authors\":\"Jiancong Ge, Ming Ling\",\"doi\":\"10.1109/ISPASS.2019.00025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the speed gap between the CPU and the main memory keeps increasing, multi-level caches are widely used in modern processors to improve the memory access latency. Therefore, modeling behaviors of the downstream caches becomes a critical part of the processor performance evaluation. In this paper, we propose a fast, yet accurate, L2 cache reuse distance histogram model without time-consuming full simulations, which can be utilized to evaluate the L2 cache miss rate with the Random and LRU replacement policies. The inputs of our model only need to be profiled once and can be reused for evaluations of different L2 cache configurations. To evaluate our model, we compare the output L2 RDH from our model and that of gem5 cycle-accurate simulations. When used to evaluate the L2 cache miss rates, the average absolute error is 3% for SPEC2006 benchmarks.\",\"PeriodicalId\":137786,\"journal\":{\"name\":\"2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPASS.2019.00025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPASS.2019.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fast Modeling of the L2 Cache Reuse Distance Histograms from Software Traces
As the speed gap between the CPU and the main memory keeps increasing, multi-level caches are widely used in modern processors to improve the memory access latency. Therefore, modeling behaviors of the downstream caches becomes a critical part of the processor performance evaluation. In this paper, we propose a fast, yet accurate, L2 cache reuse distance histogram model without time-consuming full simulations, which can be utilized to evaluate the L2 cache miss rate with the Random and LRU replacement policies. The inputs of our model only need to be profiled once and can be reused for evaluations of different L2 cache configurations. To evaluate our model, we compare the output L2 RDH from our model and that of gem5 cycle-accurate simulations. When used to evaluate the L2 cache miss rates, the average absolute error is 3% for SPEC2006 benchmarks.