Qi Yu, Libo Huang, Cheng Qian, Jianqiao Ma, Zhiying Wang
{"title":"Evaluating Memory Performance of Emerging Scale-Out Applications Using C-AMAT","authors":"Qi Yu, Libo Huang, Cheng Qian, Jianqiao Ma, Zhiying Wang","doi":"10.1109/ISPASS.2018.00022","DOIUrl":null,"url":null,"abstract":"Modern dominant processor architectures that emphasize high single-thread performance and deep memory hierarchy mismatch the needs of emerging scale-out workloads. Understanding these inefficiencies, especially inefficiencies in memory systems is important for high efficient architecture design. In this paper, we evaluate the memory performance of emerging scale-out applications using a newly proposed memory metric, C-AMAT. We find that scale-out applications suffer more from current memory hierarchy compared to traditional server workloads and desktop benchmarks, especially in terms of memory access concurrency in L1 data cache and L2 cache. Suggestions are provided to improve their memory performance , including adopting designs like multi-port, multi-banked cache, setting appropriate number of MSHRs and applying advanced data prefetchers.","PeriodicalId":171552,"journal":{"name":"2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPASS.2018.00022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Modern dominant processor architectures that emphasize high single-thread performance and deep memory hierarchy mismatch the needs of emerging scale-out workloads. Understanding these inefficiencies, especially inefficiencies in memory systems is important for high efficient architecture design. In this paper, we evaluate the memory performance of emerging scale-out applications using a newly proposed memory metric, C-AMAT. We find that scale-out applications suffer more from current memory hierarchy compared to traditional server workloads and desktop benchmarks, especially in terms of memory access concurrency in L1 data cache and L2 cache. Suggestions are provided to improve their memory performance , including adopting designs like multi-port, multi-banked cache, setting appropriate number of MSHRs and applying advanced data prefetchers.