Yongliang Wang, Naijie Gu, Junjie Su, Dongsheng Qi, Zhuorui Ning
{"title":"基于野外存取时空模型的数据布局优化","authors":"Yongliang Wang, Naijie Gu, Junjie Su, Dongsheng Qi, Zhuorui Ning","doi":"10.1109/AEMCSE55572.2022.00055","DOIUrl":null,"url":null,"abstract":"Memory access latency is one of the performance bottlenecks of most programs. Improving cache utilization is a common way to improve memory performance. Data layout optimization can improve cache performance based on the locality principle of the memory hierarchy. By analyzing the timestamp information and spatial information of memory access, a field access spatio-temporal model named FASTM was constructed to optimize the data layout of the structure. FASTM consists of three parts: hot data analysis, relative memory access count model and memory access behavior similarity model. A heuristic algorithm based on FASTM is proposed to design the split optimization scheme of the structure. Experimental results on eight benchmarks from SPEC and Olden show that FASTM can reduce cache misses by 57.85% and Translation Lookaside Buffer (TLB) misses by 74.70% on average. The average speedup of program running time is up to 1.37x.","PeriodicalId":309096,"journal":{"name":"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data layout optimization based on the spatio-temporal model of field access\",\"authors\":\"Yongliang Wang, Naijie Gu, Junjie Su, Dongsheng Qi, Zhuorui Ning\",\"doi\":\"10.1109/AEMCSE55572.2022.00055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Memory access latency is one of the performance bottlenecks of most programs. Improving cache utilization is a common way to improve memory performance. Data layout optimization can improve cache performance based on the locality principle of the memory hierarchy. By analyzing the timestamp information and spatial information of memory access, a field access spatio-temporal model named FASTM was constructed to optimize the data layout of the structure. FASTM consists of three parts: hot data analysis, relative memory access count model and memory access behavior similarity model. A heuristic algorithm based on FASTM is proposed to design the split optimization scheme of the structure. Experimental results on eight benchmarks from SPEC and Olden show that FASTM can reduce cache misses by 57.85% and Translation Lookaside Buffer (TLB) misses by 74.70% on average. The average speedup of program running time is up to 1.37x.\",\"PeriodicalId\":309096,\"journal\":{\"name\":\"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AEMCSE55572.2022.00055\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEMCSE55572.2022.00055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data layout optimization based on the spatio-temporal model of field access
Memory access latency is one of the performance bottlenecks of most programs. Improving cache utilization is a common way to improve memory performance. Data layout optimization can improve cache performance based on the locality principle of the memory hierarchy. By analyzing the timestamp information and spatial information of memory access, a field access spatio-temporal model named FASTM was constructed to optimize the data layout of the structure. FASTM consists of three parts: hot data analysis, relative memory access count model and memory access behavior similarity model. A heuristic algorithm based on FASTM is proposed to design the split optimization scheme of the structure. Experimental results on eight benchmarks from SPEC and Olden show that FASTM can reduce cache misses by 57.85% and Translation Lookaside Buffer (TLB) misses by 74.70% on average. The average speedup of program running time is up to 1.37x.