{"title":"多线程系统下i结构软件缓存的性能分析","authors":"Wen-Yen Lin, J. Gaudiot, J. Amaral, G. Gao","doi":"10.1109/PCCC.2000.830305","DOIUrl":null,"url":null,"abstract":"Non-blocking multithreaded execution models have been proposed as an effective means to overlap computation and communication in distributed memory systems without any hardware support. Even with the capability of latency tolerance in these execution models, each remote memory request still incurs the cost of communication interface overhead. We therefore designed and implemented our I-structure software cache system to further reduce communication overhead for non-blocking multithreaded execution. In this paper, we present analytical models for the performances of a multithreading system with and without I-structure software cache support. We compare our model's prediction with our experimental results on an existing multithreaded architecture platform. The analytical models allow us to predict at what ratio of communication latency/processing speed the implementation of I-structure software cache becomes profitable for applications with different characteristics.","PeriodicalId":387201,"journal":{"name":"Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (Cat. No.00CH37086)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Performance analysis of the I-structure software cache on multi-threading systems\",\"authors\":\"Wen-Yen Lin, J. Gaudiot, J. Amaral, G. Gao\",\"doi\":\"10.1109/PCCC.2000.830305\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Non-blocking multithreaded execution models have been proposed as an effective means to overlap computation and communication in distributed memory systems without any hardware support. Even with the capability of latency tolerance in these execution models, each remote memory request still incurs the cost of communication interface overhead. We therefore designed and implemented our I-structure software cache system to further reduce communication overhead for non-blocking multithreaded execution. In this paper, we present analytical models for the performances of a multithreading system with and without I-structure software cache support. We compare our model's prediction with our experimental results on an existing multithreaded architecture platform. The analytical models allow us to predict at what ratio of communication latency/processing speed the implementation of I-structure software cache becomes profitable for applications with different characteristics.\",\"PeriodicalId\":387201,\"journal\":{\"name\":\"Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (Cat. No.00CH37086)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-02-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (Cat. No.00CH37086)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PCCC.2000.830305\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (Cat. No.00CH37086)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PCCC.2000.830305","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance analysis of the I-structure software cache on multi-threading systems
Non-blocking multithreaded execution models have been proposed as an effective means to overlap computation and communication in distributed memory systems without any hardware support. Even with the capability of latency tolerance in these execution models, each remote memory request still incurs the cost of communication interface overhead. We therefore designed and implemented our I-structure software cache system to further reduce communication overhead for non-blocking multithreaded execution. In this paper, we present analytical models for the performances of a multithreading system with and without I-structure software cache support. We compare our model's prediction with our experimental results on an existing multithreaded architecture platform. The analytical models allow us to predict at what ratio of communication latency/processing speed the implementation of I-structure software cache becomes profitable for applications with different characteristics.