{"title":"复杂的流指令:介绍和初步评估","authors":"S. Vassiliadis, B. Juurlink, E. Hakkennes","doi":"10.1109/EURMIC.2000.874659","DOIUrl":null,"url":null,"abstract":"An architectural paradigm intended to improve the performance of streaming operations is introduced. The proposed complex streamed instructions perform setting, controlling and executing vector operations simultaneously. That is, each of the instructions has the capability of sectioning a vector in addition to performing complex memory accesses and SIMD execution. To provide an initial validation, a video compression application is considered. We identify 5 streamed operations and perform simulations using the SimpleScalar toolset. The experiments show the following: the cycle count diminishes by a factor of 3.1, the number of executed instructions reduces by a factor of 3.0 to 3.2, and the CPI stays about the same. These results show that substantial performance improvements can be expected when complex streamed instructions are employed.","PeriodicalId":138250,"journal":{"name":"Proceedings of the 26th Euromicro Conference. EUROMICRO 2000. Informatics: Inventing the Future","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Complex streamed instructions: introduction and initial evaluation\",\"authors\":\"S. Vassiliadis, B. Juurlink, E. Hakkennes\",\"doi\":\"10.1109/EURMIC.2000.874659\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An architectural paradigm intended to improve the performance of streaming operations is introduced. The proposed complex streamed instructions perform setting, controlling and executing vector operations simultaneously. That is, each of the instructions has the capability of sectioning a vector in addition to performing complex memory accesses and SIMD execution. To provide an initial validation, a video compression application is considered. We identify 5 streamed operations and perform simulations using the SimpleScalar toolset. The experiments show the following: the cycle count diminishes by a factor of 3.1, the number of executed instructions reduces by a factor of 3.0 to 3.2, and the CPI stays about the same. These results show that substantial performance improvements can be expected when complex streamed instructions are employed.\",\"PeriodicalId\":138250,\"journal\":{\"name\":\"Proceedings of the 26th Euromicro Conference. EUROMICRO 2000. Informatics: Inventing the Future\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 26th Euromicro Conference. EUROMICRO 2000. Informatics: Inventing the Future\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EURMIC.2000.874659\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 26th Euromicro Conference. EUROMICRO 2000. Informatics: Inventing the Future","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EURMIC.2000.874659","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Complex streamed instructions: introduction and initial evaluation
An architectural paradigm intended to improve the performance of streaming operations is introduced. The proposed complex streamed instructions perform setting, controlling and executing vector operations simultaneously. That is, each of the instructions has the capability of sectioning a vector in addition to performing complex memory accesses and SIMD execution. To provide an initial validation, a video compression application is considered. We identify 5 streamed operations and perform simulations using the SimpleScalar toolset. The experiments show the following: the cycle count diminishes by a factor of 3.1, the number of executed instructions reduces by a factor of 3.0 to 3.2, and the CPI stays about the same. These results show that substantial performance improvements can be expected when complex streamed instructions are employed.