{"title":"Analysis and Characterization of Intel Itanium Instruction Bundles for Improving VLIW Processor Performance","authors":"Jiangjiang Liu, Brian Bell, T. Truong","doi":"10.1109/IMSCCS.2006.37","DOIUrl":null,"url":null,"abstract":"In order to achieve high instruction level parallelism (ILP), designers are turning to very long instruction word (VLIW) based designs, in which different types of instructions are grouped together as bundles of 128 bits or longer. In VLIW, the added nops increase the code size, limit processor performance by the under-utilization of functional units. In examining these performance issues of VLIW systems, we consider Intel first 64-bit architecture, the IA-64, and its first implementation, the Itanium, which employs Intel version of VLIW. We present a comprehensive analysis of the problem of under-utilization due to nops and stops across a wide range of application domains through the use of three different benchmark suites: SPEC CPU 2000, MediaBench, and PacketBench. Our results show that, on average, nops create an under-utilization factor of 28.46% in the case of SPEC CPU, 32.27% in MediaBench, and 29.76% in PacketBench. We also analyze the characteristics of different instruction bundle formats, which we obtain by collecting statistics concerning the frequency of the bundle formats","PeriodicalId":202629,"journal":{"name":"First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'06)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMSCCS.2006.37","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In order to achieve high instruction level parallelism (ILP), designers are turning to very long instruction word (VLIW) based designs, in which different types of instructions are grouped together as bundles of 128 bits or longer. In VLIW, the added nops increase the code size, limit processor performance by the under-utilization of functional units. In examining these performance issues of VLIW systems, we consider Intel first 64-bit architecture, the IA-64, and its first implementation, the Itanium, which employs Intel version of VLIW. We present a comprehensive analysis of the problem of under-utilization due to nops and stops across a wide range of application domains through the use of three different benchmark suites: SPEC CPU 2000, MediaBench, and PacketBench. Our results show that, on average, nops create an under-utilization factor of 28.46% in the case of SPEC CPU, 32.27% in MediaBench, and 29.76% in PacketBench. We also analyze the characteristics of different instruction bundle formats, which we obtain by collecting statistics concerning the frequency of the bundle formats
为了实现高指令级并行性(ILP),设计人员正在转向基于超长指令字(VLIW)的设计,在这种设计中,不同类型的指令被分组为128位或更长的指令束。在VLIW中,添加的nops增加了代码大小,由于功能单元利用率不足而限制了处理器性能。在研究VLIW系统的这些性能问题时,我们考虑了英特尔的第一个64位体系结构IA-64,以及它的第一个实现Itanium,它采用了英特尔版本的VLIW。我们通过使用三种不同的基准测试套件(SPEC CPU 2000、mediabbench和packketbench),全面分析了在广泛的应用程序领域中由于中断和停止而导致的利用率不足问题。我们的结果表明,平均而言,在SPEC CPU的情况下,nops产生的利用率不足系数为28.46%,在mediabbench中为32.27%,在packketbench中为29.76%。我们还分析了不同指令束格式的特点,我们通过收集有关指令束格式频率的统计数据来获得这些特征