{"title":"Optimizing applications for performance on the pentium 4 architecture","authors":"A. Mehis, R. Radhakrishnan","doi":"10.1109/WWC.2002.1226494","DOIUrl":null,"url":null,"abstract":"In this paper we characterize the performance impact of using advanced compiler optimizations on the Intel Pentium 4 (P4) processor. Using the Intel C++/FORTRAN compilers we show that on a variety of benchmarks, advanced compiler optimizations are required to improve performance on the P4 processor. For applications developed using advanced optimizations targeting the earlier PentiumPro through Pentium III architectures, recompilation is likely required to obtain and/or maximize performance improvements on the P4. The performance enhancing design features of the P4 although dynamic in nature, require that applications be recompiled using P4 architecture aware compilers to obtain performance improvements.","PeriodicalId":320576,"journal":{"name":"2002 IEEE International Workshop on Workload Characterization","volume":"140 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2002 IEEE International Workshop on Workload Characterization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WWC.2002.1226494","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper we characterize the performance impact of using advanced compiler optimizations on the Intel Pentium 4 (P4) processor. Using the Intel C++/FORTRAN compilers we show that on a variety of benchmarks, advanced compiler optimizations are required to improve performance on the P4 processor. For applications developed using advanced optimizations targeting the earlier PentiumPro through Pentium III architectures, recompilation is likely required to obtain and/or maximize performance improvements on the P4. The performance enhancing design features of the P4 although dynamic in nature, require that applications be recompiled using P4 architecture aware compilers to obtain performance improvements.