{"title":"H.264编码器在Cell处理器上并行执行的性能分析","authors":"Jonghan Park, S. Ha","doi":"10.1109/ESTMED.2007.4375797","DOIUrl":null,"url":null,"abstract":"Performance improvement by parallel execution depends on two factors: the potential parallelism of the application itself, and the optimal mapping of the application to the target architecture, which is usually very target specific. As a case study, we analyze the expected performance of parallel execution of an H.264 encoding algorithm, known as X264, on the cell processor. Considering the communication architecture of the Cell processor, we parallelize the algorithm at the macro-block level. From the performance analysis, we discover the overhead factors of parallel execution and estimate the expected performance. Comparison with simulation results proves the accuracy and the usefulness of the proposed analysis method.","PeriodicalId":428196,"journal":{"name":"2007 IEEE/ACM/IFIP Workshop on Embedded Systems for Real-Time Multimedia","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"Performance Analysis of Parallel Execution of H.264 Encoder on the Cell Processor\",\"authors\":\"Jonghan Park, S. Ha\",\"doi\":\"10.1109/ESTMED.2007.4375797\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Performance improvement by parallel execution depends on two factors: the potential parallelism of the application itself, and the optimal mapping of the application to the target architecture, which is usually very target specific. As a case study, we analyze the expected performance of parallel execution of an H.264 encoding algorithm, known as X264, on the cell processor. Considering the communication architecture of the Cell processor, we parallelize the algorithm at the macro-block level. From the performance analysis, we discover the overhead factors of parallel execution and estimate the expected performance. Comparison with simulation results proves the accuracy and the usefulness of the proposed analysis method.\",\"PeriodicalId\":428196,\"journal\":{\"name\":\"2007 IEEE/ACM/IFIP Workshop on Embedded Systems for Real-Time Multimedia\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 IEEE/ACM/IFIP Workshop on Embedded Systems for Real-Time Multimedia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ESTMED.2007.4375797\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE/ACM/IFIP Workshop on Embedded Systems for Real-Time Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESTMED.2007.4375797","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance Analysis of Parallel Execution of H.264 Encoder on the Cell Processor
Performance improvement by parallel execution depends on two factors: the potential parallelism of the application itself, and the optimal mapping of the application to the target architecture, which is usually very target specific. As a case study, we analyze the expected performance of parallel execution of an H.264 encoding algorithm, known as X264, on the cell processor. Considering the communication architecture of the Cell processor, we parallelize the algorithm at the macro-block level. From the performance analysis, we discover the overhead factors of parallel execution and estimate the expected performance. Comparison with simulation results proves the accuracy and the usefulness of the proposed analysis method.