细胞处理器上的光线追踪

2006 IEEE Symposium on Interactive Ray Tracing Pub Date : 2006-09-01 DOI:10.1109/RT.2006.280210

Carsten Benthin, I. Wald, M. Scherbaum, Heiko Friedrich

{"title":"细胞处理器上的光线追踪","authors":"Carsten Benthin, I. Wald, M. Scherbaum, Heiko Friedrich","doi":"10.1109/RT.2006.280210","DOIUrl":null,"url":null,"abstract":"Over the last three decades, higher CPU performance has been achieved almost exclusively by raising the CPU's clock rate. Today, the resulting power consumption and heat dissipation threaten to end this trend, and CPU designers are looking for alternative ways of providing more compute power. In particular, they are looking towards three concepts: a streaming compute model, vector-like SIMD units, and multi-core architectures. One particular example of such an architecture is the cell broadband engine architecture (CBEA), a multi-core processor that offers a raw compute power of up to 200 GFlops per 3.2 GHz chip. The cell bears a huge potential for compute-intensive applications like ray tracing, but also requires addressing the challenges caused by this processor's unconventional architecture. In this paper, we describe an implementation of realtime ray tracing on a cell. Using a combination of low-level optimized kernel routines, a streaming software architecture, explicit caching, and a virtual software-hyperthreading approach to hide DMA latencies, we achieve for a single cell a pure ray tracing performance of nearly one order of magnitude over that achieved by a commodity CPU","PeriodicalId":158017,"journal":{"name":"2006 IEEE Symposium on Interactive Ray Tracing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"137","resultStr":"{\"title\":\"Ray Tracing on the Cell Processor\",\"authors\":\"Carsten Benthin, I. Wald, M. Scherbaum, Heiko Friedrich\",\"doi\":\"10.1109/RT.2006.280210\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the last three decades, higher CPU performance has been achieved almost exclusively by raising the CPU's clock rate. Today, the resulting power consumption and heat dissipation threaten to end this trend, and CPU designers are looking for alternative ways of providing more compute power. In particular, they are looking towards three concepts: a streaming compute model, vector-like SIMD units, and multi-core architectures. One particular example of such an architecture is the cell broadband engine architecture (CBEA), a multi-core processor that offers a raw compute power of up to 200 GFlops per 3.2 GHz chip. The cell bears a huge potential for compute-intensive applications like ray tracing, but also requires addressing the challenges caused by this processor's unconventional architecture. In this paper, we describe an implementation of realtime ray tracing on a cell. Using a combination of low-level optimized kernel routines, a streaming software architecture, explicit caching, and a virtual software-hyperthreading approach to hide DMA latencies, we achieve for a single cell a pure ray tracing performance of nearly one order of magnitude over that achieved by a commodity CPU\",\"PeriodicalId\":158017,\"journal\":{\"name\":\"2006 IEEE Symposium on Interactive Ray Tracing\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"137\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE Symposium on Interactive Ray Tracing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RT.2006.280210\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE Symposium on Interactive Ray Tracing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RT.2006.280210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 137

摘要

在过去的三十年里，提高CPU性能几乎完全是通过提高CPU的时钟速率来实现的。今天，由此产生的功耗和散热威胁到这一趋势的终结，CPU设计师正在寻找提供更多计算能力的替代方法。他们特别关注三个概念:流计算模型、类似矢量的SIMD单元和多核架构。这种架构的一个特殊示例是小区宽带引擎架构(CBEA)，它是一个多核处理器，每个3.2 GHz芯片提供高达200 GFlops的原始计算能力。该电池在光线追踪等计算密集型应用方面具有巨大潜力，但也需要解决该处理器非常规架构带来的挑战。在本文中，我们描述了一个实时光线追踪在一个细胞上的实现。使用低级优化的内核例程、流软件架构、显式缓存和虚拟软件超线程方法的组合来隐藏DMA延迟，我们为单个单元实现了纯光线跟踪性能，其性能比普通CPU实现的性能高出近一个数量级

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Ray Tracing on the Cell Processor

Over the last three decades, higher CPU performance has been achieved almost exclusively by raising the CPU's clock rate. Today, the resulting power consumption and heat dissipation threaten to end this trend, and CPU designers are looking for alternative ways of providing more compute power. In particular, they are looking towards three concepts: a streaming compute model, vector-like SIMD units, and multi-core architectures. One particular example of such an architecture is the cell broadband engine architecture (CBEA), a multi-core processor that offers a raw compute power of up to 200 GFlops per 3.2 GHz chip. The cell bears a huge potential for compute-intensive applications like ray tracing, but also requires addressing the challenges caused by this processor's unconventional architecture. In this paper, we describe an implementation of realtime ray tracing on a cell. Using a combination of low-level optimized kernel routines, a streaming software architecture, explicit caching, and a virtual software-hyperthreading approach to hide DMA latencies, we achieve for a single cell a pure ray tracing performance of nearly one order of magnitude over that achieved by a commodity CPU

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2006 IEEE Symposium on Interactive Ray Tracing

自引率

0.00%

发文量