{"title":"Spherical Harmonic Transforms and Convolutions on the GPU","authors":"A. Brunton, J. Lang, E. Dubois","doi":"10.1080/2151237X.2010.10390649","DOIUrl":null,"url":null,"abstract":"Abstract We present implementations of the spherical harmonic forward and inverse transforms on the GPU using CUDA. We implement two algorithms for the SH transform: the direct method and the semi-naive. Our direct method has low storage requirements due to our on-the-fly computation of the associated Legendre functions, and it can perform large transform sizes and non-power-of-two sizes. Our semi-naive implementation is faster than state-of-the-art CPU implementations by a factor of between five and six, depending on the transform size. We target our implementations at spherical panoramic image processing where a large number of basis functions are required. We apply our tool to decompose panoramic images into an overcomplete spherical wavelet model for spherical convolution. We present timings, errors, and application examples of our implementations.","PeriodicalId":354935,"journal":{"name":"Journal of Graphics, GPU, and Game Tools","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Graphics, GPU, and Game Tools","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/2151237X.2010.10390649","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Abstract We present implementations of the spherical harmonic forward and inverse transforms on the GPU using CUDA. We implement two algorithms for the SH transform: the direct method and the semi-naive. Our direct method has low storage requirements due to our on-the-fly computation of the associated Legendre functions, and it can perform large transform sizes and non-power-of-two sizes. Our semi-naive implementation is faster than state-of-the-art CPU implementations by a factor of between five and six, depending on the transform size. We target our implementations at spherical panoramic image processing where a large number of basis functions are required. We apply our tool to decompose panoramic images into an overcomplete spherical wavelet model for spherical convolution. We present timings, errors, and application examples of our implementations.