Trace: a high-throughput tomographic reconstruction engine for large-scale datasets

Impact factor: 3.56 (JCR Q1, Medicine)
Tekin Bicer, Doğa Gürsoy, Vincent De Andrade, Rajkumar Kettimuthu, William Scullin, Francesco De Carlo, Ian T. Foster
Advanced Structural and Chemical Imaging, volume 3, issue 1. Published 2017-01-28. DOI: 10.1186/s40679-017-0040-7
https://link.springer.com/article/10.1186/s40679-017-0040-7
Citations: 27

Abstract

Modern synchrotron light sources and detectors produce data at such scale and complexity that large-scale computation is required to unleash their full power. Computed tomography (CT) is a widely used imaging technique that generates data at tens of gigabytes per second. Although CT experiments generate data rapidly, analyzing and reconstructing the collected data may require hours or even days of computation on a medium-sized workstation, which hinders scientific progress that relies on the results of the analysis.

We present Trace, a data-intensive computing engine that we have developed to enable high-performance implementation of iterative tomographic reconstruction algorithms on parallel computers. Trace provides fine-grained reconstruction of tomography datasets using both (thread-level) shared-memory and (process-level) distributed-memory parallelization. Trace uses a special data structure, called a replicated reconstruction object, to maximize application performance. We also present the optimizations that we apply to the replicated reconstruction objects and evaluate them using tomography datasets collected at the Advanced Photon Source.
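The idea of a replicated reconstruction object can be sketched in a few lines. The abstract does not spell out the data structure, so the sketch below assumes that each worker accumulates updates into its own private copy of the reconstruction grid and that the copies are then merged by an element-wise sum. The function names (`backproject_chunk`, `reduce_replicas`, `reconstruct`) and the toy `(cell, value)` ray format are hypothetical and are not taken from Trace itself.

```python
from concurrent.futures import ThreadPoolExecutor


def backproject_chunk(rays, grid_size):
    """Accumulate one worker's ray contributions into its own private replica."""
    replica = [0.0] * grid_size  # private copy, so no locking is needed
    for cell, value in rays:
        replica[cell] += value
    return replica


def reduce_replicas(replicas):
    """Merge all replicas into one reconstruction object by element-wise sum."""
    return [sum(cells) for cells in zip(*replicas)]


def reconstruct(rays, grid_size, num_workers):
    """Split the rays across workers, fill one replica per worker, then reduce."""
    chunks = [rays[i::num_workers] for i in range(num_workers)]
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        replicas = list(pool.map(lambda c: backproject_chunk(c, grid_size), chunks))
    return reduce_replicas(replicas)


# Toy input (assumed format): each ray contributes `value` to grid cell `cell`.
rays = [(0, 1.0), (1, 2.0), (0, 3.0), (2, 4.0), (1, 5.0)]
result = reconstruct(rays, grid_size=3, num_workers=2)
print(result)
```

Python threads here only illustrate the data layout. In a two-level scheme like the one the abstract describes, the same reduction step would presumably run first across threads within a process and then across processes.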

Our experimental evaluations show that our optimizations and parallelization techniques can provide a 158× speedup using 32 compute nodes (384 cores) over a single-core configuration and decrease the end-to-end processing time of a large sinogram (with 4501 × 1 × 22,400 dimensions) from 12.5 h to <5 min per iteration.
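The reported figures can be cross-checked with simple arithmetic. The short calculation below uses only the numbers quoted above and assumes that the 158× speedup applies to the same sinogram as the 12.5 h baseline, which the abstract implies but does not state explicitly. The variable names are illustrative.

```python
# Figures quoted in the abstract.
nodes = 32
cores = 384
speedup = 158.0
baseline_hours = 12.5

# Cores available on each compute node.
cores_per_node = cores // nodes

# Parallel efficiency: achieved speedup relative to ideal linear scaling.
parallel_efficiency = speedup / cores

# Per-iteration time implied by applying the speedup to the single-core baseline.
implied_minutes = baseline_hours * 60 / speedup

print(cores_per_node)
print(round(parallel_efficiency, 3))
print(round(implied_minutes, 2))
```

Under that assumption the implied time is a little under five minutes per iteration, consistent with the "<5 min" claim, and the parallel efficiency is roughly 41% of ideal linear scaling on 384 cores.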

The proposed tomographic reconstruction engine can efficiently process large-scale tomographic data using many compute nodes and minimize reconstruction times.

Source journal: Advanced Structural and Chemical Imaging (Medicine: Radiology, Nuclear Medicine and Imaging)