{"title":"核磁共振成像预处理性能瓶颈分析","authors":"Mathieu Dugré, Yohan Chatelain, Tristan Glatard","doi":"arxiv-2405.17650","DOIUrl":null,"url":null,"abstract":"Magnetic Resonance Image (MRI) pre-processing is a critical step for\nneuroimaging analysis. However, the computational cost of MRI pre-processing\npipelines is a major bottleneck for large cohort studies and some clinical\napplications. While High-Performance Computing (HPC) and, more recently, Deep\nLearning have been adopted to accelerate the computations, these techniques\nrequire costly hardware and are not accessible to all researchers. Therefore,\nit is important to understand the performance bottlenecks of MRI pre-processing\npipelines to improve their performance. Using Intel VTune profiler, we\ncharacterized the bottlenecks of several commonly used MRI-preprocessing\npipelines from the ANTs, FSL, and FreeSurfer toolboxes. We found that few\nfunctions contributed to most of the CPU time, and that linear interpolation\nwas the largest contributor. Data access was also a substantial bottleneck. We\nidentified a bug in the ITK library that impacts the performance of ANTs\npipeline in single-precision and a potential issue with the OpenMP scaling in\nFreeSurfer recon-all. Our results provide a reference for future efforts to\noptimize MRI pre-processing pipelines.","PeriodicalId":501291,"journal":{"name":"arXiv - CS - Performance","volume":"129 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Analysis of Performance Bottlenecks in MRI Pre-Processing\",\"authors\":\"Mathieu Dugré, Yohan Chatelain, Tristan Glatard\",\"doi\":\"arxiv-2405.17650\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Magnetic Resonance Image (MRI) pre-processing is a critical step for\\nneuroimaging analysis. However, the computational cost of MRI pre-processing\\npipelines is a major bottleneck for large cohort studies and some clinical\\napplications. While High-Performance Computing (HPC) and, more recently, Deep\\nLearning have been adopted to accelerate the computations, these techniques\\nrequire costly hardware and are not accessible to all researchers. Therefore,\\nit is important to understand the performance bottlenecks of MRI pre-processing\\npipelines to improve their performance. Using Intel VTune profiler, we\\ncharacterized the bottlenecks of several commonly used MRI-preprocessing\\npipelines from the ANTs, FSL, and FreeSurfer toolboxes. We found that few\\nfunctions contributed to most of the CPU time, and that linear interpolation\\nwas the largest contributor. Data access was also a substantial bottleneck. We\\nidentified a bug in the ITK library that impacts the performance of ANTs\\npipeline in single-precision and a potential issue with the OpenMP scaling in\\nFreeSurfer recon-all. Our results provide a reference for future efforts to\\noptimize MRI pre-processing pipelines.\",\"PeriodicalId\":501291,\"journal\":{\"name\":\"arXiv - CS - Performance\",\"volume\":\"129 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Performance\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.17650\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Performance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.17650","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Analysis of Performance Bottlenecks in MRI Pre-Processing
Magnetic Resonance Image (MRI) pre-processing is a critical step for
neuroimaging analysis. However, the computational cost of MRI pre-processing
pipelines is a major bottleneck for large cohort studies and some clinical
applications. While High-Performance Computing (HPC) and, more recently, Deep
Learning have been adopted to accelerate the computations, these techniques
require costly hardware and are not accessible to all researchers. Therefore,
it is important to understand the performance bottlenecks of MRI pre-processing
pipelines to improve their performance. Using Intel VTune profiler, we
characterized the bottlenecks of several commonly used MRI-preprocessing
pipelines from the ANTs, FSL, and FreeSurfer toolboxes. We found that few
functions contributed to most of the CPU time, and that linear interpolation
was the largest contributor. Data access was also a substantial bottleneck. We
identified a bug in the ITK library that impacts the performance of ANTs
pipeline in single-precision and a potential issue with the OpenMP scaling in
FreeSurfer recon-all. Our results provide a reference for future efforts to
optimize MRI pre-processing pipelines.