Data Augmentation Based on Substituting Regional MRIs Volume Scores.

Tuo Leng, Qingyu Zhao, Chao Yang, Zhufu Lu, Ehsan Adeli, Kilian M Pohl
{"title":"Data Augmentation Based on Substituting Regional MRIs Volume Scores.","authors":"Tuo Leng, Qingyu Zhao, Chao Yang, Zhufu Lu, Ehsan Adeli, Kilian M Pohl","doi":"10.1007/978-3-030-33642-4_4","DOIUrl":null,"url":null,"abstract":"<p><p>Due to difficulties in collecting sufficient training data, recent advances in neural-network-based methods have not been fully explored in the analysis of brain Magnetic Resonance Imaging (MRI). A possible solution to the limited-data issue is to augment the training set with synthetically generated data. In this paper, we propose a data augmentation strategy based on <i>regional feature substitution</i>. We demonstrate the advantages of this strategy with respect to training a simple neural-network-based classifier in predicting when individual youth transition from no-to-low to medium-to-heavy alcohol drinkers solely based on their volumetric MRI measurements. Based on 20-fold cross-validation, we generate more than one million synthetic samples from less than 500 subjects for each training run. The classifier achieves an accuracy of 74.1% in correctly distinguishing non-drinkers from drinkers at baseline and a 43.2% weighted accuracy in predicting the transition over a three year period (5-group classification task). Both accuracy scores are significantly better than training the classifier on the original dataset.</p>","PeriodicalId":93124,"journal":{"name":"Large-Scale Annotation of Biomedical Data and Expert Label Synthesis and Hardware Aware Learning for Medical Imaging and Computer Assisted Intervention : International Workshops, LABELS 2019, HAL-MICCAI 2019, and CuRIOUS 2019, held in c...","volume":"11851 ","pages":"32-41"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7486010/pdf/nihms-1623450.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Large-Scale Annotation of Biomedical Data and Expert Label Synthesis and Hardware Aware Learning for Medical Imaging and Computer Assisted Intervention : International Workshops, LABELS 2019, HAL-MICCAI 2019, and CuRIOUS 2019, held in c...","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-030-33642-4_4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2019/10/24 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Due to difficulties in collecting sufficient training data, recent advances in neural-network-based methods have not been fully explored in the analysis of brain Magnetic Resonance Imaging (MRI). A possible solution to the limited-data issue is to augment the training set with synthetically generated data. In this paper, we propose a data augmentation strategy based on regional feature substitution. We demonstrate the advantages of this strategy with respect to training a simple neural-network-based classifier in predicting when individual youth transition from no-to-low to medium-to-heavy alcohol drinkers solely based on their volumetric MRI measurements. Based on 20-fold cross-validation, we generate more than one million synthetic samples from less than 500 subjects for each training run. The classifier achieves an accuracy of 74.1% in correctly distinguishing non-drinkers from drinkers at baseline and a 43.2% weighted accuracy in predicting the transition over a three year period (5-group classification task). Both accuracy scores are significantly better than training the classifier on the original dataset.

基于区域磁共振成像体积分数替代的数据扩增
由于难以收集到足够的训练数据,基于神经网络的方法在脑部磁共振成像(MRI)分析方面的最新进展尚未得到充分探索。解决数据有限问题的一个可行办法是用合成数据来增加训练集。在本文中,我们提出了一种基于区域特征替换的数据增强策略。我们展示了这一策略在训练基于神经网络的简单分类器方面的优势,该分类器可以仅根据青少年的核磁共振成像容积测量结果预测他们何时从不善饮酒者过渡到中重度饮酒者。在 20 倍交叉验证的基础上,我们在每次训练运行中从不到 500 名受试者中生成了超过一百万个合成样本。分类器在正确区分基线非饮酒者和饮酒者方面的准确率为 74.1%,在预测三年内的转变方面的加权准确率为 43.2%(5 组分类任务)。这两个准确率都明显高于在原始数据集上训练分类器的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信