ImageNomer:描述功能连接和组学分析工具以及识别种族混淆的案例研究

Q4 Neuroscience
Anton Orlichenko , Grant Daly , Ziyu Zhou , Anqi Liu , Hui Shen , Hong-Wen Deng , Yu-Ping Wang
{"title":"ImageNomer:描述功能连接和组学分析工具以及识别种族混淆的案例研究","authors":"Anton Orlichenko ,&nbsp;Grant Daly ,&nbsp;Ziyu Zhou ,&nbsp;Anqi Liu ,&nbsp;Hui Shen ,&nbsp;Hong-Wen Deng ,&nbsp;Yu-Ping Wang","doi":"10.1016/j.ynirp.2023.100191","DOIUrl":null,"url":null,"abstract":"<div><p>Most packages for the analysis of fMRI-based functional connectivity (FC) and genomic data are used with a programming language interface, lacking an easy-to-navigate GUI frontend. This exacerbates two problems found in these types of data: demographic confounds and quality control in the face of high dimensionality of features. The reason is that it is too slow and cumbersome to use a programming interface to create all the necessary visualizations required to identify all correlations, confounding effects, or quality control problems in a dataset. FC in particular usually contains tens of thousands of features per subject, and can only be summarized and efficiently explored using visualizations. To remedy this situation, we have developed ImageNomer, a data visualization and analysis tool that allows inspection of both subject-level and cohort-level demographic, genomic, and imaging features. The software is Python-based, runs in a self-contained Docker image, and contains a browser-based GUI frontend. We demonstrate the usefulness of ImageNomer by identifying an unexpected race confound when predicting achievement scores in the Philadelphia Neurodevelopmental Cohort (PNC) dataset, which contains multitask fMRI and single nucleotide polymorphism (SNP) data of healthy adolescents. In the past, many studies have attempted to use FC to identify achievement-related features in fMRI. Using ImageNomer to visualize trends in achievement scores between races, we find a clear potential for confounding effects if race can be predicted using FC. Using correlation analysis in the ImageNomer software, we show that FCs correlated with Wide Range Achievement Test (WRAT) score are in fact more highly correlated with race. Investigating further, we find that whereas both FC and SNP (genomic) features can account for 10–15% of WRAT score variation, this predictive ability disappears when controlling for race. We also use ImageNomer to investigate race-FC correlation in the Bipolar and Schizophrenia Network for Intermediate Phenotypes (BSNIP) dataset. In this work, we demonstrate the advantage of our ImageNomer GUI tool in data exploration and confound detection. Additionally, this work identifies race as a strong confound in FC data and casts doubt on the possibility of finding unbiased achievement-related features in fMRI and SNP data of healthy adolescents.</p></div>","PeriodicalId":74277,"journal":{"name":"Neuroimage. Reports","volume":"3 4","pages":"Article 100191"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666956023000363/pdfft?md5=63d2f8e7b388542a1c1f1741653bcaa3&pid=1-s2.0-S2666956023000363-main.pdf","citationCount":"0","resultStr":"{\"title\":\"ImageNomer: Description of a functional connectivity and omics analysis tool and case study identifying a race confound\",\"authors\":\"Anton Orlichenko ,&nbsp;Grant Daly ,&nbsp;Ziyu Zhou ,&nbsp;Anqi Liu ,&nbsp;Hui Shen ,&nbsp;Hong-Wen Deng ,&nbsp;Yu-Ping Wang\",\"doi\":\"10.1016/j.ynirp.2023.100191\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Most packages for the analysis of fMRI-based functional connectivity (FC) and genomic data are used with a programming language interface, lacking an easy-to-navigate GUI frontend. This exacerbates two problems found in these types of data: demographic confounds and quality control in the face of high dimensionality of features. The reason is that it is too slow and cumbersome to use a programming interface to create all the necessary visualizations required to identify all correlations, confounding effects, or quality control problems in a dataset. FC in particular usually contains tens of thousands of features per subject, and can only be summarized and efficiently explored using visualizations. To remedy this situation, we have developed ImageNomer, a data visualization and analysis tool that allows inspection of both subject-level and cohort-level demographic, genomic, and imaging features. The software is Python-based, runs in a self-contained Docker image, and contains a browser-based GUI frontend. We demonstrate the usefulness of ImageNomer by identifying an unexpected race confound when predicting achievement scores in the Philadelphia Neurodevelopmental Cohort (PNC) dataset, which contains multitask fMRI and single nucleotide polymorphism (SNP) data of healthy adolescents. In the past, many studies have attempted to use FC to identify achievement-related features in fMRI. Using ImageNomer to visualize trends in achievement scores between races, we find a clear potential for confounding effects if race can be predicted using FC. Using correlation analysis in the ImageNomer software, we show that FCs correlated with Wide Range Achievement Test (WRAT) score are in fact more highly correlated with race. Investigating further, we find that whereas both FC and SNP (genomic) features can account for 10–15% of WRAT score variation, this predictive ability disappears when controlling for race. We also use ImageNomer to investigate race-FC correlation in the Bipolar and Schizophrenia Network for Intermediate Phenotypes (BSNIP) dataset. In this work, we demonstrate the advantage of our ImageNomer GUI tool in data exploration and confound detection. Additionally, this work identifies race as a strong confound in FC data and casts doubt on the possibility of finding unbiased achievement-related features in fMRI and SNP data of healthy adolescents.</p></div>\",\"PeriodicalId\":74277,\"journal\":{\"name\":\"Neuroimage. Reports\",\"volume\":\"3 4\",\"pages\":\"Article 100191\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666956023000363/pdfft?md5=63d2f8e7b388542a1c1f1741653bcaa3&pid=1-s2.0-S2666956023000363-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neuroimage. Reports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666956023000363\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Neuroscience\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroimage. Reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666956023000363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Neuroscience","Score":null,"Total":0}
引用次数: 0

摘要

大多数用于分析基于fmri的功能连接(FC)和基因组数据的软件包都使用编程语言界面,缺乏易于导航的GUI前端。这加剧了在这类数据中发现的两个问题:人口统计混淆和面对高维特征的质量控制。原因是,使用编程接口创建识别数据集中的所有相关性、混淆效应或质量控制问题所需的所有必要的可视化,速度太慢,也太麻烦。特别是FC,每个主题通常包含数万个特征,只能使用可视化来总结和有效地探索。为了纠正这种情况,我们开发了ImageNomer,这是一种数据可视化和分析工具,可以检查受试者水平和队列水平的人口统计学、基因组学和成像特征。该软件基于python,运行在一个独立的Docker镜像中,并包含一个基于浏览器的GUI前端。我们通过在费城神经发育队列(PNC)数据集(包含健康青少年的多任务功能磁共振成像和单核苷酸多态性(SNP)数据集)中预测成绩分数时识别意想不到的种族混淆,证明了ImageNomer的实用性。在过去,许多研究试图使用FC来识别fMRI中与成就相关的特征。使用ImageNomer可视化不同种族之间成就分数的趋势,我们发现如果可以使用FC来预测种族,就有可能产生混淆效应。通过ImageNomer软件的相关分析,我们发现与WRAT成绩相关的FCs实际上与种族的相关性更高。进一步调查发现,尽管FC和SNP(基因组)特征都可以解释WRAT评分变化的10-15%,但当控制种族时,这种预测能力就消失了。我们还使用ImageNomer来研究双相情感障碍和精神分裂症中间表型网络(BSNIP)数据集中的种族- fc相关性。在这项工作中,我们展示了ImageNomer GUI工具在数据探索和混淆检测方面的优势。此外,这项工作确定种族是FC数据中的一个严重混淆因素,并对在健康青少年的fMRI和SNP数据中发现公正的成就相关特征的可能性表示怀疑。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

ImageNomer: Description of a functional connectivity and omics analysis tool and case study identifying a race confound

ImageNomer: Description of a functional connectivity and omics analysis tool and case study identifying a race confound

Most packages for the analysis of fMRI-based functional connectivity (FC) and genomic data are used with a programming language interface, lacking an easy-to-navigate GUI frontend. This exacerbates two problems found in these types of data: demographic confounds and quality control in the face of high dimensionality of features. The reason is that it is too slow and cumbersome to use a programming interface to create all the necessary visualizations required to identify all correlations, confounding effects, or quality control problems in a dataset. FC in particular usually contains tens of thousands of features per subject, and can only be summarized and efficiently explored using visualizations. To remedy this situation, we have developed ImageNomer, a data visualization and analysis tool that allows inspection of both subject-level and cohort-level demographic, genomic, and imaging features. The software is Python-based, runs in a self-contained Docker image, and contains a browser-based GUI frontend. We demonstrate the usefulness of ImageNomer by identifying an unexpected race confound when predicting achievement scores in the Philadelphia Neurodevelopmental Cohort (PNC) dataset, which contains multitask fMRI and single nucleotide polymorphism (SNP) data of healthy adolescents. In the past, many studies have attempted to use FC to identify achievement-related features in fMRI. Using ImageNomer to visualize trends in achievement scores between races, we find a clear potential for confounding effects if race can be predicted using FC. Using correlation analysis in the ImageNomer software, we show that FCs correlated with Wide Range Achievement Test (WRAT) score are in fact more highly correlated with race. Investigating further, we find that whereas both FC and SNP (genomic) features can account for 10–15% of WRAT score variation, this predictive ability disappears when controlling for race. We also use ImageNomer to investigate race-FC correlation in the Bipolar and Schizophrenia Network for Intermediate Phenotypes (BSNIP) dataset. In this work, we demonstrate the advantage of our ImageNomer GUI tool in data exploration and confound detection. Additionally, this work identifies race as a strong confound in FC data and casts doubt on the possibility of finding unbiased achievement-related features in fMRI and SNP data of healthy adolescents.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Neuroimage. Reports
Neuroimage. Reports Neuroscience (General)
CiteScore
1.90
自引率
0.00%
发文量
0
审稿时长
87 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信