Proteomic signatures predict preeclampsia in individual cohorts but not across cohorts - implications for clinical biomarker studies.

Mohammad S Ghaemi, Adi L Tarca, Roberto Romero, Natalie Stanley, Ramin Fallahzadeh, Athena Tanada, Anthony Culos, Kazuo Ando, Xiaoyuan Han, Yair J Blumenfeld, Maurice L Druzin, Yasser Y El-Sayed, Ronald S Gibbs, Virginia D Winn, Kevin Contrepois, Xuefeng B Ling, Ronald J Wong, Gary M Shaw, David K Stevenson, Brice Gaudilliere, Nima Aghaeepour, Martin S Angst
{"title":"Proteomic signatures predict preeclampsia in individual cohorts but not across cohorts - implications for clinical biomarker studies.","authors":"Mohammad S Ghaemi,&nbsp;Adi L Tarca,&nbsp;Roberto Romero,&nbsp;Natalie Stanley,&nbsp;Ramin Fallahzadeh,&nbsp;Athena Tanada,&nbsp;Anthony Culos,&nbsp;Kazuo Ando,&nbsp;Xiaoyuan Han,&nbsp;Yair J Blumenfeld,&nbsp;Maurice L Druzin,&nbsp;Yasser Y El-Sayed,&nbsp;Ronald S Gibbs,&nbsp;Virginia D Winn,&nbsp;Kevin Contrepois,&nbsp;Xuefeng B Ling,&nbsp;Ronald J Wong,&nbsp;Gary M Shaw,&nbsp;David K Stevenson,&nbsp;Brice Gaudilliere,&nbsp;Nima Aghaeepour,&nbsp;Martin S Angst","doi":"10.1080/14767058.2021.1888915","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Early identification of pregnant women at risk for preeclampsia (PE) is important, as it will enable targeted interventions ahead of clinical manifestations. The quantitative analyses of plasma proteins feature prominently among molecular approaches used for risk prediction. However, derivation of protein signatures of sufficient predictive power has been challenging. The recent availability of platforms simultaneously assessing over 1000 plasma proteins offers broad examinations of the plasma proteome, which may enable the extraction of proteomic signatures with improved prognostic performance in prenatal care.</p><p><strong>Objective: </strong>The primary aim of this study was to examine the generalizability of proteomic signatures predictive of PE in two cohorts of pregnant women whose plasma proteome was interrogated with the same highly multiplexed platform. Establishing generalizability, or lack thereof, is critical to devise strategies facilitating the development of clinically useful predictive tests. A second aim was to examine the generalizability of protein signatures predictive of gestational age (GA) in uncomplicated pregnancies in the same cohorts to contrast physiological and pathological pregnancy outcomes.</p><p><strong>Study design: </strong>Serial blood samples were collected during the first, second, and third trimesters in 18 women who developed PE and 18 women with uncomplicated pregnancies (Stanford cohort). The second cohort (Detroit), used for comparative analysis, consisted of 76 women with PE and 90 women with uncomplicated pregnancies. Multivariate analyses were applied to infer predictive and cohort-specific proteomic models, which were then tested in the alternate cohort. Gene ontology (GO) analysis was performed to identify biological processes that were over-represented among top-ranked proteins associated with PE.</p><p><strong>Results: </strong>The model derived in the Stanford cohort was highly significant (<i>p</i> = 3.9E-15) and predictive (AUC = 0.96), but failed validation in the Detroit cohort (<i>p</i> = 9.7E-01, AUC = 0.50). Similarly, the model derived in the Detroit cohort was highly significant (<i>p</i> = 1.0E-21, AUC = 0.73), but failed validation in the Stanford cohort (<i>p</i> = 7.3E-02, AUC = 0.60). By contrast, proteomic models predicting GA were readily validated across the Stanford (<i>p</i> = 1.1E-454, <i>R</i> = 0.92) and Detroit cohorts (<i>p</i> = 1.1.E-92, <i>R</i> = 0.92) indicating that the proteomic assay performed well enough to infer a generalizable model across studied cohorts, which makes it less likely that technical aspects of the assay, including batch effects, accounted for observed differences.</p><p><strong>Conclusions: </strong>Results point to a broader issue relevant for proteomic and other omic discovery studies in patient cohorts suffering from a clinical syndrome, such as PE, driven by heterogeneous pathophysiologies. While novel technologies including highly multiplex proteomic arrays and adapted computational algorithms allow for novel discoveries for a particular study cohort, they may not readily generalize across cohorts. A likely reason is that the prevalence of pathophysiologic processes leading up to the \"same\" clinical syndrome can be distributed differently in different and smaller-sized cohorts. Signatures derived in individual cohorts may simply capture different facets of the spectrum of pathophysiologic processes driving a syndrome. Our findings have important implications for the design of omic studies of a syndrome like PE. They highlight the need for performing such studies in diverse and well-phenotyped patient populations that are large enough to characterize subsets of patients with shared pathophysiologies to then derive subset-specific signatures of sufficient predictive power.</p>","PeriodicalId":520807,"journal":{"name":"The journal of maternal-fetal & neonatal medicine : the official journal of the European Association of Perinatal Medicine, the Federation of Asia and Oceania Perinatal Societies, the International Society of Perinatal Obstetricians","volume":" ","pages":"5621-5628"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/14767058.2021.1888915","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The journal of maternal-fetal & neonatal medicine : the official journal of the European Association of Perinatal Medicine, the Federation of Asia and Oceania Perinatal Societies, the International Society of Perinatal Obstetricians","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/14767058.2021.1888915","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/3/2 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

Background: Early identification of pregnant women at risk for preeclampsia (PE) is important, as it will enable targeted interventions ahead of clinical manifestations. The quantitative analyses of plasma proteins feature prominently among molecular approaches used for risk prediction. However, derivation of protein signatures of sufficient predictive power has been challenging. The recent availability of platforms simultaneously assessing over 1000 plasma proteins offers broad examinations of the plasma proteome, which may enable the extraction of proteomic signatures with improved prognostic performance in prenatal care.

Objective: The primary aim of this study was to examine the generalizability of proteomic signatures predictive of PE in two cohorts of pregnant women whose plasma proteome was interrogated with the same highly multiplexed platform. Establishing generalizability, or lack thereof, is critical to devise strategies facilitating the development of clinically useful predictive tests. A second aim was to examine the generalizability of protein signatures predictive of gestational age (GA) in uncomplicated pregnancies in the same cohorts to contrast physiological and pathological pregnancy outcomes.

Study design: Serial blood samples were collected during the first, second, and third trimesters in 18 women who developed PE and 18 women with uncomplicated pregnancies (Stanford cohort). The second cohort (Detroit), used for comparative analysis, consisted of 76 women with PE and 90 women with uncomplicated pregnancies. Multivariate analyses were applied to infer predictive and cohort-specific proteomic models, which were then tested in the alternate cohort. Gene ontology (GO) analysis was performed to identify biological processes that were over-represented among top-ranked proteins associated with PE.

Results: The model derived in the Stanford cohort was highly significant (p = 3.9E-15) and predictive (AUC = 0.96), but failed validation in the Detroit cohort (p = 9.7E-01, AUC = 0.50). Similarly, the model derived in the Detroit cohort was highly significant (p = 1.0E-21, AUC = 0.73), but failed validation in the Stanford cohort (p = 7.3E-02, AUC = 0.60). By contrast, proteomic models predicting GA were readily validated across the Stanford (p = 1.1E-454, R = 0.92) and Detroit cohorts (p = 1.1.E-92, R = 0.92) indicating that the proteomic assay performed well enough to infer a generalizable model across studied cohorts, which makes it less likely that technical aspects of the assay, including batch effects, accounted for observed differences.

Conclusions: Results point to a broader issue relevant for proteomic and other omic discovery studies in patient cohorts suffering from a clinical syndrome, such as PE, driven by heterogeneous pathophysiologies. While novel technologies including highly multiplex proteomic arrays and adapted computational algorithms allow for novel discoveries for a particular study cohort, they may not readily generalize across cohorts. A likely reason is that the prevalence of pathophysiologic processes leading up to the "same" clinical syndrome can be distributed differently in different and smaller-sized cohorts. Signatures derived in individual cohorts may simply capture different facets of the spectrum of pathophysiologic processes driving a syndrome. Our findings have important implications for the design of omic studies of a syndrome like PE. They highlight the need for performing such studies in diverse and well-phenotyped patient populations that are large enough to characterize subsets of patients with shared pathophysiologies to then derive subset-specific signatures of sufficient predictive power.

Abstract Image

蛋白质组学特征在个体队列中预测子痫前期,但不能跨队列预测-临床生物标志物研究的意义。
背景:早期识别有子痫前期(PE)风险的孕妇是很重要的,因为它可以在临床表现之前进行有针对性的干预。血浆蛋白的定量分析在用于风险预测的分子方法中占有突出地位。然而,推导具有足够预测能力的蛋白质特征一直具有挑战性。最近可同时评估超过1000种血浆蛋白的平台提供了对血浆蛋白质组的广泛检查,这可能使提取蛋白质组特征成为可能,并改善产前护理的预后表现。目的:本研究的主要目的是研究两组孕妇的蛋白质组特征预测PE的普遍性,这两组孕妇的血浆蛋白质组都是用相同的高度复用平台进行检测的。建立普遍性,或缺乏它,是至关重要的设计策略,促进临床有用的预测测试的发展。第二个目的是在相同的队列中检查预测无并发症妊娠中胎龄(GA)的蛋白质特征的普遍性,以对比生理和病理妊娠结局。研究设计:在18名发生PE的妇女和18名无并发症妊娠的妇女(斯坦福队列)的第一、第二和第三孕期收集了连续的血液样本。第二个队列(Detroit)用于比较分析,包括76名PE妇女和90名无并发症妊娠妇女。应用多变量分析来推断预测性和队列特异性蛋白质组学模型,然后在备用队列中进行测试。进行基因本体论(GO)分析,以确定与PE相关的顶级蛋白质中过度代表的生物过程。结果:在斯坦福队列中建立的模型具有高度显著性(p = 3.9E-15)和预测性(AUC = 0.96),但在底特律队列中未能得到验证(p = 9.7E-01, AUC = 0.50)。同样,在底特律队列中推导的模型非常显著(p = 1.0E-21, AUC = 0.73),但在斯坦福队列中未能得到验证(p = 7.3E-02, AUC = 0.60)。相比之下,预测GA的蛋白质组学模型很容易在斯坦福(p = 1.1E-454, R = 0.92)和底特律(p = 1.1)队列中得到验证。E-92, R = 0.92),表明蛋白质组学分析表现良好,足以在研究队列中推断出可推广的模型,这使得包括批效应在内的分析技术方面不太可能解释观察到的差异。结论:研究结果指出了一个与蛋白质组学和其他组学发现研究相关的更广泛的问题,这些研究涉及患有临床综合征(如PE)的患者队列,由异质病理生理驱动。虽然包括高度多元蛋白质组学阵列和适应性计算算法在内的新技术允许对特定研究队列进行新发现,但它们可能不容易在整个队列中推广。一个可能的原因是,导致“相同”临床综合征的病理生理过程的患病率可能在不同和较小规模的队列中分布不同。在个体队列中获得的特征可能只是捕获了驱动综合征的病理生理过程谱的不同方面。我们的发现对设计像PE这样的综合征的组学研究具有重要意义。他们强调需要在多样化和表型良好的患者群体中进行这样的研究,这些患者群体足够大,可以表征具有共同病理生理的患者亚群,然后得出具有足够预测能力的亚群特异性特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信