MultANOVA Followed by Post Hoc Analyses for Designed High-Dimensional Data: A Comprehensive Framework That Outperforms ASCA, rMANOVA, and VASCA

IF 2.3 4区 化学 Q1 SOCIAL WORK
Benjamin Mahieu, Véronique Cariou
{"title":"MultANOVA Followed by Post Hoc Analyses for Designed High-Dimensional Data: A Comprehensive Framework That Outperforms ASCA, rMANOVA, and VASCA","authors":"Benjamin Mahieu,&nbsp;Véronique Cariou","doi":"10.1002/cem.70039","DOIUrl":null,"url":null,"abstract":"<p>Analytical platforms generate high-dimensional data, where the number of variables usually exceeds the number of observations. Such data are frequently derived from an experimental design, where samples have been collected to identify potential variation in the factors or interactions of interest. To circumvent issues related to large data sizes when evaluating factor and interaction effects, ANOVA simultaneous component analysis (ASCA), regularized multivariate analysis of variance (rMANOVA), and variable selection ASCA (VASCA) have been proposed previously. However, they require computationally intensive methods to test the effects of factors and interactions. In the present paper, multiple ANOVAs (MultANOVA) is proposed as a simple yet effective alternative to the above methods. MultANOVA has the advantage of being direct and fast, as it does not rely on intensive calculation methods, while incorporating a variable selection strategy. This method entails the execution of multiple ANOVAs, one per variable, with multiple test corrections. Subsequent post hoc analyses are also introduced. These encompass multiple least-squares difference tests (MultLSD) for the pairwise comparison of multivariate least-squares means and diagonal canonical discriminant analysis (DCDA) with approximate confidence ellipses to visualize significant effects. MultANOVA is compared to the aforementioned methods based on simulations, which demonstrate that it holds the nominal alpha risk as opposed to rMANOVA and VASCA, while being more powerful than ASCA and VASCA. Even though MultANOVA is proven less powerful than VASCA for variable selection, it has been demonstrated to hold the nominal risk, whereas VASCA does not. Finally, the MultANOVA framework is illustrated based on metagenomics, metabolomics, and spectroscopic data.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 7","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70039","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemometrics","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cem.70039","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL WORK","Score":null,"Total":0}
引用次数: 0

Abstract

Analytical platforms generate high-dimensional data, where the number of variables usually exceeds the number of observations. Such data are frequently derived from an experimental design, where samples have been collected to identify potential variation in the factors or interactions of interest. To circumvent issues related to large data sizes when evaluating factor and interaction effects, ANOVA simultaneous component analysis (ASCA), regularized multivariate analysis of variance (rMANOVA), and variable selection ASCA (VASCA) have been proposed previously. However, they require computationally intensive methods to test the effects of factors and interactions. In the present paper, multiple ANOVAs (MultANOVA) is proposed as a simple yet effective alternative to the above methods. MultANOVA has the advantage of being direct and fast, as it does not rely on intensive calculation methods, while incorporating a variable selection strategy. This method entails the execution of multiple ANOVAs, one per variable, with multiple test corrections. Subsequent post hoc analyses are also introduced. These encompass multiple least-squares difference tests (MultLSD) for the pairwise comparison of multivariate least-squares means and diagonal canonical discriminant analysis (DCDA) with approximate confidence ellipses to visualize significant effects. MultANOVA is compared to the aforementioned methods based on simulations, which demonstrate that it holds the nominal alpha risk as opposed to rMANOVA and VASCA, while being more powerful than ASCA and VASCA. Even though MultANOVA is proven less powerful than VASCA for variable selection, it has been demonstrated to hold the nominal risk, whereas VASCA does not. Finally, the MultANOVA framework is illustrated based on metagenomics, metabolomics, and spectroscopic data.

Abstract Image

设计高维数据的事后分析:优于ASCA、rMANOVA和VASCA的综合框架
分析平台生成高维数据,其中变量的数量通常超过观测的数量。这些数据通常来自实验设计,其中收集样本以确定感兴趣的因素或相互作用的潜在变化。为了避免在评估因素和相互作用效应时与大数据量相关的问题,之前已经提出了ANOVA同步成分分析(ASCA),正则化多变量方差分析(rMANOVA)和变量选择ASCA (VASCA)。然而,它们需要计算密集的方法来测试因素和相互作用的影响。在本文中,多重方差分析(MultANOVA)被提出作为一种简单而有效的替代上述方法。MultANOVA具有直接和快速的优点,因为它不依赖于密集的计算方法,同时结合了变量选择策略。该方法需要执行多个anova,每个变量一个,具有多个测试更正。随后的事后分析也被介绍。这些包括多个最小二乘差异检验(MultLSD),用于对多变量最小二乘均值进行两两比较,并使用近似置信椭圆对角典型判别分析(DCDA)来可视化显着效果。MultANOVA与上述基于模拟的方法进行了比较,结果表明,与rMANOVA和VASCA相比,MultANOVA具有名义上的alpha风险,而比ASCA和VASCA更强大。尽管MultANOVA被证明在变量选择方面不如VASCA强大,但它已被证明具有名义风险,而VASCA则没有。最后,基于宏基因组学、代谢组学和光谱数据阐述了MultANOVA框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Chemometrics
Journal of Chemometrics 化学-分析化学
CiteScore
5.20
自引率
8.30%
发文量
78
审稿时长
2 months
期刊介绍: The Journal of Chemometrics is devoted to the rapid publication of original scientific papers, reviews and short communications on fundamental and applied aspects of chemometrics. It also provides a forum for the exchange of information on meetings and other news relevant to the growing community of scientists who are interested in chemometrics and its applications. Short, critical review papers are a particularly important feature of the journal, in view of the multidisciplinary readership at which it is aimed.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信