msqrob2TMT: Robust Linear Mixed Models for Inferring Differential Abundant Proteins in Labeled Experiments With Arbitrarily Complex Design.

IF 5.5 2区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

Molecular & Cellular Proteomics Pub Date : 2025-07-01 Epub Date: 2025-05-30 DOI:10.1016/j.mcpro.2025.101002

Stijn Vandenbulcke, Christophe Vanderaa, Oliver Crook, Lennart Martens, Lieven Clement

{"title":"msqrob2TMT: Robust Linear Mixed Models for Inferring Differential Abundant Proteins in Labeled Experiments With Arbitrarily Complex Design.","authors":"Stijn Vandenbulcke, Christophe Vanderaa, Oliver Crook, Lennart Martens, Lieven Clement","doi":"10.1016/j.mcpro.2025.101002","DOIUrl":null,"url":null,"abstract":"<p><p>Labeling strategies in mass spectrometry-based proteomics enhance sample throughput by enabling the acquisition of multiplexed samples within a single run. However, contemporary experiments often involve increasingly complex designs, where the number of samples exceeds the capacity of a single run, resulting in a complex correlation structure that must be addressed for accurate statistical inference and reliable biomarker discovery. To this end, we introduce msqrob2TMT, a suite of mixed model-based workflows specifically designed for differential abundance analysis in labeled mass spectrometry-based proteomics data. msqrob2TMT accommodates both sample-specific and feature-specific (e.g., peptide or protein) covariates, facilitating inference in experiments with arbitrarily complex designs and allowing for explicit correction of feature-specific covariates. We benchmark our innovative workflows against state-of-the-art tools, including DEqMS, MSstatsTMT, and msTrawler, using two spike-in studies. Our findings demonstrate that msqrob2TMT offers greater flexibility, improved modularity, and enhanced performance, particularly through the application of robust ridge regression. Finally, we demonstrate the practical relevance of msqrob2TMT in a real mouse study, highlighting its capacity to effectively account for the complex correlation structure in the data.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101002"},"PeriodicalIF":5.5000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12365501/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular & Cellular Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.mcpro.2025.101002","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/30 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Labeling strategies in mass spectrometry-based proteomics enhance sample throughput by enabling the acquisition of multiplexed samples within a single run. However, contemporary experiments often involve increasingly complex designs, where the number of samples exceeds the capacity of a single run, resulting in a complex correlation structure that must be addressed for accurate statistical inference and reliable biomarker discovery. To this end, we introduce msqrob2TMT, a suite of mixed model-based workflows specifically designed for differential abundance analysis in labeled mass spectrometry-based proteomics data. msqrob2TMT accommodates both sample-specific and feature-specific (e.g., peptide or protein) covariates, facilitating inference in experiments with arbitrarily complex designs and allowing for explicit correction of feature-specific covariates. We benchmark our innovative workflows against state-of-the-art tools, including DEqMS, MSstatsTMT, and msTrawler, using two spike-in studies. Our findings demonstrate that msqrob2TMT offers greater flexibility, improved modularity, and enhanced performance, particularly through the application of robust ridge regression. Finally, we demonstrate the practical relevance of msqrob2TMT in a real mouse study, highlighting its capacity to effectively account for the complex correlation structure in the data.

查看原文本刊更多论文

msqrob2TMT：在任意复杂设计的标记实验中推断差异丰富蛋白的鲁棒线性混合模型。

标记策略在质谱（MS）为基础的蛋白质组学提高样品吞吐量，使获取多路样品在单次运行。然而，当代实验往往涉及越来越复杂的设计，其中样本数量超过单次运行的能力，导致复杂的相关结构，必须解决准确的统计推断和可靠的生物标志物发现。为此，我们引入了msqrob2TMT，这是一套基于混合模型的工作流程，专门用于标记MS-based蛋白质组学数据的差异丰度分析。msqrob2TMT可容纳样本特异性和特征特异性（如肽或蛋白质）协变量，便于在任意复杂设计的实验中进行推断，并允许明确校正特征特异性协变量。我们将我们的创新工作流程与最先进的工具（包括DEqMS， MSstatsTMT和msTrawler）进行对比，使用两个峰值研究。我们的研究结果表明，msqrob2TMT提供了更大的灵活性，改进的模块化和增强的性能，特别是通过应用鲁棒脊回归。最后，我们在真实的小鼠研究中展示了msqrob2TMT的实际相关性，突出了其有效解释数据中复杂关联结构的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Molecular & Cellular Proteomics 生物-生化研究方法

CiteScore

11.50

自引率

4.30%

发文量

131

审稿时长

84 days

期刊介绍： The mission of MCP is to foster the development and applications of proteomics in both basic and translational research. MCP will publish manuscripts that report significant new biological or clinical discoveries underpinned by proteomic observations across all kingdoms of life. Manuscripts must define the biological roles played by the proteins investigated or their mechanisms of action. The journal also emphasizes articles that describe innovative new computational methods and technological advancements that will enable future discoveries. Manuscripts describing such approaches do not have to include a solution to a biological problem, but must demonstrate that the technology works as described, is reproducible and is appropriate to uncover yet unknown protein/proteome function or properties using relevant model systems or publicly available data. Scope: -Fundamental studies in biology, including integrative "omics" studies, that provide mechanistic insights -Novel experimental and computational technologies -Proteogenomic data integration and analysis that enable greater understanding of physiology and disease processes -Pathway and network analyses of signaling that focus on the roles of post-translational modifications -Studies of proteome dynamics and quality controls, and their roles in disease -Studies of evolutionary processes effecting proteome dynamics, quality and regulation -Chemical proteomics, including mechanisms of drug action -Proteomics of the immune system and antigen presentation/recognition -Microbiome proteomics, host-microbe and host-pathogen interactions, and their roles in health and disease -Clinical and translational studies of human diseases -Metabolomics to understand functional connections between genes, proteins and phenotypes