Amanda M Figueroa-Navedo, Rohan Kapre, Tushita Gupta, Yingrong Xu, Clifford G Phaneuf, Pierre M Jean Beltran, Liang Xue, Alexander R Ivanov, Olga Vitek
{"title":"MSstatsTMT improves accuracy of thermal proteome profiling by trading off temperature treatments and biological replicates.","authors":"Amanda M Figueroa-Navedo, Rohan Kapre, Tushita Gupta, Yingrong Xu, Clifford G Phaneuf, Pierre M Jean Beltran, Liang Xue, Alexander R Ivanov, Olga Vitek","doi":"10.1016/j.mcpro.2025.100999","DOIUrl":null,"url":null,"abstract":"<p><p>Thermal proteome profiling investigates protein-protein, protein-nucleic acid, or protein-drug interactions, and the impact of metabolite binding and post-translational modifications on these interactions. The experiments quantitatively characterize biological samples treated with small molecules versus controls, and subjected to timed exposures to multiple temperatures. Typically, each enzymatically digested sample is labeled with a tandem mass tag (TMT), where each TMT channel corresponds to a specific temperature treatment, and profiled using liquid chromatography coupled with mass spectrometry in data-dependent data acquisition mode. The resulting mass spectra are processed with computational tools to identify and quantify proteins, and filter out noise. Protein-drug interactions are detected by fitting curves to the protein-level reporter ion abundances across the temperatures. Interacting proteins are identified by shifts in the fitted curves between treated samples and controls. In this manuscript, we focus on data processing and curve fitting in thermal proteome profiling. We review the statistical methods currently used for thermal proteome profiling, and demonstrate that such methods can yield substantially different results. We advocate for the statistical analysis strategy implemented in the open-source R package MSstatsTMT, as it does not require subjective pre-filtering of the data or curve fitting, and appropriately represents all the sources of variation. It supports experimental designs that trade-off temperatures for a larger number of biological replicates, and handles multiple drug concentrations or pools of samples treated with multiple temperatures, thus increasing the sensitivity of the results. We demonstrate these advantages of MSstatsTMT as compared to the currently used alternatives in a series of simulated and experimental datasets, which include conventional thermal proteome profiling and its OnePot counterpart that pools the samples treated at multiple temperatures into one sample, and incorporates multiple doses of a drug. The suggested MSstatsTMT-based workflow is documented in publicly available and fully reproducible R vignettes.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"100999"},"PeriodicalIF":6.1000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular & Cellular Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.mcpro.2025.100999","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Thermal proteome profiling investigates protein-protein, protein-nucleic acid, or protein-drug interactions, and the impact of metabolite binding and post-translational modifications on these interactions. The experiments quantitatively characterize biological samples treated with small molecules versus controls, and subjected to timed exposures to multiple temperatures. Typically, each enzymatically digested sample is labeled with a tandem mass tag (TMT), where each TMT channel corresponds to a specific temperature treatment, and profiled using liquid chromatography coupled with mass spectrometry in data-dependent data acquisition mode. The resulting mass spectra are processed with computational tools to identify and quantify proteins, and filter out noise. Protein-drug interactions are detected by fitting curves to the protein-level reporter ion abundances across the temperatures. Interacting proteins are identified by shifts in the fitted curves between treated samples and controls. In this manuscript, we focus on data processing and curve fitting in thermal proteome profiling. We review the statistical methods currently used for thermal proteome profiling, and demonstrate that such methods can yield substantially different results. We advocate for the statistical analysis strategy implemented in the open-source R package MSstatsTMT, as it does not require subjective pre-filtering of the data or curve fitting, and appropriately represents all the sources of variation. It supports experimental designs that trade-off temperatures for a larger number of biological replicates, and handles multiple drug concentrations or pools of samples treated with multiple temperatures, thus increasing the sensitivity of the results. We demonstrate these advantages of MSstatsTMT as compared to the currently used alternatives in a series of simulated and experimental datasets, which include conventional thermal proteome profiling and its OnePot counterpart that pools the samples treated at multiple temperatures into one sample, and incorporates multiple doses of a drug. The suggested MSstatsTMT-based workflow is documented in publicly available and fully reproducible R vignettes.
期刊介绍:
The mission of MCP is to foster the development and applications of proteomics in both basic and translational research. MCP will publish manuscripts that report significant new biological or clinical discoveries underpinned by proteomic observations across all kingdoms of life. Manuscripts must define the biological roles played by the proteins investigated or their mechanisms of action.
The journal also emphasizes articles that describe innovative new computational methods and technological advancements that will enable future discoveries. Manuscripts describing such approaches do not have to include a solution to a biological problem, but must demonstrate that the technology works as described, is reproducible and is appropriate to uncover yet unknown protein/proteome function or properties using relevant model systems or publicly available data.
Scope:
-Fundamental studies in biology, including integrative "omics" studies, that provide mechanistic insights
-Novel experimental and computational technologies
-Proteogenomic data integration and analysis that enable greater understanding of physiology and disease processes
-Pathway and network analyses of signaling that focus on the roles of post-translational modifications
-Studies of proteome dynamics and quality controls, and their roles in disease
-Studies of evolutionary processes effecting proteome dynamics, quality and regulation
-Chemical proteomics, including mechanisms of drug action
-Proteomics of the immune system and antigen presentation/recognition
-Microbiome proteomics, host-microbe and host-pathogen interactions, and their roles in health and disease
-Clinical and translational studies of human diseases
-Metabolomics to understand functional connections between genes, proteins and phenotypes