Sanghun Lee, Rachel S. Kelly, Kevin M. Mendez, Dmitry Prokopenko, Georg Hahn, Sharon M. Lutz, Juan C. Celedón, Clary B. Clish, Scott T. Weiss, Christoph Lange, Jessica A. Lasky-Su, Julian Hecker, NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium
{"title":"On the analysis of metabolite quantitative trait loci: Impact of different data transformations and study designs","authors":"Sanghun Lee, Rachel S. Kelly, Kevin M. Mendez, Dmitry Prokopenko, Georg Hahn, Sharon M. Lutz, Juan C. Celedón, Clary B. Clish, Scott T. Weiss, Christoph Lange, Jessica A. Lasky-Su, Julian Hecker, NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium","doi":"10.1126/sciadv.adp4532","DOIUrl":null,"url":null,"abstract":"<div >Metabolomic genome-wide association studies (mGWASs), or metabolomic quantitative trait locus (metQTL) analyses, are gaining growing attention. However, robust methods and analysis guidelines, vital to address the complexity of metabolomic data, remain to be established. Here, we use whole-genome sequencing and metabolomic data from two independent studies to compare different approaches. We adopted three popular data transformation methods for metabolite levels—(i) log<sub>10</sub> transformation, (ii) rank inverse normal transformation, and (iii) a fully adjusted two-step procedure—and compared population-based versus family-based analysis approaches. For validation, we performed permutation-based testing, Huber regression, and independent replication analysis. Simulation studies were used to illustrate the observed differences between data transformations. We demonstrate the advantages and limitations of popular analytic strategies used in mGWASs where especially low-frequency variants in combination with a skewed metabolite measurement distribution can lead to potentially false-positive metQTL findings. We recommend the rank inverse normal transformation or robust test statistics such as in family-based association tests as reliable approaches for mGWASs.</div>","PeriodicalId":21609,"journal":{"name":"Science Advances","volume":"11 15","pages":""},"PeriodicalIF":11.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.science.org/doi/reader/10.1126/sciadv.adp4532","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Advances","FirstCategoryId":"103","ListUrlMain":"https://www.science.org/doi/10.1126/sciadv.adp4532","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Metabolomic genome-wide association studies (mGWASs), or metabolomic quantitative trait locus (metQTL) analyses, are gaining growing attention. However, robust methods and analysis guidelines, vital to address the complexity of metabolomic data, remain to be established. Here, we use whole-genome sequencing and metabolomic data from two independent studies to compare different approaches. We adopted three popular data transformation methods for metabolite levels—(i) log10 transformation, (ii) rank inverse normal transformation, and (iii) a fully adjusted two-step procedure—and compared population-based versus family-based analysis approaches. For validation, we performed permutation-based testing, Huber regression, and independent replication analysis. Simulation studies were used to illustrate the observed differences between data transformations. We demonstrate the advantages and limitations of popular analytic strategies used in mGWASs where especially low-frequency variants in combination with a skewed metabolite measurement distribution can lead to potentially false-positive metQTL findings. We recommend the rank inverse normal transformation or robust test statistics such as in family-based association tests as reliable approaches for mGWASs.
期刊介绍:
Science Advances, an open-access journal by AAAS, publishes impactful research in diverse scientific areas. It aims for fair, fast, and expert peer review, providing freely accessible research to readers. Led by distinguished scientists, the journal supports AAAS's mission by extending Science magazine's capacity to identify and promote significant advances. Evolving digital publishing technologies play a crucial role in advancing AAAS's global mission for science communication and benefitting humankind.