FastQTLmapping: an ultra-fast and memory efficient package for mQTL-like analysis.

IF 2.9 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS
Xingjian Gao, Jiarui Li, Xinxuan Liu, Qianqian Peng, Han Jing, Sibte Hadi, Andrew E Teschendorff, Sijia Wang, Fan Liu
{"title":"FastQTLmapping: an ultra-fast and memory efficient package for mQTL-like analysis.","authors":"Xingjian Gao, Jiarui Li, Xinxuan Liu, Qianqian Peng, Han Jing, Sibte Hadi, Andrew E Teschendorff, Sijia Wang, Fan Liu","doi":"10.1186/s12859-025-06130-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>FastQTLmapping addresses the need for an ultra-fast and memory-efficient solver capable of handling exhaustive multiple regression analysis with a vast number of dependent and explanatory variables, including covariates. This challenge is especially pronounced in methylation quantitative trait loci (mQTL)-like analysis, which typically involves high-dimensional genetic and epigenetic data.</p><p><strong>Results: </strong>FastQTLmapping is a precompiled C++ software solution accelerated by Intel MKL and GSL, freely available at https://github.com/Fun-Gene/fastQTLmapping . Compared to state-of-the-art methods (MatrixEQTL, FastQTL, and TensorQTL), fastQTLmapping demonstrated an order of magnitude speed improvement, coupled with a marked reduction in peak memory usage. In a large dataset consisting of 3500 individuals, 8 million SNPs, 0.8 million CpGs, and 20 covariates, fastQTLmapping completed the entire mQTL analysis in 4.5 h with only 13.1 GB peak memory usage.</p><p><strong>Conclusions: </strong>FastQTLmapping effectively expedites comprehensive mQTL analyses by providing a robust and generic approach that accommodates large-scale genomic datasets with covariates. This solution has the potential to streamline mQTL-like studies and inform future method development for efficient computational genomics.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"112"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12036243/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06130-3","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: FastQTLmapping addresses the need for an ultra-fast and memory-efficient solver capable of handling exhaustive multiple regression analysis with a vast number of dependent and explanatory variables, including covariates. This challenge is especially pronounced in methylation quantitative trait loci (mQTL)-like analysis, which typically involves high-dimensional genetic and epigenetic data.

Results: FastQTLmapping is a precompiled C++ software solution accelerated by Intel MKL and GSL, freely available at https://github.com/Fun-Gene/fastQTLmapping . Compared to state-of-the-art methods (MatrixEQTL, FastQTL, and TensorQTL), fastQTLmapping demonstrated an order of magnitude speed improvement, coupled with a marked reduction in peak memory usage. In a large dataset consisting of 3500 individuals, 8 million SNPs, 0.8 million CpGs, and 20 covariates, fastQTLmapping completed the entire mQTL analysis in 4.5 h with only 13.1 GB peak memory usage.

Conclusions: FastQTLmapping effectively expedites comprehensive mQTL analyses by providing a robust and generic approach that accommodates large-scale genomic datasets with covariates. This solution has the potential to streamline mQTL-like studies and inform future method development for efficient computational genomics.

FastQTLmapping:用于类似mqtl的分析的超快速且内存高效的包。
背景:FastQTLmapping解决了对超快速和内存高效的求解器的需求,该求解器能够处理包含大量因变量和解释变量(包括协变量)的详尽多元回归分析。这一挑战在甲基化数量性状位点(mQTL)样分析中尤其明显,这通常涉及高维遗传和表观遗传数据。结果:FastQTLmapping是一个由Intel MKL和GSL加速的预编译c++软件解决方案,可在https://github.com/Fun-Gene/fastQTLmapping免费获得。与最先进的方法(MatrixEQTL、FastQTL和TensorQTL)相比,fastQTLmapping的速度提高了一个数量级,同时显著降低了峰值内存使用。在一个由3500个个体、800万个snp、80万个cpg和20个协变量组成的大型数据集中,fastQTLmapping在4.5小时内完成了整个mQTL分析,峰值内存使用量仅为13.1 GB。结论:FastQTLmapping提供了一种强大而通用的方法,可以适应带有协变量的大规模基因组数据集,从而有效地加快了全面的mQTL分析。该解决方案有可能简化类似mqtl的研究,并为未来高效计算基因组学的方法开发提供信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Bioinformatics
BMC Bioinformatics 生物-生化研究方法
CiteScore
5.70
自引率
3.30%
发文量
506
审稿时长
4.3 months
期刊介绍: BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology. BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信