Harmonizing heterogeneous single-cell gene expression data with individual-level covariate information.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Bioinformatics advances Pub Date : 2025-08-09 eCollection Date: 2025-01-01 DOI:10.1093/bioadv/vbaf189
Yudi Mu, Wei Vivian Li
{"title":"Harmonizing heterogeneous single-cell gene expression data with individual-level covariate information.","authors":"Yudi Mu, Wei Vivian Li","doi":"10.1093/bioadv/vbaf189","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>The growing availability of single-cell RNA sequencing (scRNA-seq) data highlights the necessity for robust integration methods to uncover both shared and unique cellular features across samples. These datasets often exhibit technical variations and biological differences, complicating integrative analyses. While numerous integration methods have been proposed, many fail to account for individual-level covariates or are limited to discrete variables.</p><p><strong>Results: </strong>To address these limitations, we propose scINSIGHT2, a generalized linear latent variable model that accommodates both continuous covariates, such as age, and discrete factors, such as disease conditions. Through both simulation studies and real-data applications, we demonstrate that scINSIGHT2 accurately harmonizes scRNA-seq datasets, whether from single or multiple sources. These results highlight scINSIGHT2's utility in capturing meaningful biological insights from scRNA-seq data while accounting for individual-level variation.</p><p><strong>Availability and implementation: </strong>The scINSIGHT2 method has been implemented as a R package, which is available at https://github.com/yudimu/scINSIGHT2/.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf189"},"PeriodicalIF":2.8000,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12380451/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf189","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: The growing availability of single-cell RNA sequencing (scRNA-seq) data highlights the necessity for robust integration methods to uncover both shared and unique cellular features across samples. These datasets often exhibit technical variations and biological differences, complicating integrative analyses. While numerous integration methods have been proposed, many fail to account for individual-level covariates or are limited to discrete variables.

Results: To address these limitations, we propose scINSIGHT2, a generalized linear latent variable model that accommodates both continuous covariates, such as age, and discrete factors, such as disease conditions. Through both simulation studies and real-data applications, we demonstrate that scINSIGHT2 accurately harmonizes scRNA-seq datasets, whether from single or multiple sources. These results highlight scINSIGHT2's utility in capturing meaningful biological insights from scRNA-seq data while accounting for individual-level variation.

Availability and implementation: The scINSIGHT2 method has been implemented as a R package, which is available at https://github.com/yudimu/scINSIGHT2/.

协调异质单细胞基因表达数据与个体水平协变量信息。
动机:单细胞RNA测序(scRNA-seq)数据的日益可用性突出了强大的整合方法的必要性,以揭示样本中共享和独特的细胞特征。这些数据集往往表现出技术差异和生物学差异,使综合分析复杂化。虽然已经提出了许多积分方法,但许多方法不能考虑个人水平的协变量或仅限于离散变量。结果:为了解决这些局限性,我们提出了scINSIGHT2,这是一个广义线性潜变量模型,可以容纳连续协变量(如年龄)和离散因素(如疾病状况)。通过仿真研究和实际数据应用,我们证明了scINSIGHT2能够准确地协调来自单个或多个来源的scRNA-seq数据集。这些结果突出了scINSIGHT2在从scRNA-seq数据中获取有意义的生物学见解时的实用性,同时考虑了个体水平的差异。可用性和实现:scINSIGHT2方法已经作为R包实现,可以在https://github.com/yudimu/scINSIGHT2/上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信