Deep multi-kernel cell clustering for single-cell RNA sequencing data

IF 3.7 3区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY
Maoxuan Yao , Lina Ren
{"title":"Deep multi-kernel cell clustering for single-cell RNA sequencing data","authors":"Maoxuan Yao ,&nbsp;Lina Ren","doi":"10.1016/j.bej.2025.109877","DOIUrl":null,"url":null,"abstract":"<div><div>Although existing deep learning methods for single-cell RNA sequencing (scRNA-seq) data can handle high-dimensional data and extract complex features, they also have the issue that the learned representations do not consider the clustering structure, leading to difficulties in linear separation. To address this, we propose a deep multi-kernel cell clustering network for scRNA-seq data, termed scDMKC, to cluster scRNA-seq data by simultaneously optimizing the multi-kernel representation of scRNA-seq data and the cell partitioning, enhancing the cell clustering performance by leveraging the strengths of multi-kernel learning and deep learning in capturing linearly separable data structures. To effectively learn the multi-kernel representation of scRNA-seq data, we propose a multi-kernel representation learner that adaptively selects an appropriate combination of multiple kernels to map the hidden representations of the data into the kernel space, thereby capturing the underlying linearly separable structure of the scRNA-seq data. In order to jointly optimize the multi-kernel representation of scRNA-seq data and the cell partitioning, a ZINB-based self-supervised strategy is developed. This strategy not only enhances the linear separability of the learned kernel representations but also improves clustering performance. Extensive experiments on various real scRNA-seq datasets were conducted. The experimental results indicate that our proposed scDMKC model significantly outperforms most of the existing cell clustering methods.</div></div>","PeriodicalId":8766,"journal":{"name":"Biochemical Engineering Journal","volume":"223 ","pages":"Article 109877"},"PeriodicalIF":3.7000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biochemical Engineering Journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1369703X25002517","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Although existing deep learning methods for single-cell RNA sequencing (scRNA-seq) data can handle high-dimensional data and extract complex features, they also have the issue that the learned representations do not consider the clustering structure, leading to difficulties in linear separation. To address this, we propose a deep multi-kernel cell clustering network for scRNA-seq data, termed scDMKC, to cluster scRNA-seq data by simultaneously optimizing the multi-kernel representation of scRNA-seq data and the cell partitioning, enhancing the cell clustering performance by leveraging the strengths of multi-kernel learning and deep learning in capturing linearly separable data structures. To effectively learn the multi-kernel representation of scRNA-seq data, we propose a multi-kernel representation learner that adaptively selects an appropriate combination of multiple kernels to map the hidden representations of the data into the kernel space, thereby capturing the underlying linearly separable structure of the scRNA-seq data. In order to jointly optimize the multi-kernel representation of scRNA-seq data and the cell partitioning, a ZINB-based self-supervised strategy is developed. This strategy not only enhances the linear separability of the learned kernel representations but also improves clustering performance. Extensive experiments on various real scRNA-seq datasets were conducted. The experimental results indicate that our proposed scDMKC model significantly outperforms most of the existing cell clustering methods.
单细胞RNA测序数据的深度多核细胞聚类
现有的单细胞RNA测序(scRNA-seq)数据深度学习方法虽然可以处理高维数据,提取复杂特征,但也存在学习到的表示没有考虑聚类结构,导致线性分离困难的问题。为了解决这个问题,我们提出了一个scDMKC的scRNA-seq数据深度多核细胞聚类网络,通过同时优化scRNA-seq数据的多核表示和细胞划分来聚类scRNA-seq数据,通过利用多核学习和深度学习在捕获线性可分数据结构方面的优势来提高细胞聚类性能。为了有效地学习scRNA-seq数据的多核表示,我们提出了一个多核表示学习器,它自适应地选择多个核的适当组合,将数据的隐藏表示映射到核空间中,从而捕获scRNA-seq数据的潜在线性可分结构。为了联合优化scRNA-seq数据的多核表示和细胞划分,提出了一种基于zinb的自监督策略。该策略不仅提高了学习到的核表示的线性可分性,而且提高了聚类性能。在各种真实的scRNA-seq数据集上进行了广泛的实验。实验结果表明,我们提出的scDMKC模型显著优于大多数现有的细胞聚类方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biochemical Engineering Journal
Biochemical Engineering Journal 工程技术-工程:化工
CiteScore
7.10
自引率
5.10%
发文量
380
审稿时长
34 days
期刊介绍: The Biochemical Engineering Journal aims to promote progress in the crucial chemical engineering aspects of the development of biological processes associated with everything from raw materials preparation to product recovery relevant to industries as diverse as medical/healthcare, industrial biotechnology, and environmental biotechnology. The Journal welcomes full length original research papers, short communications, and review papers* in the following research fields: Biocatalysis (enzyme or microbial) and biotransformations, including immobilized biocatalyst preparation and kinetics Biosensors and Biodevices including biofabrication and novel fuel cell development Bioseparations including scale-up and protein refolding/renaturation Environmental Bioengineering including bioconversion, bioremediation, and microbial fuel cells Bioreactor Systems including characterization, optimization and scale-up Bioresources and Biorefinery Engineering including biomass conversion, biofuels, bioenergy, and optimization Industrial Biotechnology including specialty chemicals, platform chemicals and neutraceuticals Biomaterials and Tissue Engineering including bioartificial organs, cell encapsulation, and controlled release Cell Culture Engineering (plant, animal or insect cells) including viral vectors, monoclonal antibodies, recombinant proteins, vaccines, and secondary metabolites Cell Therapies and Stem Cells including pluripotent, mesenchymal and hematopoietic stem cells; immunotherapies; tissue-specific differentiation; and cryopreservation Metabolic Engineering, Systems and Synthetic Biology including OMICS, bioinformatics, in silico biology, and metabolic flux analysis Protein Engineering including enzyme engineering and directed evolution.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信