{"title":"Deep multi-kernel cell clustering for single-cell RNA sequencing data","authors":"Maoxuan Yao , Lina Ren","doi":"10.1016/j.bej.2025.109877","DOIUrl":null,"url":null,"abstract":"<div><div>Although existing deep learning methods for single-cell RNA sequencing (scRNA-seq) data can handle high-dimensional data and extract complex features, they also have the issue that the learned representations do not consider the clustering structure, leading to difficulties in linear separation. To address this, we propose a deep multi-kernel cell clustering network for scRNA-seq data, termed scDMKC, to cluster scRNA-seq data by simultaneously optimizing the multi-kernel representation of scRNA-seq data and the cell partitioning, enhancing the cell clustering performance by leveraging the strengths of multi-kernel learning and deep learning in capturing linearly separable data structures. To effectively learn the multi-kernel representation of scRNA-seq data, we propose a multi-kernel representation learner that adaptively selects an appropriate combination of multiple kernels to map the hidden representations of the data into the kernel space, thereby capturing the underlying linearly separable structure of the scRNA-seq data. In order to jointly optimize the multi-kernel representation of scRNA-seq data and the cell partitioning, a ZINB-based self-supervised strategy is developed. This strategy not only enhances the linear separability of the learned kernel representations but also improves clustering performance. Extensive experiments on various real scRNA-seq datasets were conducted. The experimental results indicate that our proposed scDMKC model significantly outperforms most of the existing cell clustering methods.</div></div>","PeriodicalId":8766,"journal":{"name":"Biochemical Engineering Journal","volume":"223 ","pages":"Article 109877"},"PeriodicalIF":3.7000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biochemical Engineering Journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1369703X25002517","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Although existing deep learning methods for single-cell RNA sequencing (scRNA-seq) data can handle high-dimensional data and extract complex features, they also have the issue that the learned representations do not consider the clustering structure, leading to difficulties in linear separation. To address this, we propose a deep multi-kernel cell clustering network for scRNA-seq data, termed scDMKC, to cluster scRNA-seq data by simultaneously optimizing the multi-kernel representation of scRNA-seq data and the cell partitioning, enhancing the cell clustering performance by leveraging the strengths of multi-kernel learning and deep learning in capturing linearly separable data structures. To effectively learn the multi-kernel representation of scRNA-seq data, we propose a multi-kernel representation learner that adaptively selects an appropriate combination of multiple kernels to map the hidden representations of the data into the kernel space, thereby capturing the underlying linearly separable structure of the scRNA-seq data. In order to jointly optimize the multi-kernel representation of scRNA-seq data and the cell partitioning, a ZINB-based self-supervised strategy is developed. This strategy not only enhances the linear separability of the learned kernel representations but also improves clustering performance. Extensive experiments on various real scRNA-seq datasets were conducted. The experimental results indicate that our proposed scDMKC model significantly outperforms most of the existing cell clustering methods.
期刊介绍:
The Biochemical Engineering Journal aims to promote progress in the crucial chemical engineering aspects of the development of biological processes associated with everything from raw materials preparation to product recovery relevant to industries as diverse as medical/healthcare, industrial biotechnology, and environmental biotechnology.
The Journal welcomes full length original research papers, short communications, and review papers* in the following research fields:
Biocatalysis (enzyme or microbial) and biotransformations, including immobilized biocatalyst preparation and kinetics
Biosensors and Biodevices including biofabrication and novel fuel cell development
Bioseparations including scale-up and protein refolding/renaturation
Environmental Bioengineering including bioconversion, bioremediation, and microbial fuel cells
Bioreactor Systems including characterization, optimization and scale-up
Bioresources and Biorefinery Engineering including biomass conversion, biofuels, bioenergy, and optimization
Industrial Biotechnology including specialty chemicals, platform chemicals and neutraceuticals
Biomaterials and Tissue Engineering including bioartificial organs, cell encapsulation, and controlled release
Cell Culture Engineering (plant, animal or insect cells) including viral vectors, monoclonal antibodies, recombinant proteins, vaccines, and secondary metabolites
Cell Therapies and Stem Cells including pluripotent, mesenchymal and hematopoietic stem cells; immunotherapies; tissue-specific differentiation; and cryopreservation
Metabolic Engineering, Systems and Synthetic Biology including OMICS, bioinformatics, in silico biology, and metabolic flux analysis
Protein Engineering including enzyme engineering and directed evolution.