{"title":"旋风:循环对比学习整合单细胞基因表达数据。","authors":"Han Ji, Xinwei He, Hongwei Li","doi":"10.1186/s12859-025-06214-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Combining single-cell transcriptome sequencing results from several batches reduces batch effect, which improves our understanding of cellular identity and function.</p><p><strong>Results: </strong>This paper introduces CYCLONE, a new method for integrating single-cell gene expression data using a recycle contrastive learning network. The contrastive learning network and the VAE model work together to jointly train the low-dimensional representations. Additionally, they update the indices of inter-batch MNN pairs to generate positive pairs from a reduced-noise low-dimensional space. Meanwhile, CYCLONE cyclically updates the MNN pairs by iteratively training the low-dimensional space to gradually improve the confidence of the positive sample pairs, and augments the MNN pairs with KNN pairs to identify batch-specific cell types, thus avoiding the problems associated with overcorrecting for the batch effect. The performance of CYCLONE was evaluated on simulated and real scRNA-seq datasets, confirming its ability to improve clustering accuracy while successfully eliminating batch effects. In addition, experiments on batch-specific cell types identification validated CYCLONE's ability to retain batch-specific information while eliminating batch effect, thus preserving batch-specific cell types.</p><p><strong>Conclusion: </strong>CYCLONE is an effective integration method based on recycle contrastive learning that improves the accuracy of cell clustering while successfully eliminating batch effects and preserving batch-specific information.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"202"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12312599/pdf/","citationCount":"0","resultStr":"{\"title\":\"CYCLONE: recycle contrastive learning for integrating single-cell gene expression data.\",\"authors\":\"Han Ji, Xinwei He, Hongwei Li\",\"doi\":\"10.1186/s12859-025-06214-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Combining single-cell transcriptome sequencing results from several batches reduces batch effect, which improves our understanding of cellular identity and function.</p><p><strong>Results: </strong>This paper introduces CYCLONE, a new method for integrating single-cell gene expression data using a recycle contrastive learning network. The contrastive learning network and the VAE model work together to jointly train the low-dimensional representations. Additionally, they update the indices of inter-batch MNN pairs to generate positive pairs from a reduced-noise low-dimensional space. Meanwhile, CYCLONE cyclically updates the MNN pairs by iteratively training the low-dimensional space to gradually improve the confidence of the positive sample pairs, and augments the MNN pairs with KNN pairs to identify batch-specific cell types, thus avoiding the problems associated with overcorrecting for the batch effect. The performance of CYCLONE was evaluated on simulated and real scRNA-seq datasets, confirming its ability to improve clustering accuracy while successfully eliminating batch effects. In addition, experiments on batch-specific cell types identification validated CYCLONE's ability to retain batch-specific information while eliminating batch effect, thus preserving batch-specific cell types.</p><p><strong>Conclusion: </strong>CYCLONE is an effective integration method based on recycle contrastive learning that improves the accuracy of cell clustering while successfully eliminating batch effects and preserving batch-specific information.</p>\",\"PeriodicalId\":8958,\"journal\":{\"name\":\"BMC Bioinformatics\",\"volume\":\"26 1\",\"pages\":\"202\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12312599/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12859-025-06214-0\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06214-0","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
CYCLONE: recycle contrastive learning for integrating single-cell gene expression data.
Background: Combining single-cell transcriptome sequencing results from several batches reduces batch effect, which improves our understanding of cellular identity and function.
Results: This paper introduces CYCLONE, a new method for integrating single-cell gene expression data using a recycle contrastive learning network. The contrastive learning network and the VAE model work together to jointly train the low-dimensional representations. Additionally, they update the indices of inter-batch MNN pairs to generate positive pairs from a reduced-noise low-dimensional space. Meanwhile, CYCLONE cyclically updates the MNN pairs by iteratively training the low-dimensional space to gradually improve the confidence of the positive sample pairs, and augments the MNN pairs with KNN pairs to identify batch-specific cell types, thus avoiding the problems associated with overcorrecting for the batch effect. The performance of CYCLONE was evaluated on simulated and real scRNA-seq datasets, confirming its ability to improve clustering accuracy while successfully eliminating batch effects. In addition, experiments on batch-specific cell types identification validated CYCLONE's ability to retain batch-specific information while eliminating batch effect, thus preserving batch-specific cell types.
Conclusion: CYCLONE is an effective integration method based on recycle contrastive learning that improves the accuracy of cell clustering while successfully eliminating batch effects and preserving batch-specific information.
期刊介绍:
BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology.
BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.