{"title":"Less is more: improving cell-type identification with augmentation-free single-cell RNA-Seq contrastive learning.","authors":"Ibrahim Alsaggaf, Daniel Buchan, Cen Wan","doi":"10.1093/bioinformatics/btaf437","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Cell-type identification is one of the most important tasks in single-cell RNA Sequencing (scRNA-Seq) analysis. Recent research has revealed contrastive learning's great potential in handling multiple cell-type identification tasks.</p><p><strong>Results: </strong>In this work, we proposed a novel augmentation-free scRNA-Seq contrastive learning (AF-RCL) algorithm, which simplifies the conventional data augmentation operation and adopts a new contrastive learning loss function. A large-scale empirical evaluation suggests that AF-RCL not only outperformed other contrastive learning-based cell-type identification methods but also obtained state-of-the-art predictive performance compared with other well-known cell-type identification methods. Further analysis also shows AF-RCL's advantages in learning high-quality discriminative feature representations based on scRNA-Seq expression profiles.</p><p><strong>Availability and implementation: </strong>The source code is available at https://doi.org/10.6084/m9.figshare.28830311.v1 and at https://github.com/ibrahimsaggaf/AFRCL. The pre-trained AF-RCL encoders can be downloaded from https://doi.org/10.5281/zenodo.15109736, and the scRNA-Seq datasets used in this work can be downloaded from https://doi.org/10.5281/zenodo.8087611.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12417077/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf437","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: Cell-type identification is one of the most important tasks in single-cell RNA Sequencing (scRNA-Seq) analysis. Recent research has revealed contrastive learning's great potential in handling multiple cell-type identification tasks.
Results: In this work, we proposed a novel augmentation-free scRNA-Seq contrastive learning (AF-RCL) algorithm, which simplifies the conventional data augmentation operation and adopts a new contrastive learning loss function. A large-scale empirical evaluation suggests that AF-RCL not only outperformed other contrastive learning-based cell-type identification methods but also obtained state-of-the-art predictive performance compared with other well-known cell-type identification methods. Further analysis also shows AF-RCL's advantages in learning high-quality discriminative feature representations based on scRNA-Seq expression profiles.
Availability and implementation: The source code is available at https://doi.org/10.6084/m9.figshare.28830311.v1 and at https://github.com/ibrahimsaggaf/AFRCL. The pre-trained AF-RCL encoders can be downloaded from https://doi.org/10.5281/zenodo.15109736, and the scRNA-Seq datasets used in this work can be downloaded from https://doi.org/10.5281/zenodo.8087611.