{"title":"scCAM:基于残差网络和层类激活图对scRNA-seq数据进行注释并识别注释相关基因的一种可解释的方法","authors":"Ya Zhang , Yongzhao Du , Yuqing Fu","doi":"10.1016/j.genrep.2025.102297","DOIUrl":null,"url":null,"abstract":"<div><div>Single-cell RNA sequencing (scRNA-seq) has been widely used to explore gene expression and cellular heterogeneity. Cell type annotation is a crucial step in the scRNA-seq data analysis. Recently, several deep learning methods have been developed for cell annotation. However, most existing methods lack biological explainability and fail to discover key genes related to annotation. Therefore, we propose an explainable automatic cell annotation method: scCAM. Our method combines residual networks and layer class activation maps, constructs grayscale images to represent gene expression, and utilizes backward class-specific gradients and the spatial location importance to explore the cell annotation decision-making processes and discover annotation-related genes. We performed experiments on benchmark datasets in multiple situations, the experimental results show that scCAM outperforms other state-of-the-art methods, especially on the large-scale dataset, exceeds other methods by 6.4 %, 18 %, 39.2 %, 16.5 % and 9.8 % on the accuracy, respectively. Explainable analysis on the Segerstolpe pancreas dataset successfully identifies annotation-related genes including marker genes and differentially expressed genes, provides reference and support for the discovery of new marker genes. The source code of scCAM is available at <span><span>https://github.com/zhangya10956/scCAM-cell-anno</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":12673,"journal":{"name":"Gene Reports","volume":"41 ","pages":"Article 102297"},"PeriodicalIF":0.9000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"scCAM: An explainable method to annotate scRNA-seq data and identify annotation-related genes based on residual network and layer class activation maps\",\"authors\":\"Ya Zhang , Yongzhao Du , Yuqing Fu\",\"doi\":\"10.1016/j.genrep.2025.102297\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Single-cell RNA sequencing (scRNA-seq) has been widely used to explore gene expression and cellular heterogeneity. Cell type annotation is a crucial step in the scRNA-seq data analysis. Recently, several deep learning methods have been developed for cell annotation. However, most existing methods lack biological explainability and fail to discover key genes related to annotation. Therefore, we propose an explainable automatic cell annotation method: scCAM. Our method combines residual networks and layer class activation maps, constructs grayscale images to represent gene expression, and utilizes backward class-specific gradients and the spatial location importance to explore the cell annotation decision-making processes and discover annotation-related genes. We performed experiments on benchmark datasets in multiple situations, the experimental results show that scCAM outperforms other state-of-the-art methods, especially on the large-scale dataset, exceeds other methods by 6.4 %, 18 %, 39.2 %, 16.5 % and 9.8 % on the accuracy, respectively. Explainable analysis on the Segerstolpe pancreas dataset successfully identifies annotation-related genes including marker genes and differentially expressed genes, provides reference and support for the discovery of new marker genes. The source code of scCAM is available at <span><span>https://github.com/zhangya10956/scCAM-cell-anno</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":12673,\"journal\":{\"name\":\"Gene Reports\",\"volume\":\"41 \",\"pages\":\"Article 102297\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2025-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Gene Reports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2452014425001700\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gene Reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452014425001700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
scCAM: An explainable method to annotate scRNA-seq data and identify annotation-related genes based on residual network and layer class activation maps
Single-cell RNA sequencing (scRNA-seq) has been widely used to explore gene expression and cellular heterogeneity. Cell type annotation is a crucial step in the scRNA-seq data analysis. Recently, several deep learning methods have been developed for cell annotation. However, most existing methods lack biological explainability and fail to discover key genes related to annotation. Therefore, we propose an explainable automatic cell annotation method: scCAM. Our method combines residual networks and layer class activation maps, constructs grayscale images to represent gene expression, and utilizes backward class-specific gradients and the spatial location importance to explore the cell annotation decision-making processes and discover annotation-related genes. We performed experiments on benchmark datasets in multiple situations, the experimental results show that scCAM outperforms other state-of-the-art methods, especially on the large-scale dataset, exceeds other methods by 6.4 %, 18 %, 39.2 %, 16.5 % and 9.8 % on the accuracy, respectively. Explainable analysis on the Segerstolpe pancreas dataset successfully identifies annotation-related genes including marker genes and differentially expressed genes, provides reference and support for the discovery of new marker genes. The source code of scCAM is available at https://github.com/zhangya10956/scCAM-cell-anno.
Gene ReportsBiochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.30
自引率
7.70%
发文量
246
审稿时长
49 days
期刊介绍:
Gene Reports publishes papers that focus on the regulation, expression, function and evolution of genes in all biological contexts, including all prokaryotic and eukaryotic organisms, as well as viruses. Gene Reports strives to be a very diverse journal and topics in all fields will be considered for publication. Although not limited to the following, some general topics include: DNA Organization, Replication & Evolution -Focus on genomic DNA (chromosomal organization, comparative genomics, DNA replication, DNA repair, mobile DNA, mitochondrial DNA, chloroplast DNA). Expression & Function - Focus on functional RNAs (microRNAs, tRNAs, rRNAs, mRNA splicing, alternative polyadenylation) Regulation - Focus on processes that mediate gene-read out (epigenetics, chromatin, histone code, transcription, translation, protein degradation). Cell Signaling - Focus on mechanisms that control information flow into the nucleus to control gene expression (kinase and phosphatase pathways controlled by extra-cellular ligands, Wnt, Notch, TGFbeta/BMPs, FGFs, IGFs etc.) Profiling of gene expression and genetic variation - Focus on high throughput approaches (e.g., DeepSeq, ChIP-Seq, Affymetrix microarrays, proteomics) that define gene regulatory circuitry, molecular pathways and protein/protein networks. Genetics - Focus on development in model organisms (e.g., mouse, frog, fruit fly, worm), human genetic variation, population genetics, as well as agricultural and veterinary genetics. Molecular Pathology & Regenerative Medicine - Focus on the deregulation of molecular processes in human diseases and mechanisms supporting regeneration of tissues through pluripotent or multipotent stem cells.