{"title":"An Advanced NMF-Based Approach for Single Cell Data Clustering","authors":"Peng Zhao, Yongpan Sheng, Xiaohui Zhan","doi":"10.1109/icicse55337.2022.9828919","DOIUrl":null,"url":null,"abstract":"Single-cell RNA sequencing (scRNA-seq) provides transcriptomic profiling for individual cells, allowing researchers to study the heterogeneity of tissues, recognize rare cell identities and discover new cellular subtypes. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, The performance of existing single-cell clustering methods is extremely sensitive to the presence of noise data and outliers. Nevertheless, there is still no consensus on the best performing method. To address this issue, we utilize an advanced NMF for scRNA-seq data clustering based on soft self-paced learning (S3NMF). We will gradually add cells from simple to complex to our model until the model converges. In this way, the influence of noisy data and outliers can be significantly reduced. The proposed method achieves the best performance on both simulation data and real scRNA-seq data.","PeriodicalId":177985,"journal":{"name":"2022 IEEE 2nd International Conference on Information Communication and Software Engineering (ICICSE)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 2nd International Conference on Information Communication and Software Engineering (ICICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icicse55337.2022.9828919","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Single-cell RNA sequencing (scRNA-seq) provides transcriptomic profiling for individual cells, allowing researchers to study the heterogeneity of tissues, recognize rare cell identities and discover new cellular subtypes. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, The performance of existing single-cell clustering methods is extremely sensitive to the presence of noise data and outliers. Nevertheless, there is still no consensus on the best performing method. To address this issue, we utilize an advanced NMF for scRNA-seq data clustering based on soft self-paced learning (S3NMF). We will gradually add cells from simple to complex to our model until the model converges. In this way, the influence of noisy data and outliers can be significantly reduced. The proposed method achieves the best performance on both simulation data and real scRNA-seq data.