Bang Tran, Quyen Nguyen, Sangam Shrestha, Tin Nguyen
{"title":"scIDS: Single-cell Imputation by combining Deep autoencoder neural networks and Subspace regression","authors":"Bang Tran, Quyen Nguyen, Sangam Shrestha, Tin Nguyen","doi":"10.1109/KSE53942.2021.9648664","DOIUrl":null,"url":null,"abstract":"Single-cell RNA-sequencing (scRNA-seq) has emerged as a powerful high throughput technique that enables the characterization of transcriptomic profiles at single-cell resolution. However, scRNA-seq data has an excess number of zeros in expressed genes due to a low amount of sequenced mRNA in each cell. This missing detection in a portion of mRNA molecules (dropout) presents a fundamental challenge for various types of data analyses. Here we introduce scIDS, a novel imputation method that is a combination of deep autoencoder neural networks and subspace regression to reliably recover the missing values in scRNA-seq data. We compare scIDS with two widely used methods using eight datasets. Extensive experiments demonstrate that scIDS outperforms existing approaches in improving the identification of cell populations while preserving the biological landscape.","PeriodicalId":130986,"journal":{"name":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE53942.2021.9648664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Single-cell RNA-sequencing (scRNA-seq) has emerged as a powerful high throughput technique that enables the characterization of transcriptomic profiles at single-cell resolution. However, scRNA-seq data has an excess number of zeros in expressed genes due to a low amount of sequenced mRNA in each cell. This missing detection in a portion of mRNA molecules (dropout) presents a fundamental challenge for various types of data analyses. Here we introduce scIDS, a novel imputation method that is a combination of deep autoencoder neural networks and subspace regression to reliably recover the missing values in scRNA-seq data. We compare scIDS with two widely used methods using eight datasets. Extensive experiments demonstrate that scIDS outperforms existing approaches in improving the identification of cell populations while preserving the biological landscape.