{"title":"A Comparison of Supervised Classification Methods for Classifying Biological Cell Types","authors":"Yuning Chen, Abhishek Hemlani, Song Zheng","doi":"10.1145/3512452.3512456","DOIUrl":null,"url":null,"abstract":"The advent of single-cell RNA sequencing has enabled researchers to characterize specific cell types and learn more about their functional and pathological roles. Therefore, correctly classifying these cells into cell types is essential. Past research has focused on developing robust methods to classify cell types, hierarchies, and annotations, focusing on generalizability. In this article, different reduction methods and classifiers on a scRNA-seq dataset are evaluated, and the classifiers are k-nearest neighbors (KNN), neural network (NN), AdaBoost, and support vector machines (SVM). Optimizing SVM with cross-validation will also be discussed. Results showed that a Principal Component Analysis-reduced dataset and a Support Vector Machine (SVM) with a linear kernel performed better than others. This article offers potential directions for selecting classifiers and reduction methods to work with biological data and additional analytical insights.","PeriodicalId":120446,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computational Biology and Bioinformatics","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 5th International Conference on Computational Biology and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3512452.3512456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The advent of single-cell RNA sequencing has enabled researchers to characterize specific cell types and learn more about their functional and pathological roles. Therefore, correctly classifying these cells into cell types is essential. Past research has focused on developing robust methods to classify cell types, hierarchies, and annotations, focusing on generalizability. In this article, different reduction methods and classifiers on a scRNA-seq dataset are evaluated, and the classifiers are k-nearest neighbors (KNN), neural network (NN), AdaBoost, and support vector machines (SVM). Optimizing SVM with cross-validation will also be discussed. Results showed that a Principal Component Analysis-reduced dataset and a Support Vector Machine (SVM) with a linear kernel performed better than others. This article offers potential directions for selecting classifiers and reduction methods to work with biological data and additional analytical insights.