Comparison of different machine learning algorithms on Cell Classification with scRNA-seq after Principal Component Analysis

2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP) Pub Date : 2022-04-15 DOI:10.1109/ICSP54964.2022.9778439

Jingkai Guo, Jing Gao

引用次数: 0

Abstract

This project did the process of the Single-cell RNA sequencing data (scRNA-seq) to predict the cell type. Researchers iterated the currently commonly used machine learning algorithm to form predict training models from an extensive dataset. To begin with, researchers executed the principal component analysis (PCA) to reduce the dataset sample dimension. Furthermore, four other different algorithms were constructed in this classification process in each iteration: logistic regression (LR), k nearest neighbor (kNN), supporting vector machine (SVM). In addition, this work applied boosting methods to the decision tree algorithm. Finally, the best approach for listing testing models above is the PCA for dimensional reduction and logistic regression as the classifier. The accuracy is 54.4% for testing data.

查看原文本刊更多论文

主成分分析后不同机器学习算法与scRNA-seq细胞分类的比较

本项目对单细胞RNA测序数据(scRNA-seq)进行处理，预测细胞类型。研究人员迭代了目前常用的机器学习算法，从广泛的数据集中形成预测训练模型。首先，研究人员执行主成分分析(PCA)来降低数据集样本维度。此外，在每次迭代中，在该分类过程中构建了其他四种不同的算法:逻辑回归(LR)， k近邻(kNN)，支持向量机(SVM)。此外，本文还将增强方法应用于决策树算法。最后，列出上述测试模型的最佳方法是PCA降维和逻辑回归作为分类器。测试数据的准确率为54.4%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP)

自引率

0.00%

发文量