Enhancing single-cell classification accuracy using image conversion and deep learning.

Q3 Medicine
遗传 Pub Date : 2025-03-01 DOI:10.16288/j.yczz.24-213
Bingxi Gao, Huaxuan Wu, Zhiqiang Du
{"title":"Enhancing single-cell classification accuracy using image conversion and deep learning.","authors":"Bingxi Gao, Huaxuan Wu, Zhiqiang Du","doi":"10.16288/j.yczz.24-213","DOIUrl":null,"url":null,"abstract":"<p><p>Single-cell transcriptome sequencing (scRNA-seq) is widely used in the fields of animal and plant developmental biology and important trait analysis by obtaining single-cell transcript abundance data in high throughput, which can deeply reveal cell types, subtype composition, specific gene markers and functional differences. However, scRNA-seq data are often accompanied by problems such as high noise, high dimensionality and batch effect, resulting in a large number of low-expressed genes and variants, which seriously affect the accuracy and reliability of data analysis. This not only increases the complexity of data processing, but also limits the effectiveness of feature selection and downstream analysis. Although several statistical inference and machine learning methods have been used to address these challenges, the existing methods still have limitations in cell type identification, feature selection, and batch effect correction, which are difficult to meet the needs of complex biological research. In this study, we proposes an innovative single-cell classification method, scIC (single-cell image classification), which converts scRNA-seq data into image form and combines it with deep learning techniques for cell classification. Through this image conversion, we are able to capture complex patterns in the data more efficiently, and then construct efficient classification models using convolutional neural networks (CNN) and residual networks (ResNet). After testing scRNA-seq data from four cell types (mouse skin basal cells, mouse lymphocytes, human neuronal cells, and mouse spinal cord cells), the accuracy of the classification models exceeded 94%, with the mouse skin basal cell dataset achieving a classification accuracy of 99.8% when using the ResNet50 model. These results indicate that image transformation of scRNA-seq data and combining it with deep learning techniques can significantly improve the classification accuracy, providing new ideas and effective tools for solving key challenges in single-cell data analysis. The code for this study is publicly available at: https://github.com/Bingxi-Gao/SCImageClassify.</p>","PeriodicalId":35536,"journal":{"name":"遗传","volume":"47 3","pages":"382-392"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"遗传","FirstCategoryId":"1091","ListUrlMain":"https://doi.org/10.16288/j.yczz.24-213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Single-cell transcriptome sequencing (scRNA-seq) is widely used in the fields of animal and plant developmental biology and important trait analysis by obtaining single-cell transcript abundance data in high throughput, which can deeply reveal cell types, subtype composition, specific gene markers and functional differences. However, scRNA-seq data are often accompanied by problems such as high noise, high dimensionality and batch effect, resulting in a large number of low-expressed genes and variants, which seriously affect the accuracy and reliability of data analysis. This not only increases the complexity of data processing, but also limits the effectiveness of feature selection and downstream analysis. Although several statistical inference and machine learning methods have been used to address these challenges, the existing methods still have limitations in cell type identification, feature selection, and batch effect correction, which are difficult to meet the needs of complex biological research. In this study, we proposes an innovative single-cell classification method, scIC (single-cell image classification), which converts scRNA-seq data into image form and combines it with deep learning techniques for cell classification. Through this image conversion, we are able to capture complex patterns in the data more efficiently, and then construct efficient classification models using convolutional neural networks (CNN) and residual networks (ResNet). After testing scRNA-seq data from four cell types (mouse skin basal cells, mouse lymphocytes, human neuronal cells, and mouse spinal cord cells), the accuracy of the classification models exceeded 94%, with the mouse skin basal cell dataset achieving a classification accuracy of 99.8% when using the ResNet50 model. These results indicate that image transformation of scRNA-seq data and combining it with deep learning techniques can significantly improve the classification accuracy, providing new ideas and effective tools for solving key challenges in single-cell data analysis. The code for this study is publicly available at: https://github.com/Bingxi-Gao/SCImageClassify.

利用图像转换和深度学习提高单细胞分类精度。
单细胞转录组测序(scRNA-seq)通过高通量获取单细胞转录丰度数据,可以深入揭示细胞类型、亚型组成、特定基因标记和功能差异,广泛应用于动植物发育生物学和重要性状分析领域。然而,scRNA-seq数据往往伴随着高噪声、高维数和批效应等问题,导致大量低表达基因和变异,严重影响了数据分析的准确性和可靠性。这不仅增加了数据处理的复杂性,而且限制了特征选择和下游分析的有效性。尽管已经使用了几种统计推断和机器学习方法来解决这些挑战,但现有方法在细胞类型识别、特征选择和批效应校正方面仍然存在局限性,难以满足复杂生物学研究的需要。在本研究中,我们提出了一种创新的单细胞分类方法scIC(单细胞图像分类),该方法将scRNA-seq数据转换为图像形式,并将其与深度学习技术相结合进行细胞分类。通过这种图像转换,我们能够更有效地捕获数据中的复杂模式,然后使用卷积神经网络(CNN)和残差网络(ResNet)构建高效的分类模型。在对四种细胞类型(小鼠皮肤基底细胞、小鼠淋巴细胞、人类神经元细胞和小鼠脊髓细胞)的scRNA-seq数据进行测试后,分类模型的准确率超过94%,使用ResNet50模型时,小鼠皮肤基底细胞数据集的分类准确率达到99.8%。这些结果表明,对scRNA-seq数据进行图像变换并与深度学习技术相结合可以显著提高分类精度,为解决单细胞数据分析中的关键挑战提供了新的思路和有效的工具。这项研究的代码可以在https://github.com/Bingxi-Gao/SCImageClassify上公开获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
遗传
遗传 Medicine-Medicine (all)
CiteScore
2.50
自引率
0.00%
发文量
6699
期刊介绍: Hereditas is a national academic journal sponsored by the Institute of Genetics and Developmental Biology of the Chinese Academy of Sciences and the Chinese Society of Genetics and published by Science Press. It is a Chinese core journal and a Chinese high-quality scientific journal. The journal mainly publishes innovative research papers in the fields of genetics, genomics, cell biology, developmental biology, biological evolution, genetic engineering and biotechnology; new technologies and new methods; monographs and reviews on hot issues in the discipline; academic debates and discussions; experience in genetics teaching; introductions to famous geneticists at home and abroad; genetic counseling; information on academic conferences at home and abroad, etc. Main columns: review, frontier focus, research report, technology and method, resources and platform, experimental operation guide, genetic resources, genetics teaching, scientific news, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信