A systematic overview of single-cell transcriptomics databases, their use cases, and limitations.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics Pub Date : 2024-07-08 eCollection Date: 2024-01-01 DOI:10.3389/fbinf.2024.1417428

Mahnoor N Gondal, Saad Ur Rehman Shah, Arul M Chinnaiyan, Marcin Cieslik

{"title":"A systematic overview of single-cell transcriptomics databases, their use cases, and limitations.","authors":"Mahnoor N Gondal, Saad Ur Rehman Shah, Arul M Chinnaiyan, Marcin Cieslik","doi":"10.3389/fbinf.2024.1417428","DOIUrl":null,"url":null,"abstract":"<p><p>Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq) technologies and experimental protocols have led to the generation of vast amounts of transcriptomic data that populates several online databases and repositories. Here, we systematically examined large-scale scRNA-seq databases, categorizing them based on their scope and purpose such as general, tissue-specific databases, disease-specific databases, cancer-focused databases, and cell type-focused databases. Next, we discuss the technical and methodological challenges associated with curating large-scale scRNA-seq databases, along with current computational solutions. We argue that understanding scRNA-seq databases, including their limitations and assumptions, is crucial for effectively utilizing this data to make robust discoveries and identify novel biological insights. Such platforms can help bridge the gap between computational and wet lab scientists through user-friendly web-based interfaces needed for democratizing access to single-cell data. These platforms would facilitate interdisciplinary research, enabling researchers from various disciplines to collaborate effectively. This review underscores the importance of leveraging computational approaches to unravel the complexities of single-cell data and offers a promising direction for future research in the field.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1417428"},"PeriodicalIF":2.8000,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11260681/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2024.1417428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq) technologies and experimental protocols have led to the generation of vast amounts of transcriptomic data that populates several online databases and repositories. Here, we systematically examined large-scale scRNA-seq databases, categorizing them based on their scope and purpose such as general, tissue-specific databases, disease-specific databases, cancer-focused databases, and cell type-focused databases. Next, we discuss the technical and methodological challenges associated with curating large-scale scRNA-seq databases, along with current computational solutions. We argue that understanding scRNA-seq databases, including their limitations and assumptions, is crucial for effectively utilizing this data to make robust discoveries and identify novel biological insights. Such platforms can help bridge the gap between computational and wet lab scientists through user-friendly web-based interfaces needed for democratizing access to single-cell data. These platforms would facilitate interdisciplinary research, enabling researchers from various disciplines to collaborate effectively. This review underscores the importance of leveraging computational approaches to unravel the complexities of single-cell data and offers a promising direction for future research in the field.

查看原文本刊更多论文

系统概述单细胞转录组学数据库、其用例和局限性。

高通量单细胞RNA-seq（scRNA-seq）技术和实验方案的快速发展产生了大量转录组数据，这些数据充斥着多个在线数据库和资料库。在此，我们系统地研究了大规模 scRNA-seq 数据库，并根据其范围和目的进行了分类，如通用数据库、组织特异性数据库、疾病特异性数据库、癌症数据库和细胞类型数据库。接下来，我们讨论了与整理大规模 scRNA-seq 数据库相关的技术和方法挑战，以及当前的计算解决方案。我们认为，了解 scRNA-seq 数据库，包括其局限性和假设，对于有效利用这些数据进行有力的发现和确定新的生物学见解至关重要。这些平台可以通过用户友好的网络界面，帮助弥合计算科学家和湿实验室科学家之间的差距，从而实现单细胞数据访问的民主化。这些平台将促进跨学科研究，使来自不同学科的研究人员能够有效合作。这篇综述强调了利用计算方法揭示单细胞数据复杂性的重要性，并为该领域未来的研究指明了方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Frontiers in bioinformatics

CiteScore

2.60

自引率

0.00%

发文量