Mahnoor N. Gondal, Saad Ur Rehman Shah, Arul M. Chinnaiyan, Marcin Cieslik
{"title":"单细胞转录组学数据库、用例和局限性系统概述","authors":"Mahnoor N. Gondal, Saad Ur Rehman Shah, Arul M. Chinnaiyan, Marcin Cieslik","doi":"arxiv-2404.10545","DOIUrl":null,"url":null,"abstract":"Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq)\ntechnologies and experimental protocols have led to the generation of vast\namounts of genomic data that populates several online databases and\nrepositories. Here, we systematically examined large-scale scRNA-seq databases,\ncategorizing them based on their scope and purpose such as general,\ntissue-specific databases, disease-specific databases, cancer-focused\ndatabases, and cell type-focused databases. Next, we discuss the technical and\nmethodological challenges associated with curating large-scale scRNA-seq\ndatabases, along with current computational solutions. We argue that\nunderstanding scRNA-seq databases, including their limitations and assumptions,\nis crucial for effectively utilizing this data to make robust discoveries and\nidentify novel biological insights. Furthermore, we propose that bridging the\ngap between computational and wet lab scientists through user-friendly\nweb-based platforms is needed for democratizing access to single-cell data.\nThese platforms would facilitate interdisciplinary research, enabling\nresearchers from various disciplines to collaborate effectively. This review\nunderscores the importance of leveraging computational approaches to unravel\nthe complexities of single-cell data and offers a promising direction for\nfuture research in the field.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"230 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Systematic Overview of Single-Cell Transcriptomics Databases, their Use cases, and Limitations\",\"authors\":\"Mahnoor N. Gondal, Saad Ur Rehman Shah, Arul M. Chinnaiyan, Marcin Cieslik\",\"doi\":\"arxiv-2404.10545\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq)\\ntechnologies and experimental protocols have led to the generation of vast\\namounts of genomic data that populates several online databases and\\nrepositories. Here, we systematically examined large-scale scRNA-seq databases,\\ncategorizing them based on their scope and purpose such as general,\\ntissue-specific databases, disease-specific databases, cancer-focused\\ndatabases, and cell type-focused databases. Next, we discuss the technical and\\nmethodological challenges associated with curating large-scale scRNA-seq\\ndatabases, along with current computational solutions. We argue that\\nunderstanding scRNA-seq databases, including their limitations and assumptions,\\nis crucial for effectively utilizing this data to make robust discoveries and\\nidentify novel biological insights. Furthermore, we propose that bridging the\\ngap between computational and wet lab scientists through user-friendly\\nweb-based platforms is needed for democratizing access to single-cell data.\\nThese platforms would facilitate interdisciplinary research, enabling\\nresearchers from various disciplines to collaborate effectively. This review\\nunderscores the importance of leveraging computational approaches to unravel\\nthe complexities of single-cell data and offers a promising direction for\\nfuture research in the field.\",\"PeriodicalId\":501070,\"journal\":{\"name\":\"arXiv - QuanBio - Genomics\",\"volume\":\"230 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Genomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2404.10545\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2404.10545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Systematic Overview of Single-Cell Transcriptomics Databases, their Use cases, and Limitations
Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq)
technologies and experimental protocols have led to the generation of vast
amounts of genomic data that populates several online databases and
repositories. Here, we systematically examined large-scale scRNA-seq databases,
categorizing them based on their scope and purpose such as general,
tissue-specific databases, disease-specific databases, cancer-focused
databases, and cell type-focused databases. Next, we discuss the technical and
methodological challenges associated with curating large-scale scRNA-seq
databases, along with current computational solutions. We argue that
understanding scRNA-seq databases, including their limitations and assumptions,
is crucial for effectively utilizing this data to make robust discoveries and
identify novel biological insights. Furthermore, we propose that bridging the
gap between computational and wet lab scientists through user-friendly
web-based platforms is needed for democratizing access to single-cell data.
These platforms would facilitate interdisciplinary research, enabling
researchers from various disciplines to collaborate effectively. This review
underscores the importance of leveraging computational approaches to unravel
the complexities of single-cell data and offers a promising direction for
future research in the field.