如何利用单细胞基因表达数据构建调控网络

Aditya Pratapa, A. Jalihal, Jeffrey N. Law, Aditya Bharadwaj, T. Murali
{"title":"如何利用单细胞基因表达数据构建调控网络","authors":"Aditya Pratapa, A. Jalihal, Jeffrey N. Law, Aditya Bharadwaj, T. Murali","doi":"10.1145/3388440.3414213","DOIUrl":null,"url":null,"abstract":"Over a dozen methods have been developed to infer gene regulatory networks (GRNs) from single-cell RNA-seq data. An experimentalist seeking to analyze a new dataset faces a daunting task in selecting an appropriate inference method since there are no widely accepted ground-truth datasets for assessing algorithm accuracy and the criteria for evaluation and comparison of methods are varied. We have developed BEELINE, a comprehensive evaluation of state-of-the-art algorithms for inferring GRNs from single-cell transcriptomic data [1]. BEELINE incorporates 12 diverse algorithms for GRN inference. It provides an easy-to-use and uniform interface to each method in the form of a Docker image. BEELINE implements several measures for estimating and comparing the accuracy, stability, and efficiency of these algorithms. Thus, BEELINE facilitates reproducible, rigorous, and extensible evaluations of GRN inference methods. We selected (a) synthetic networks with predictable cellular trajectories, (b) literature-curated Boolean models, and (c) diverse transcriptional regulatory and functional interaction networks to serve as the ground truth for evaluating the accuracy of GRN inference algorithms. We developed a strategy to simulate single-cell gene expression data from the first two types of networks. We used multiple experimental single-cell RNA-seq datasets in conjunction with the third type of network. Our evaluations suggest that the area under the precision-recall curve and early precision of these algorithms are moderate. Techniques that do not require pseudotime-ordered cells are generally more accurate. Based on these results, we present recommendations to end users of GRN inference methods. Finally, we discuss the potential of supervised algorithms for GRN inference.","PeriodicalId":411338,"journal":{"name":"Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"How to build regulatory networks from single-cell gene expression data\",\"authors\":\"Aditya Pratapa, A. Jalihal, Jeffrey N. Law, Aditya Bharadwaj, T. Murali\",\"doi\":\"10.1145/3388440.3414213\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over a dozen methods have been developed to infer gene regulatory networks (GRNs) from single-cell RNA-seq data. An experimentalist seeking to analyze a new dataset faces a daunting task in selecting an appropriate inference method since there are no widely accepted ground-truth datasets for assessing algorithm accuracy and the criteria for evaluation and comparison of methods are varied. We have developed BEELINE, a comprehensive evaluation of state-of-the-art algorithms for inferring GRNs from single-cell transcriptomic data [1]. BEELINE incorporates 12 diverse algorithms for GRN inference. It provides an easy-to-use and uniform interface to each method in the form of a Docker image. BEELINE implements several measures for estimating and comparing the accuracy, stability, and efficiency of these algorithms. Thus, BEELINE facilitates reproducible, rigorous, and extensible evaluations of GRN inference methods. We selected (a) synthetic networks with predictable cellular trajectories, (b) literature-curated Boolean models, and (c) diverse transcriptional regulatory and functional interaction networks to serve as the ground truth for evaluating the accuracy of GRN inference algorithms. We developed a strategy to simulate single-cell gene expression data from the first two types of networks. We used multiple experimental single-cell RNA-seq datasets in conjunction with the third type of network. Our evaluations suggest that the area under the precision-recall curve and early precision of these algorithms are moderate. Techniques that do not require pseudotime-ordered cells are generally more accurate. Based on these results, we present recommendations to end users of GRN inference methods. Finally, we discuss the potential of supervised algorithms for GRN inference.\",\"PeriodicalId\":411338,\"journal\":{\"name\":\"Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3388440.3414213\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3388440.3414213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

从单细胞RNA-seq数据中推断基因调控网络(grn)的方法已经发展了十几种。由于没有广泛接受的用于评估算法准确性的基础真值数据集,并且评估和比较方法的标准各不相同,因此,寻求分析新数据集的实验者在选择适当的推理方法方面面临着艰巨的任务。我们开发了BEELINE,这是一种从单细胞转录组数据推断grn的最先进算法的综合评估[1]。BEELINE结合了12种不同的GRN推理算法。它以Docker映像的形式为每个方法提供了一个易于使用和统一的接口。BEELINE实现了几种方法来评估和比较这些算法的精度、稳定性和效率。因此,BEELINE有助于对GRN推理方法进行可重复、严格和可扩展的评估。我们选择(a)具有可预测细胞轨迹的合成网络,(b)文献策划的布尔模型,以及(c)多种转录调控和功能相互作用网络作为评估GRN推理算法准确性的基础事实。我们开发了一种策略来模拟来自前两种类型网络的单细胞基因表达数据。我们将多个实验性单细胞RNA-seq数据集与第三种类型的网络结合使用。我们的评估表明,这些算法的精确召回曲线下的面积和早期精度是中等的。不需要伪时间顺序单元格的技术通常更准确。基于这些结果,我们对GRN推理方法的最终用户提出了建议。最后,我们讨论了监督算法在GRN推理中的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
How to build regulatory networks from single-cell gene expression data
Over a dozen methods have been developed to infer gene regulatory networks (GRNs) from single-cell RNA-seq data. An experimentalist seeking to analyze a new dataset faces a daunting task in selecting an appropriate inference method since there are no widely accepted ground-truth datasets for assessing algorithm accuracy and the criteria for evaluation and comparison of methods are varied. We have developed BEELINE, a comprehensive evaluation of state-of-the-art algorithms for inferring GRNs from single-cell transcriptomic data [1]. BEELINE incorporates 12 diverse algorithms for GRN inference. It provides an easy-to-use and uniform interface to each method in the form of a Docker image. BEELINE implements several measures for estimating and comparing the accuracy, stability, and efficiency of these algorithms. Thus, BEELINE facilitates reproducible, rigorous, and extensible evaluations of GRN inference methods. We selected (a) synthetic networks with predictable cellular trajectories, (b) literature-curated Boolean models, and (c) diverse transcriptional regulatory and functional interaction networks to serve as the ground truth for evaluating the accuracy of GRN inference algorithms. We developed a strategy to simulate single-cell gene expression data from the first two types of networks. We used multiple experimental single-cell RNA-seq datasets in conjunction with the third type of network. Our evaluations suggest that the area under the precision-recall curve and early precision of these algorithms are moderate. Techniques that do not require pseudotime-ordered cells are generally more accurate. Based on these results, we present recommendations to end users of GRN inference methods. Finally, we discuss the potential of supervised algorithms for GRN inference.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信