基因组研究中的并行计算:进展与应用。

Q2 Biochemistry, Genetics and Molecular Biology
Advances and Applications in Bioinformatics and Chemistry Pub Date : 2015-11-13 eCollection Date: 2015-01-01 DOI:10.2147/AABC.S64482
Kary Ocaña, Daniel de Oliveira
{"title":"基因组研究中的并行计算:进展与应用。","authors":"Kary Ocaña, Daniel de Oliveira","doi":"10.2147/AABC.S64482","DOIUrl":null,"url":null,"abstract":"<p><p>Today's genomic experiments have to process the so-called \"biological big data\" that is now reaching the size of Terabytes and Petabytes. To process this huge amount of data, scientists may require weeks or months if they use their own workstations. Parallelism techniques and high-performance computing (HPC) environments can be applied for reducing the total processing time and to ease the management, treatment, and analyses of this data. However, running bioinformatics experiments in HPC environments such as clouds, grids, clusters, and graphics processing unit requires the expertise from scientists to integrate computational, biological, and mathematical techniques and technologies. Several solutions have already been proposed to allow scientists for processing their genomic experiments using HPC capabilities and parallelism techniques. This article brings a systematic review of literature that surveys the most recently published research involving genomics and parallel computing. Our objective is to gather the main characteristics, benefits, and challenges that can be considered by scientists when running their genomic experiments to benefit from parallelism techniques and HPC capabilities. </p>","PeriodicalId":53584,"journal":{"name":"Advances and Applications in Bioinformatics and Chemistry","volume":"8 ","pages":"23-35"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4655901/pdf/","citationCount":"0","resultStr":"{\"title\":\"Parallel computing in genomic research: advances and applications.\",\"authors\":\"Kary Ocaña, Daniel de Oliveira\",\"doi\":\"10.2147/AABC.S64482\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Today's genomic experiments have to process the so-called \\\"biological big data\\\" that is now reaching the size of Terabytes and Petabytes. To process this huge amount of data, scientists may require weeks or months if they use their own workstations. Parallelism techniques and high-performance computing (HPC) environments can be applied for reducing the total processing time and to ease the management, treatment, and analyses of this data. However, running bioinformatics experiments in HPC environments such as clouds, grids, clusters, and graphics processing unit requires the expertise from scientists to integrate computational, biological, and mathematical techniques and technologies. Several solutions have already been proposed to allow scientists for processing their genomic experiments using HPC capabilities and parallelism techniques. This article brings a systematic review of literature that surveys the most recently published research involving genomics and parallel computing. Our objective is to gather the main characteristics, benefits, and challenges that can be considered by scientists when running their genomic experiments to benefit from parallelism techniques and HPC capabilities. </p>\",\"PeriodicalId\":53584,\"journal\":{\"name\":\"Advances and Applications in Bioinformatics and Chemistry\",\"volume\":\"8 \",\"pages\":\"23-35\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4655901/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances and Applications in Bioinformatics and Chemistry\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2147/AABC.S64482\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2015/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances and Applications in Bioinformatics and Chemistry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2147/AABC.S64482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2015/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0

摘要

当今的基因组学实验必须处理所谓的 "生物大数据",这些数据的大小现已达到 Terabytes 和 Petabytes。要处理这些海量数据,如果科学家使用自己的工作站,可能需要数周或数月的时间。并行化技术和高性能计算(HPC)环境可以缩短总处理时间,简化数据的管理、处理和分析。然而,在云计算、网格、集群和图形处理单元等高性能计算环境中运行生物信息学实验,需要科学家具备将计算、生物和数学技术与科技相结合的专业知识。目前已经提出了几种解决方案,允许科学家利用高性能计算能力和并行技术处理基因组实验。本文系统回顾了最近发表的涉及基因组学和并行计算的研究文献。我们的目的是收集科学家在运行基因组实验时可以考虑的主要特点、优势和挑战,以便从并行技术和高性能计算能力中获益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Parallel computing in genomic research: advances and applications.

Today's genomic experiments have to process the so-called "biological big data" that is now reaching the size of Terabytes and Petabytes. To process this huge amount of data, scientists may require weeks or months if they use their own workstations. Parallelism techniques and high-performance computing (HPC) environments can be applied for reducing the total processing time and to ease the management, treatment, and analyses of this data. However, running bioinformatics experiments in HPC environments such as clouds, grids, clusters, and graphics processing unit requires the expertise from scientists to integrate computational, biological, and mathematical techniques and technologies. Several solutions have already been proposed to allow scientists for processing their genomic experiments using HPC capabilities and parallelism techniques. This article brings a systematic review of literature that surveys the most recently published research involving genomics and parallel computing. Our objective is to gather the main characteristics, benefits, and challenges that can be considered by scientists when running their genomic experiments to benefit from parallelism techniques and HPC capabilities.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advances and Applications in Bioinformatics and Chemistry
Advances and Applications in Bioinformatics and Chemistry Biochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (miscellaneous)
CiteScore
6.50
自引率
0.00%
发文量
7
审稿时长
16 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信