基于EllPACK-R稀疏数据格式的生物信息学快速并行马尔可夫聚类GPU实现

A. Bustamam, K. Burrage, N. Hamilton
{"title":"基于EllPACK-R稀疏数据格式的生物信息学快速并行马尔可夫聚类GPU实现","authors":"A. Bustamam, K. Burrage, N. Hamilton","doi":"10.1109/ACT.2010.10","DOIUrl":null,"url":null,"abstract":"The massively parallel computing using graphical processing unit (GPU), which based on tens of thousands of parallel threats within hundreds of GPU’s streaming processors, has gained broad popularity and attracted researchers in a wide range of application areas from finance, computer aided engineering, computational fluid dynamics, game physics, numerics, science, medical imaging, life science, and so on, including molecular biology and bioinformatics. Meanwhile, Markov clustering algorithm (MCl) has become one of the most effective and highly cited methods to detect and analyze the communities/clusters within an interaction network dataset on many real world problems such us social, technological, or biological networks including protein-protein interaction networks. However, as the dataset become bigger and bigger, the computation time of MCl algorithm become slower and slower. Hence, GPU computing is an interesting and challenging alternative to attempt to improve the MCl performance. In this poster paper we introduce our improvement of MCl performance based on EllPACK-R sparse dataset format using GPU computing with the Compute Unified Device Architecture tool (CUDA) from NVIDIA (called CUDA-MCl). As the results show the significant improvement in CUDA-MCl performance and with the low-cost and widely available GPU devices in the market today, this CUDA-MCl implementation is allowing large-scale parallel computation on off-the-shelf desktop machines. Moreover the GPU computing approaches potentially may contribute to significantly change the way bioinformaticians and biologists compute and interact with their data.","PeriodicalId":147311,"journal":{"name":"2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"A GPU Implementation of Fast Parallel Markov Clustering in Bioinformatics Using EllPACK-R Sparse Data Format\",\"authors\":\"A. Bustamam, K. Burrage, N. Hamilton\",\"doi\":\"10.1109/ACT.2010.10\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The massively parallel computing using graphical processing unit (GPU), which based on tens of thousands of parallel threats within hundreds of GPU’s streaming processors, has gained broad popularity and attracted researchers in a wide range of application areas from finance, computer aided engineering, computational fluid dynamics, game physics, numerics, science, medical imaging, life science, and so on, including molecular biology and bioinformatics. Meanwhile, Markov clustering algorithm (MCl) has become one of the most effective and highly cited methods to detect and analyze the communities/clusters within an interaction network dataset on many real world problems such us social, technological, or biological networks including protein-protein interaction networks. However, as the dataset become bigger and bigger, the computation time of MCl algorithm become slower and slower. Hence, GPU computing is an interesting and challenging alternative to attempt to improve the MCl performance. In this poster paper we introduce our improvement of MCl performance based on EllPACK-R sparse dataset format using GPU computing with the Compute Unified Device Architecture tool (CUDA) from NVIDIA (called CUDA-MCl). As the results show the significant improvement in CUDA-MCl performance and with the low-cost and widely available GPU devices in the market today, this CUDA-MCl implementation is allowing large-scale parallel computation on off-the-shelf desktop machines. Moreover the GPU computing approaches potentially may contribute to significantly change the way bioinformaticians and biologists compute and interact with their data.\",\"PeriodicalId\":147311,\"journal\":{\"name\":\"2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACT.2010.10\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACT.2010.10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

利用图形处理单元(GPU)的大规模并行计算,基于数百个GPU流处理器中的数万个并行威胁,得到了广泛的普及,并吸引了包括分子生物学和生物信息学在内的金融、计算机辅助工程、计算流体动力学、游戏物理、数字、科学、医学成像、生命科学等广泛应用领域的研究人员。同时,马尔可夫聚类算法(MCl)已经成为检测和分析交互网络数据集中的社区/聚类的最有效和被引用最多的方法之一,用于许多现实世界的问题,如社会、技术或生物网络(包括蛋白质-蛋白质相互作用网络)。然而,随着数据集越来越大,MCl算法的计算速度越来越慢。因此,GPU计算是尝试提高MCl性能的有趣且具有挑战性的替代方案。在这篇海报论文中,我们介绍了基于EllPACK-R稀疏数据集格式的MCl性能改进,使用GPU计算和NVIDIA的计算统一设备架构工具(CUDA)(称为CUDA-MCl)。由于结果显示CUDA-MCl性能的显着改善,以及当今市场上低成本和广泛可用的GPU设备,这种CUDA-MCl实现允许在现成的台式计算机上进行大规模并行计算。此外,GPU计算方法可能会显著改变生物信息学家和生物学家计算数据和与数据交互的方式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A GPU Implementation of Fast Parallel Markov Clustering in Bioinformatics Using EllPACK-R Sparse Data Format
The massively parallel computing using graphical processing unit (GPU), which based on tens of thousands of parallel threats within hundreds of GPU’s streaming processors, has gained broad popularity and attracted researchers in a wide range of application areas from finance, computer aided engineering, computational fluid dynamics, game physics, numerics, science, medical imaging, life science, and so on, including molecular biology and bioinformatics. Meanwhile, Markov clustering algorithm (MCl) has become one of the most effective and highly cited methods to detect and analyze the communities/clusters within an interaction network dataset on many real world problems such us social, technological, or biological networks including protein-protein interaction networks. However, as the dataset become bigger and bigger, the computation time of MCl algorithm become slower and slower. Hence, GPU computing is an interesting and challenging alternative to attempt to improve the MCl performance. In this poster paper we introduce our improvement of MCl performance based on EllPACK-R sparse dataset format using GPU computing with the Compute Unified Device Architecture tool (CUDA) from NVIDIA (called CUDA-MCl). As the results show the significant improvement in CUDA-MCl performance and with the low-cost and widely available GPU devices in the market today, this CUDA-MCl implementation is allowing large-scale parallel computation on off-the-shelf desktop machines. Moreover the GPU computing approaches potentially may contribute to significantly change the way bioinformaticians and biologists compute and interact with their data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信