Sergio Santander-Jiménez, Miguel A. Vega-Rodríguez
{"title":"High-level multi-platform approaches for scoring phylogenies on CPU and GPU devices","authors":"Sergio Santander-Jiménez, Miguel A. Vega-Rodríguez","doi":"10.1016/j.future.2025.108088","DOIUrl":null,"url":null,"abstract":"<div><div>Research on parallel computing has promoted significant advances in the solution of different real-world problems. One of the main application domains that have benefited from the exploitation of parallelism is bioinformatics. However, bioinformatics tools are often constrained by the use of low-level platform-specific programming models. This issue limits the spectrum of hardware that can be used to accelerate time-consuming biological workloads, while also affecting programming productivity and code maintenance. To address such limitations, this work investigates high-level multi-platform approaches to parallelize, as a case study, an important task in evolutionary bioinformatics: the evaluation of phylogenetic quality. Particularly, we define parallel designs of the parsimony scoring function for CPU and GPU devices, using three high-level application programming interfaces with multi-platform support: OpenMP, OpenACC, and SYCL. Different optimizations and algorithmic strategies are defined to boost execution according to the characteristics of the tackled parallel tasks. An in-depth experimental evaluation on real-world data reveals the main strengths and areas of improvement for each high-level multi-platform approach, in comparison to lower-level platform-specific alternatives. In addition, the study of single-source performance portability suggests that CPU designs behave better than GPU ones, yet device-oriented optimizations are still needed to pursue a precise exploitation of computational resources.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"175 ","pages":"Article 108088"},"PeriodicalIF":6.2000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25003826","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Research on parallel computing has promoted significant advances in the solution of different real-world problems. One of the main application domains that have benefited from the exploitation of parallelism is bioinformatics. However, bioinformatics tools are often constrained by the use of low-level platform-specific programming models. This issue limits the spectrum of hardware that can be used to accelerate time-consuming biological workloads, while also affecting programming productivity and code maintenance. To address such limitations, this work investigates high-level multi-platform approaches to parallelize, as a case study, an important task in evolutionary bioinformatics: the evaluation of phylogenetic quality. Particularly, we define parallel designs of the parsimony scoring function for CPU and GPU devices, using three high-level application programming interfaces with multi-platform support: OpenMP, OpenACC, and SYCL. Different optimizations and algorithmic strategies are defined to boost execution according to the characteristics of the tackled parallel tasks. An in-depth experimental evaluation on real-world data reveals the main strengths and areas of improvement for each high-level multi-platform approach, in comparison to lower-level platform-specific alternatives. In addition, the study of single-source performance portability suggests that CPU designs behave better than GPU ones, yet device-oriented optimizations are still needed to pursue a precise exploitation of computational resources.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.