Reza Rafati Bonab, Ali Akbar Jamali, Kyle Klenk, Mohammad Mahdi Moayeri, Raymond J Spiteri
{"title":"SW-actors: accelerating the Smith-Waterman algorithm via actors.","authors":"Reza Rafati Bonab, Ali Akbar Jamali, Kyle Klenk, Mohammad Mahdi Moayeri, Raymond J Spiteri","doi":"10.1093/bioadv/vbaf173","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>The Smith-Waterman (SW) algorithm is widely regarded as the gold standard for local sequence alignment. However, its time complexity in a serial implementation limits its practicality for large datasets. In this article, we introduce SW-actors, a parallel implementation of the SW algorithm that leverages the actor model of concurrent computation to optimize resource utilization by efficiently scheduling and managing independent alignment tasks across processors at both the interalignment and intraalignment levels.</p><p><strong>Results: </strong>SW-actors is compared with the state-of-the-art implementations Parasail, SeqAn, and SWIPE using four datasets of varying sequence lengths ranging from 85 to 74778 nucleotides. In terms of wall-clock time, SW-actors is 1.33 <math><mo>×</mo></math> , 2.00 <math><mo>×</mo></math> , 2.49 <math><mo>×</mo></math> , and 1.94 <math><mo>×</mo></math> faster than the next best implementation for the different datasets. SW-actors is up to 22 <math><mo>×</mo></math> faster than serial on 40 cores. The speedup is consistent for larger datasets and hence offers significant advantages for medium- to large-scale alignments.</p><p><strong>Availability and implementation: </strong>The SW-actors source code and underlying data are available at https://git.cs.usask.ca/numerical_simulations_lab/actors/papers/sw-actors.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf173"},"PeriodicalIF":2.8000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12449131/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: The Smith-Waterman (SW) algorithm is widely regarded as the gold standard for local sequence alignment. However, its time complexity in a serial implementation limits its practicality for large datasets. In this article, we introduce SW-actors, a parallel implementation of the SW algorithm that leverages the actor model of concurrent computation to optimize resource utilization by efficiently scheduling and managing independent alignment tasks across processors at both the interalignment and intraalignment levels.
Results: SW-actors is compared with the state-of-the-art implementations Parasail, SeqAn, and SWIPE using four datasets of varying sequence lengths ranging from 85 to 74778 nucleotides. In terms of wall-clock time, SW-actors is 1.33 , 2.00 , 2.49 , and 1.94 faster than the next best implementation for the different datasets. SW-actors is up to 22 faster than serial on 40 cores. The speedup is consistent for larger datasets and hence offers significant advantages for medium- to large-scale alignments.
Availability and implementation: The SW-actors source code and underlying data are available at https://git.cs.usask.ca/numerical_simulations_lab/actors/papers/sw-actors.