Benchmarking R packages for Calculation of Persistent Homology
Eashwar V Somasundaram, Shael E Brown, Adam Litzler, Jacob G Scott, Raoul R Wadhwa
R Journal, 13(1), 184-193. Published 2021-06-01 (Epub 2021-06-07). DOI: 10.32614/RJ-2021-033 (https://doi.org/10.32614/RJ-2021-033)
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434812/pdf/nihms-1733366.pdf
Citations: 7
Abstract
Several persistent homology software libraries have been implemented in R. Specifically, the Dionysus, GUDHI, and Ripser libraries have been wrapped by the TDA and TDAstats CRAN packages. These tools are powerful but computationally expensive and, to our knowledge, have not been formally benchmarked. Here, we analyze runtime and memory growth for the 2 R packages and the 3 underlying libraries. We find that persistent homology of datasets with fewer than 3 dimensions is computed fastest by the GUDHI library in the TDA package. For higher-dimensional datasets, the Ripser library in the TDAstats package is the fastest. Ripser and TDAstats are also the most memory-efficient tools for calculating persistent homology.
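The idea of measuring runtime and memory growth can be illustrated with a small, self-contained sketch. It is written in Python with the standard library only, does not call TDA, TDAstats, Dionysus, GUDHI, or Ripser, and is not the authors' benchmark. The helper names `h0_persistence`, `benchmark`, and `random_cloud` are hypothetical. The sketch computes only 0-dimensional persistent homology of a Vietoris-Rips filtration (a Kruskal-style merge sweep over sorted edges) and records wall-clock time and peak traced memory as the point cloud grows.

```python
import itertools
import math
import random
import time
import tracemalloc


def h0_persistence(points):
    """Return death scales of the 0-dimensional features of a Rips filtration.

    Every point is born at scale 0; a connected component dies when the
    shortest edge joining it to another component enters the filtration.
    """
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All pairwise edges, sorted by Euclidean length.
    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i, j in itertools.combinations(range(n), 2)
    )
    deaths = []
    for length, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj
            deaths.append(length)
    return deaths  # n - 1 finite deaths; one component never dies


def benchmark(func, arg):
    """Return (result, seconds, peak_bytes) for a single call func(arg)."""
    tracemalloc.start()
    start = time.perf_counter()
    result = func(arg)
    seconds = time.perf_counter() - start
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return result, seconds, peak


def random_cloud(n, dim, seed=0):
    """Hypothetical test data: n uniform random points in the unit cube."""
    rng = random.Random(seed)
    return [tuple(rng.random() for _ in range(dim)) for _ in range(n)]


if __name__ == "__main__":
    for n in (50, 100, 200):
        deaths, seconds, peak = benchmark(h0_persistence, random_cloud(n, dim=3))
        print(f"n={n:4d}  features={len(deaths)}  "
              f"time={seconds:.4f}s  peak={peak / 1024:.0f} KiB")
```

Because the sketch materializes every pairwise edge, both time and memory grow roughly quadratically with the number of points. That makes it a convenient toy for observing growth curves, but it says nothing about the relative performance of the libraries compared in the paper.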
Journal: R Journal
Categories: Computer Science, Interdisciplinary Applications; Statistics & Probability
CiteScore: 2.70
Self-citation rate: 0.00%
Articles published: 40
Review time: >12 weeks
About the journal:
The R Journal is the open access, refereed journal of the R project for statistical computing. It features short to medium length articles covering topics that should be of interest to users or developers of R.
The R Journal intends to reach a wide audience and have a thorough review process. Papers are expected to be reasonably short, clearly written, not too technical, and of course focused on R. Authors of refereed articles should take care to:
- put their contribution in context, in particular discuss related R functions or packages;
- explain the motivation for their contribution;
- provide code examples that are reproducible.