{"title":"Computational Benchmark Study in Spatio-Temporal Statistics With a Hands-On Guide to Optimise R","authors":"Lorenzo Tedesco, Jacopo Rodeschini, Philipp Otto","doi":"10.1002/env.70017","DOIUrl":null,"url":null,"abstract":"<p>This study provides a comprehensive evaluation of the computational performance of <span>R</span>, <span>MATLAB</span>, <span>Python</span>, and <span>Julia</span> for spatial and spatio-temporal modelling, focusing on high-dimensional datasets typical in geospatial statistical analysis. We benchmark each language across key tasks, including matrix manipulations and transformations, iterative programming routines, and Input/Output processes, all of which are critical in environmetrics. The results demonstrate that <span>MATLAB</span> excels in matrix-based computations, while <span>Julia</span> consistently delivers competitive performance across a wide range of tasks, establishing itself as a robust, open-source alternative. <span>Python</span>, when combined with libraries like <span>NumPy</span>, shows strength in specific numerical operations, offering versatility for general-purpose programming. <span>R</span>, despite its slower default performance in raw computations, proves to be highly adaptable; by linking to optimized libraries like <span>OpenBLAS</span> or <span>MKL</span> and integrating <span>C++</span> with packages like <span>Rcpp</span>, <span>R</span> achieves significant performance gains, becoming competitive with the other languages. This study also provides practical guidance for researchers to optimize <span>R</span> for geospatial data processing, offering insights to support the selection of the most suitable language for specific modelling requirements.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 5","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.70017","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmetrics","FirstCategoryId":"93","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/env.70017","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study provides a comprehensive evaluation of the computational performance of R, MATLAB, Python, and Julia for spatial and spatio-temporal modelling, focusing on high-dimensional datasets typical in geospatial statistical analysis. We benchmark each language across key tasks, including matrix manipulations and transformations, iterative programming routines, and Input/Output processes, all of which are critical in environmetrics. The results demonstrate that MATLAB excels in matrix-based computations, while Julia consistently delivers competitive performance across a wide range of tasks, establishing itself as a robust, open-source alternative. Python, when combined with libraries like NumPy, shows strength in specific numerical operations, offering versatility for general-purpose programming. R, despite its slower default performance in raw computations, proves to be highly adaptable; by linking to optimized libraries like OpenBLAS or MKL and integrating C++ with packages like Rcpp, R achieves significant performance gains, becoming competitive with the other languages. This study also provides practical guidance for researchers to optimize R for geospatial data processing, offering insights to support the selection of the most suitable language for specific modelling requirements.
期刊介绍:
Environmetrics, the official journal of The International Environmetrics Society (TIES), an Association of the International Statistical Institute, is devoted to the dissemination of high-quality quantitative research in the environmental sciences.
The journal welcomes pertinent and innovative submissions from quantitative disciplines developing new statistical and mathematical techniques, methods, and theories that solve modern environmental problems. Articles must proffer substantive, new statistical or mathematical advances to answer important scientific questions in the environmental sciences, or must develop novel or enhanced statistical methodology with clear applications to environmental science. New methods should be illustrated with recent environmental data.