Proceedings of the First International Workshop on Practical Reproducible Evaluation of Computer Systems: Latest Publications

Improving Reproducibility of Distributed Computational Experiments
Quan Pham, T. Malik, Dai Hai Ton That, A. Youngdahl
DOI: 10.1145/3214239.3214241 | Published: 2018-06-11
Abstract: Conference and journal publications increasingly require the experiments associated with a submitted article to be repeatable. Authors comply with this requirement by sharing all associated digital artifacts, i.e., code, data, and environment configuration scripts. Several tools have recently emerged that ease this aggregation by auditing an experiment's execution and building a portable container of code, data, and environment. However, current tools only package non-distributed computational experiments; distributed experiments must either be packaged manually or supplemented with sufficient documentation. In this paper, we outline the reproducibility requirements of distributed experiments using a distributed computational science experiment based on the message-passing interface (MPI), and we propose a general method for auditing and repeating distributed experiments. Using Sciunit, we show how this method can be implemented. We validate our method with initial experiments showing that application re-execution runtime can be improved by 63%, at the cost of a longer runtime for the initial audited execution.
Citations: 1
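The audit-then-repeat pattern described in the abstract (record everything a first execution touches, then replay from the resulting package) can be illustrated with a short sketch. The following Python fragment drives a Sciunit-style command-line tool via subprocess; the command names and flags are assumptions for illustration and are not taken from the paper.

```python
import subprocess

def run(cmd):
    """Run a command, echoing it first, and fail loudly on error."""
    print("+", " ".join(cmd))
    subprocess.run(cmd, check=True)

# 1. Create a new package (container of code, data, and environment).
#    Command names below are illustrative assumptions.
run(["sciunit", "create", "mpi-experiment"])

# 2. Audit an MPI execution: files, libraries, and binaries touched by
#    the ranks are recorded and copied into the package. This is the
#    slower, instrumented first run.
run(["sciunit", "exec", "mpirun", "-np", "4", "./simulate", "input.dat"])

# 3. Repeat the audited execution elsewhere, replaying from the package;
#    this re-execution is where the reported ~63% runtime improvement
#    would be observed.
run(["sciunit", "repeat", "e1"])
```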
Proceedings of the First International Workshop on Practical Reproducible Evaluation of Computer Systems (front matter)
DOI: 10.1145/3214239 | Published: 2018-06-11
Citations: 0
Popper Pitfalls: Experiences Following a Reproducibility Convention
Michael Sevilla, C. Maltzahn
DOI: 10.1145/3214239.3214243 | Published: 2018-06-11
Abstract: We describe the four publications we have tried to make reproducible and discuss how each paper has changed our workflows, practices, and collaboration policies. The fundamental insight is that paper artifacts must be made reproducible from the start of the project; artifacts are too difficult to make reproducible when the papers are (1) already published and (2) authored by researchers who are not thinking about reproducibility. In this paper, we present the best practices adopted by our research laboratory, shaped by the pitfalls we identified while following the Popper convention. We conclude with a "call to arms" for the community, focused on enhancing reproducibility initiatives at academic conferences, in industry environments, and at national laboratories. We hope that our experiences will shape a best-practices guide for future reproducible papers.
Citations: 3
Software Provenance: Track the Reality Not the Virtual Machine
D. Wilkinson, Luis Oliveira, D. Mossé, B. Childers
DOI: 10.1145/3214239.3214244 | Published: 2018-06-11
Abstract: The growing use of computers and massive storage by individuals is driving interest in digital preservation. The scientific method demands accountability through digital reproducibility, adding another strong motivation for preservation. However, data alone can become obsolete if the interactive software required to interpret it is lost. Virtual machines (VMs) may preserve interactivity, but they do so at the cost of obscuring the nature of what lies within. Occam, instead, builds VMs on the fly while storing and distributing well-described software packages. The system can thus track the exact components inside VMs without storing the machines themselves, allowing software to be repeatably built and executed. For Occam to recreate a VM, it needs to know exactly what software was used within it; through this tracking, such software can even be modified and rebuilt. Occam records all of these components in manifests, allowing anybody to know exactly what is in each VM and the origin of each component.
Citations: 5
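The central architectural idea in this abstract is that every component of a VM is described in a manifest, so the VM itself never has to be stored and can be rebuilt on demand. The sketch below shows what a manifest-driven inspection might look like; the manifest schema and field names are invented for illustration and do not reflect Occam's actual format.

```python
import json

# Hypothetical manifest describing a VM's contents; the schema is
# invented for illustration and is not Occam's actual format.
manifest_json = """
{
  "experiment": "cache-simulation",
  "base_image": {"name": "debian", "version": "9.4"},
  "components": [
    {"name": "gcc",        "version": "6.3.0",   "source": "https://gcc.gnu.org"},
    {"name": "simulator",  "version": "1.2",     "source": "git://example.org/simulator.git"},
    {"name": "input-data", "version": "2018-01", "source": "https://example.org/data.tar.gz"}
  ]
}
"""

manifest = json.loads(manifest_json)

# Because every component is described by name, version, and origin,
# anyone can see exactly what is inside the VM, rebuild it on the fly,
# or rebuild it with a modified component.
print(f"VM for '{manifest['experiment']}' built on "
      f"{manifest['base_image']['name']} {manifest['base_image']['version']}:")
for c in manifest["components"]:
    print(f"  {c['name']} {c['version']}  <- {c['source']}")
```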
Supporting Long-term Reproducible Software Execution
Luis Oliveira, D. Wilkinson, D. Mossé, B. Childers
DOI: 10.1145/3214239.3214245 | Published: 2018-06-11
Abstract: The recent, widespread realization that software experiments are not as easily replicated as once believed has brought software execution preservation into the scientific spotlight. As a result, scientists, institutions, and funding agencies have been pushing for methodologies and tools that preserve software artifacts. Despite current efforts, long-term reproducibility still eludes us. In this paper, we present the requirements for software execution preservation and discuss how to improve long-term reproducibility in science. In particular, we discuss why preserving binaries and pre-built execution environments is not enough, and why preserving the ability to replicate results is not the same as preserving software for reproducible science. Finally, we show how these requirements are supported by Occam, an open curation framework that fully preserves software and its dependencies from source to execution, promoting transparency, longevity, and reuse. Specifically, Occam can automatically deploy workflows in a fully functional environment that is able not only to run them, but to make them easily replicable.
Citations: 11
Enabling the Verification of Computational Results: An Empirical Evaluation of Computational Reproducibility
V. Stodden, M. Krafczyk, A. Bhaskar
DOI: 10.1145/3214239.3214242 | Published: 2018-06-11
Abstract: The ability to independently regenerate published computational claims is widely recognized as a key component of scientific reproducibility. In this article we take a narrow interpretation of this goal and attempt to regenerate published claims from author-supplied information, including data, code, inputs, and other provided specifications, on a different computational system than that used by the original authors. We are motivated by Claerbout and Donoho's exhortation on the importance of providing complete information for reproducibility of a published claim. We chose the Journal of Computational Physics, an Elsevier journal whose stated author guidelines encourage the availability of the computational digital artifacts that support scholarly findings. In an IRB-approved study at the University of Illinois at Urbana-Champaign (IRB #17329), we gathered artifacts from a sample of authors who published in this journal in 2016 and 2017. We then used the criteria generated at the 2012 ICERM workshop "Reproducibility in Computational and Experimental Mathematics" to evaluate the sufficiency of the information provided in the publications and the ease with which the digital artifacts afforded computational reproducibility. We find that, for the articles for which we obtained computational artifacts (55 of 306), we could not easily regenerate the findings for 67% of them, and we were unable to easily regenerate all of the findings for any article. Evaluating the artifacts we did obtain, we find that the main barriers to computational reproducibility are inadequate documentation of code, data, and workflow information (70.9%), missing code function and setting information, and missing licensing information (75%). We recommend improvements based on these findings, including the deposit of supporting digital artifacts for reproducibility as a condition of publication, and the verification of computational findings via re-execution of the code when possible.
Citations: 20
MonEx: An Integrated Experiment Monitoring Framework Standing on Off-The-Shelf Components
Abdulqawi Saif, Alexandre Merlin, L. Nussbaum, Yeqiong Song
DOI: 10.1145/3214239.3214240 | Published: 2018-06-11
Abstract: Most computer experiments include a phase in which metrics are gathered from, and about, various kinds of resources. This phase is often carried out through manual, non-reproducible, and error-prone steps. Infrastructure monitoring tools facilitate the collection of experiment data to some extent, but there is no conventional way of doing so, and much work remains (e.g., capturing user experiments) before monitoring activity can truly serve experiment monitoring. To overcome these challenges, we define the requirements of experiment monitoring, clarifying the scope of experiment monitoring frameworks (EMFs) and focusing mainly on the reusability of experiment data and the portability of experiment metrics. We then propose MonEx, an EMF that satisfies these requirements. MonEx is built on top of infrastructure monitoring solutions and supports various monitoring approaches. It integrates fully into the experiment workflow by encompassing all steps from data acquisition to the production of publishable figures. MonEx thus represents a first step towards unifying methods for collecting experiment data.
Citations: 1
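The abstract's distinguishing claim is coverage of the whole path from metric acquisition to publishable figures on top of off-the-shelf monitoring components. As a rough sketch of that end-to-end idea (not MonEx's actual API), the Python fragment below pulls a time series from a generic HTTP metrics endpoint and turns it directly into a figure; the endpoint URL, metric name, and response layout are placeholders.

```python
# Rough sketch of the acquisition-to-figure path the abstract describes;
# this is not MonEx's actual API. The endpoint URL, metric name, and
# sample layout are placeholders for whatever monitoring backend is used.
import json
from urllib.request import urlopen

import matplotlib.pyplot as plt

METRICS_URL = "http://monitoring.example.org/api/query?metric=node_cpu_usage"

with urlopen(METRICS_URL) as resp:
    samples = json.load(resp)   # assumed layout: [{"t": <seconds>, "value": <float>}, ...]

times = [s["t"] for s in samples]
values = [s["value"] for s in samples]

plt.plot(times, values)
plt.xlabel("time (s)")
plt.ylabel("CPU usage (%)")
plt.title("Experiment node CPU usage")
plt.savefig("cpu_usage.pdf")    # a publishable figure, produced straight from the collected data
```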