GitHub is an effective platform for collaborative and reproducible laboratory research

Katharine Y. Chen, Maria Toro-Moreno, Arvind Rasi Subramaniam
{"title":"GitHub is an effective platform for collaborative and reproducible laboratory research","authors":"Katharine Y. Chen, Maria Toro-Moreno, Arvind Rasi Subramaniam","doi":"arxiv-2408.09344","DOIUrl":null,"url":null,"abstract":"Laboratory research is a complex, collaborative process that involves several\nstages, including hypothesis formulation, experimental design, data generation\nand analysis, and manuscript writing. Although reproducibility and data sharing\nare increasingly prioritized at the publication stage, integrating these\nprinciples at earlier stages of laboratory research has been hampered by the\nlack of broadly applicable solutions. Here, we propose that the workflow used\nin modern software development offers a robust framework for enhancing\nreproducibility and collaboration in laboratory research. In particular, we\nshow that GitHub, a platform widely used for collaborative software projects,\ncan be effectively adapted to organize and document all aspects of a research\nproject's lifecycle in a molecular biology laboratory. We outline a three-step\napproach for incorporating the GitHub ecosystem into laboratory research\nworkflows: 1. designing and organizing experiments using issues and project\nboards, 2. documenting experiments and data analyses with a version control\nsystem, and 3. ensuring reproducible software environments for data analyses\nand writing tasks with containerized packages. The versatility, scalability,\nand affordability of this approach make it suitable for various scenarios,\nranging from small research groups to large, cross-institutional\ncollaborations. Adopting this framework from a project's outset can increase\nthe efficiency and fidelity of knowledge transfer within and across research\nlaboratories. An example GitHub repository based on the above approach is\navailable at https://github.com/rasilab/github_demo.","PeriodicalId":501219,"journal":{"name":"arXiv - QuanBio - Other Quantitative Biology","volume":"36 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Other Quantitative Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.09344","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Laboratory research is a complex, collaborative process that involves several stages, including hypothesis formulation, experimental design, data generation and analysis, and manuscript writing. Although reproducibility and data sharing are increasingly prioritized at the publication stage, integrating these principles at earlier stages of laboratory research has been hampered by the lack of broadly applicable solutions. Here, we propose that the workflow used in modern software development offers a robust framework for enhancing reproducibility and collaboration in laboratory research. In particular, we show that GitHub, a platform widely used for collaborative software projects, can be effectively adapted to organize and document all aspects of a research project's lifecycle in a molecular biology laboratory. We outline a three-step approach for incorporating the GitHub ecosystem into laboratory research workflows: 1. designing and organizing experiments using issues and project boards, 2. documenting experiments and data analyses with a version control system, and 3. ensuring reproducible software environments for data analyses and writing tasks with containerized packages. The versatility, scalability, and affordability of this approach make it suitable for various scenarios, ranging from small research groups to large, cross-institutional collaborations. Adopting this framework from a project's outset can increase the efficiency and fidelity of knowledge transfer within and across research laboratories. An example GitHub repository based on the above approach is available at https://github.com/rasilab/github_demo.
GitHub 是实验室合作研究和可重现性研究的有效平台
实验室研究是一个复杂的协作过程,涉及多个阶段,包括假设提出、实验设计、数据生成和分析以及手稿撰写。尽管可重复性和数据共享在发表阶段越来越受到重视,但由于缺乏广泛适用的解决方案,在实验室研究的早期阶段整合这些原则一直受到阻碍。在这里,我们提出,现代软件开发中使用的工作流程为提高实验室研究的可重复性和协作性提供了一个强大的框架。我们特别展示了 GitHub(一个广泛应用于协作软件项目的平台)可以有效地用于组织和记录分子生物学实验室研究项目生命周期的各个方面。我们概述了将 GitHub 生态系统融入实验室研究工作流程的三步方法:1.使用问题和项目板设计和组织实验;2.使用版本控制系统记录实验和数据分析;3.使用容器包确保数据分析和编写任务的软件环境具有可重复性。这种方法的多功能性、可扩展性和经济性使其适用于从小型研究小组到大型跨机构合作的各种情况。从项目一开始就采用这种框架,可以提高研究实验室内部和跨实验室知识转移的效率和保真度。基于上述方法的 GitHub 知识库示例可在 https://github.com/rasilab/github_demo 上获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信