{"title":"共享去标识化数据和代码的指导原则和最佳做法","authors":"Nicholas J. Horton, Sara Stoudt","doi":"arxiv-2405.18232","DOIUrl":null,"url":null,"abstract":"In 2022, the Journal of Statistics and Data Science Education (JSDSE)\ninstituted augmented requirements for authors to post deidentified data and\ncode underlying their papers. These changes were prompted by an increased focus\non reproducibility and open science (NASEM 2019). A recent review of data\navailability practices noted that \"such policies help increase the\nreproducibility of the published literature, as well as make a larger body of\ndata available for reuse and re-analysis\" (PLOS ONE, 2024). JSDSE values\naccessibility as it endeavors to share knowledge that can improve educational\napproaches to teaching statistics and data science. Because institution,\nenvironment, and students differ across readers of the journal, it is\nespecially important to facilitate the transfer of a journal article's findings\nto new contexts. This process may require digging into more of the details,\nincluding the deidentified data and code. Our goal is to provide our readers\nand authors with a review of why the requirements for code and data sharing\nwere instituted, summarize ongoing trends and developments in open science,\ndiscuss options for data and code sharing, and share advice for authors.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":"133 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Guidelines and Best Practices to Share Deidentified Data and Code\",\"authors\":\"Nicholas J. Horton, Sara Stoudt\",\"doi\":\"arxiv-2405.18232\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In 2022, the Journal of Statistics and Data Science Education (JSDSE)\\ninstituted augmented requirements for authors to post deidentified data and\\ncode underlying their papers. These changes were prompted by an increased focus\\non reproducibility and open science (NASEM 2019). A recent review of data\\navailability practices noted that \\\"such policies help increase the\\nreproducibility of the published literature, as well as make a larger body of\\ndata available for reuse and re-analysis\\\" (PLOS ONE, 2024). JSDSE values\\naccessibility as it endeavors to share knowledge that can improve educational\\napproaches to teaching statistics and data science. Because institution,\\nenvironment, and students differ across readers of the journal, it is\\nespecially important to facilitate the transfer of a journal article's findings\\nto new contexts. This process may require digging into more of the details,\\nincluding the deidentified data and code. Our goal is to provide our readers\\nand authors with a review of why the requirements for code and data sharing\\nwere instituted, summarize ongoing trends and developments in open science,\\ndiscuss options for data and code sharing, and share advice for authors.\",\"PeriodicalId\":501323,\"journal\":{\"name\":\"arXiv - STAT - Other Statistics\",\"volume\":\"133 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Other Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.18232\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.18232","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Guidelines and Best Practices to Share Deidentified Data and Code
In 2022, the Journal of Statistics and Data Science Education (JSDSE)
instituted augmented requirements for authors to post deidentified data and
code underlying their papers. These changes were prompted by an increased focus
on reproducibility and open science (NASEM 2019). A recent review of data
availability practices noted that "such policies help increase the
reproducibility of the published literature, as well as make a larger body of
data available for reuse and re-analysis" (PLOS ONE, 2024). JSDSE values
accessibility as it endeavors to share knowledge that can improve educational
approaches to teaching statistics and data science. Because institution,
environment, and students differ across readers of the journal, it is
especially important to facilitate the transfer of a journal article's findings
to new contexts. This process may require digging into more of the details,
including the deidentified data and code. Our goal is to provide our readers
and authors with a review of why the requirements for code and data sharing
were instituted, summarize ongoing trends and developments in open science,
discuss options for data and code sharing, and share advice for authors.