{"title":"Guidelines for the Creation of Analysis Ready Data","authors":"Harriette Phillips, Aiden Price, Owen Forbes, Claire Boulange, Kerrie Mengersen, Marketa Reeves, Rebecca Glauert","doi":"arxiv-2403.08127","DOIUrl":null,"url":null,"abstract":"Globally, there is an increased need for guidelines to produce high-quality\ndata outputs for analysis. There is no framework currently exists providing\nguidelines for a comprehensive approach in producing analysis ready data (ARD).\nThrough critically reviewing and summarising current literature, this paper\nproposes such guidelines for the creation of ARD. The guidelines proposed in\nthis paper inform ten steps in the generation of ARD: ethics, project\ndocumentation, data governance, data management, data storage, data discovery\nand collection, data cleaning, quality assurance, metadata, and data\ndictionary. These steps are illustrated through a substantive case study which\naimed to create ARD for a digital spatial platform: the Australian Child and\nYouth Wellbeing Atlas (ACYWA).","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":"15 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.08127","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Globally, there is an increased need for guidelines to produce high-quality
data outputs for analysis. There is no framework currently exists providing
guidelines for a comprehensive approach in producing analysis ready data (ARD).
Through critically reviewing and summarising current literature, this paper
proposes such guidelines for the creation of ARD. The guidelines proposed in
this paper inform ten steps in the generation of ARD: ethics, project
documentation, data governance, data management, data storage, data discovery
and collection, data cleaning, quality assurance, metadata, and data
dictionary. These steps are illustrated through a substantive case study which
aimed to create ARD for a digital spatial platform: the Australian Child and
Youth Wellbeing Atlas (ACYWA).