A Contemporary Conceptual Framework for Initial Data Analysis

M. Huebner, S. le Cessie, C. O. Schmidt, W. Vach
{"title":"A Contemporary Conceptual Framework for Initial Data Analysis","authors":"M. Huebner, S. le Cessie, C. O. Schmidt, W. Vach","doi":"10.1353/obs.2018.0014","DOIUrl":null,"url":null,"abstract":"Abstract:Initial data analyses (IDA) are often performed as part of studies with primary-data collection, where data are obtained to address a predefined set of research questions, and with a clear plan of the intended statistical analyses. An informal or unstructured approach may have a large and non-transparent impact on results and conclusions presented in publications. Key principles for IDA are to avoid analyses that are part of the research question, and full documentation and transparency.We develop a framework for IDA from the perspective of a study with primary-data collection and define and discuss six steps of IDA: (1) Metadata setup to properly conduct all following IDA steps, (2) Data cleaning to identify and correct data errors, (3) Data screening that consists of understanding the properties of the data, (4) Initial data reporting that informs all potential collaborators working with the data about insights, (5) Refining and updating the analysis plan to incorporate the relevant findings, (6) Reporting of IDA in research papers to document steps that impact the interpretation of results. We describe basic principles to be applied in each step and illustrate them by example.Initial data analysis needs to be recognized as an important part and independent element of the research process. Lack of resources or organizational barriers can be obstacles to IDA. Further methodological developments are needed for IDA dealing with multi-purpose studies or increasingly complex data sets.","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2018.0014","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Observational studies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1353/obs.2018.0014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31

Abstract

Abstract:Initial data analyses (IDA) are often performed as part of studies with primary-data collection, where data are obtained to address a predefined set of research questions, and with a clear plan of the intended statistical analyses. An informal or unstructured approach may have a large and non-transparent impact on results and conclusions presented in publications. Key principles for IDA are to avoid analyses that are part of the research question, and full documentation and transparency.We develop a framework for IDA from the perspective of a study with primary-data collection and define and discuss six steps of IDA: (1) Metadata setup to properly conduct all following IDA steps, (2) Data cleaning to identify and correct data errors, (3) Data screening that consists of understanding the properties of the data, (4) Initial data reporting that informs all potential collaborators working with the data about insights, (5) Refining and updating the analysis plan to incorporate the relevant findings, (6) Reporting of IDA in research papers to document steps that impact the interpretation of results. We describe basic principles to be applied in each step and illustrate them by example.Initial data analysis needs to be recognized as an important part and independent element of the research process. Lack of resources or organizational barriers can be obstacles to IDA. Further methodological developments are needed for IDA dealing with multi-purpose studies or increasingly complex data sets.
初始数据分析的当代概念框架
摘要:初始数据分析(IDA)通常是作为研究的一部分进行的,主要数据收集是为了解决预先定义的一组研究问题,并有明确的统计分析计划。非正式或非结构化的方法可能对出版物中提出的结果和结论产生巨大而不透明的影响。IDA的主要原则是避免作为研究问题一部分的分析,以及完整的文件和透明度。我们从初级数据收集研究的角度为IDA开发了一个框架,并定义和讨论了IDA的六个步骤:(1)元数据设置,以正确执行以下IDA步骤;(2)数据清理,以识别和纠正数据错误,(4)初始数据报告,告知所有使用数据的潜在合作者有关见解,(5)完善和更新分析计划,以纳入相关发现,(6)在研究论文中报告IDA,以记录影响结果解释的步骤。我们描述了在每一步中应用的基本原则,并通过实例加以说明。初始数据分析需要被视为研究过程的重要组成部分和独立元素。缺乏资源或组织障碍可能是IDA的障碍。IDA需要进一步发展方法,以处理多用途研究或日益复杂的数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信