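Data reporting quality and semantic interoperability increase with community-based data elements (CoDEs). Analysis of the Open Data Commons for Spinal Cord Injury (ODC-SCI)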

IF 4.6 · Zone 2 (Medicine) · JCR Q1 (Neurosciences)
Anushka Sheoran , Kenneth A. Fond , Lex Maliga Davis , J. Russell Huie , Romana Vavrek , P.J. Axtman , Vance Lemmon , John L. Bixby , Ubbo Visser , John C. Gensel , Karim Fouad , Adam R. Ferguson , Jeffrey S. Grethe , Anita Bandrowski , Maryann E. Martone , Abel Torres-Espin
Experimental Neurology, vol. 385, Article 115100. Published 2024-12-07. Journal Article.
DOI: 10.1016/j.expneurol.2024.115100
Full text: https://www.sciencedirect.com/science/article/pii/S0014488624004266
Citations: 0

Abstract

Data reporting quality and semantic interoperability increase with community-based data elements (CoDEs). Analysis of the open data commons for spinal cord injury (ODC-SCI)
Data interoperability is crucial for effectively combining data for scientific inquiry. To facilitate interoperability, data standards such as a common definition of variables are often developed. The Open Data Commons for Spinal Cord Injury (odc-sci.org) has established an initial set of community-based data elements (CoDEs)—a minimal set of variables for sharing—to promote data interoperability in SCI research, aligning with FAIR (Findable, Accessible, Interoperable, and Reusable) data principles. We sought to understand the use of CoDEs by the SCI community to inform current standards adherence and future standards development. We systematically analyzed 39 public datasets in relation to 17 required CoDEs and found variations between reported data and the structure specified by the CoDEs. Overall, we found that the enforcement of data standards improved reporting rates of CoDEs variables. Notably, different variables were found to require different levels of curation to ensure semantic equivalence among datasets. We also uncovered specific reporting habits of researchers such as formatting and naming patterns. A need for different data standards based on the nature of the study (e.g., human study, derivative study) was realized alongside a detailed list of issues that should be addressed when implementing such standards. Among the various approaches to developing data standards, ODC-SCI adopted a semi-formal approach by creating standards that are easy to adopt by the user. Our data-driven evaluation of actual reporting behavior shows that this flexibility can lead to subsequent problems in harmonization. This study serves as a baseline analysis of reporting behaviors for shaping and facilitating data standards.
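As a rough illustration of the kind of check the abstract describes (comparing the variables each dataset reports against a required list), the sketch below computes a per-variable reporting rate. It is not the authors' method: the variable names in `REQUIRED`, the toy datasets, and the `normalize` matching rule are all hypothetical, chosen only to show how naming and formatting differences might be tolerated when counting.

```python
import re

# Hypothetical required variables; NOT the actual ODC-SCI CoDEs list.
REQUIRED = ["Subject_ID", "Species", "Sex", "Injury_type"]


def normalize(name: str) -> str:
    """Lowercase and drop non-alphanumerics so 'Subject ID' matches 'subject_id'."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def reporting_rates(datasets: dict[str, list[str]], required: list[str]) -> dict[str, float]:
    """Fraction of datasets whose column headers include each required variable."""
    rates = {}
    for var in required:
        key = normalize(var)
        hits = sum(
            1 for cols in datasets.values() if key in {normalize(c) for c in cols}
        )
        rates[var] = hits / len(datasets) if datasets else 0.0
    return rates


# Made-up column headers for three toy datasets (assumed, for illustration only).
toy = {
    "dataset_a": ["Subject_ID", "Species", "Sex", "Injury_type"],
    "dataset_b": ["subject id", "species", "Weight"],
    "dataset_c": ["SubjectID", "Sex", "injury-type"],
}
rates = reporting_rates(toy, REQUIRED)
```

Matching on normalized names only catches spelling and formatting variants. Semantic equivalence (for example, whether two columns with the same name use the same units or coding) would still need the manual curation the abstract mentions.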
Source journal: Experimental Neurology (Medicine: Neuroscience)
CiteScore: 10.10
Self-citation rate: 3.80%
Articles published: 258
Review time: 42 days
About the journal: Experimental Neurology, a Journal of Neuroscience Research, publishes original research in neuroscience with a particular emphasis on novel findings in neural development, regeneration, plasticity and transplantation. The journal has focused on research concerning basic mechanisms underlying neurological disorders.