Anushka Sheoran , Kenneth A. Fond , Lex Maliga Davis , J. Russell Huie , Romana Vavrek , P.J. Axtman , Vance Lemmon , John L. Bixby , Ubbo Visser , John C. Gensel , Karim Fouad , Adam R. Ferguson , Jeffrey S. Grethe , Anita Bandrowski , Maryann E. Martone , Abel Torres-Espin
{"title":"基于社区的数据元素(代码)提高了数据报告质量和语义互操作性。脊髓损伤(ODC-SCI)的开放数据分析。","authors":"Anushka Sheoran , Kenneth A. Fond , Lex Maliga Davis , J. Russell Huie , Romana Vavrek , P.J. Axtman , Vance Lemmon , John L. Bixby , Ubbo Visser , John C. Gensel , Karim Fouad , Adam R. Ferguson , Jeffrey S. Grethe , Anita Bandrowski , Maryann E. Martone , Abel Torres-Espin","doi":"10.1016/j.expneurol.2024.115100","DOIUrl":null,"url":null,"abstract":"<div><div>Data interoperability is crucial for effectively combining data for scientific inquiry. To facilitate interoperability, data standards such as a common definition of variables are often developed. The Open Data Commons for Spinal Cord Injury (<span><span>odc-sci.org</span><svg><path></path></svg></span>) has established an initial set of community-based data elements (CoDEs)—a minimal set of variables for sharing—to promote data interoperability in SCI research, aligning with FAIR (Findable, Accessible, Interoperable, and Reusable) data principles. We sought to understand the use of CoDEs by the SCI community to inform current standards adherence and future standards development. We systematically analyzed 39 public datasets in relation to 17 required CoDEs and found variations between reported data and the structure specified by the CoDEs. Overall, we found that the enforcement of data standards improved reporting rates of CoDEs variables. Notably, different variables were found to require different levels of curation to ensure semantic equivalence among datasets. We also uncovered specific reporting habits of researchers such as formatting and naming patterns. A need for different data standards based on the nature of the study (e.g., human study, derivative study) was realized alongside a detailed list of issues that should be addressed when implementing such standards. Among the various approaches to developing data standards, ODC-SCI adopted a semi-formal approach by creating standards that are easy to adopt by the user. Our data-driven evaluation of actual reporting behavior shows that this flexibility can lead to subsequent problems in harmonization. This study serves as a baseline analysis of reporting behaviors for shaping and facilitating data standards.</div></div>","PeriodicalId":12246,"journal":{"name":"Experimental Neurology","volume":"385 ","pages":"Article 115100"},"PeriodicalIF":4.6000,"publicationDate":"2024-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data reporting quality and semantic interoperability increase with community-based data elements (CoDEs). Analysis of the open data commons for spinal cord injury (ODC-SCI)\",\"authors\":\"Anushka Sheoran , Kenneth A. Fond , Lex Maliga Davis , J. Russell Huie , Romana Vavrek , P.J. Axtman , Vance Lemmon , John L. Bixby , Ubbo Visser , John C. Gensel , Karim Fouad , Adam R. Ferguson , Jeffrey S. Grethe , Anita Bandrowski , Maryann E. Martone , Abel Torres-Espin\",\"doi\":\"10.1016/j.expneurol.2024.115100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Data interoperability is crucial for effectively combining data for scientific inquiry. To facilitate interoperability, data standards such as a common definition of variables are often developed. The Open Data Commons for Spinal Cord Injury (<span><span>odc-sci.org</span><svg><path></path></svg></span>) has established an initial set of community-based data elements (CoDEs)—a minimal set of variables for sharing—to promote data interoperability in SCI research, aligning with FAIR (Findable, Accessible, Interoperable, and Reusable) data principles. We sought to understand the use of CoDEs by the SCI community to inform current standards adherence and future standards development. We systematically analyzed 39 public datasets in relation to 17 required CoDEs and found variations between reported data and the structure specified by the CoDEs. Overall, we found that the enforcement of data standards improved reporting rates of CoDEs variables. Notably, different variables were found to require different levels of curation to ensure semantic equivalence among datasets. We also uncovered specific reporting habits of researchers such as formatting and naming patterns. A need for different data standards based on the nature of the study (e.g., human study, derivative study) was realized alongside a detailed list of issues that should be addressed when implementing such standards. Among the various approaches to developing data standards, ODC-SCI adopted a semi-formal approach by creating standards that are easy to adopt by the user. Our data-driven evaluation of actual reporting behavior shows that this flexibility can lead to subsequent problems in harmonization. This study serves as a baseline analysis of reporting behaviors for shaping and facilitating data standards.</div></div>\",\"PeriodicalId\":12246,\"journal\":{\"name\":\"Experimental Neurology\",\"volume\":\"385 \",\"pages\":\"Article 115100\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Experimental Neurology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0014488624004266\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Experimental Neurology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0014488624004266","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
Data reporting quality and semantic interoperability increase with community-based data elements (CoDEs). Analysis of the open data commons for spinal cord injury (ODC-SCI)
Data interoperability is crucial for effectively combining data for scientific inquiry. To facilitate interoperability, data standards such as a common definition of variables are often developed. The Open Data Commons for Spinal Cord Injury (odc-sci.org) has established an initial set of community-based data elements (CoDEs)—a minimal set of variables for sharing—to promote data interoperability in SCI research, aligning with FAIR (Findable, Accessible, Interoperable, and Reusable) data principles. We sought to understand the use of CoDEs by the SCI community to inform current standards adherence and future standards development. We systematically analyzed 39 public datasets in relation to 17 required CoDEs and found variations between reported data and the structure specified by the CoDEs. Overall, we found that the enforcement of data standards improved reporting rates of CoDEs variables. Notably, different variables were found to require different levels of curation to ensure semantic equivalence among datasets. We also uncovered specific reporting habits of researchers such as formatting and naming patterns. A need for different data standards based on the nature of the study (e.g., human study, derivative study) was realized alongside a detailed list of issues that should be addressed when implementing such standards. Among the various approaches to developing data standards, ODC-SCI adopted a semi-formal approach by creating standards that are easy to adopt by the user. Our data-driven evaluation of actual reporting behavior shows that this flexibility can lead to subsequent problems in harmonization. This study serves as a baseline analysis of reporting behaviors for shaping and facilitating data standards.
期刊介绍:
Experimental Neurology, a Journal of Neuroscience Research, publishes original research in neuroscience with a particular emphasis on novel findings in neural development, regeneration, plasticity and transplantation. The journal has focused on research concerning basic mechanisms underlying neurological disorders.