Jeffrey A Shero, Alexis E Swanz, Allyson L Hanson, Sara A Hart, Jessica A R Logan
{"title":"教育和心理学研究中数据共享的数据去识别:重要性、障碍和技术。","authors":"Jeffrey A Shero, Alexis E Swanz, Allyson L Hanson, Sara A Hart, Jessica A R Logan","doi":"10.1177/23328584251352814","DOIUrl":null,"url":null,"abstract":"<p><p>In this manuscript, we discuss the importance of data sharing in educational and psychological research, emphasizing the historical context of data sharing, the current open science movement, and the so-called replication crisis. We additionally explore the barriers to data sharing, particularly the fear of incorrectly deidentifying data or accidentally including private information. We then highlight the importance of deidentifying data for data sharing. Finally, we present specific techniques for data deidentification, namely non-perturbative and perturbative methods, and make recommendations for which techniques are relevant for specific types of variables. To assist readers in implementing the material from this study, we have additionally created an interactive tutorial as a Shiny web application, which is publicly available and free to use.</p>","PeriodicalId":31132,"journal":{"name":"Aera Open","volume":"11 ","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12700633/pdf/","citationCount":"0","resultStr":"{\"title\":\"Data Deidentification for Data Sharing in Educational and Psychological Research: Importance, Barriers, and Techniques.\",\"authors\":\"Jeffrey A Shero, Alexis E Swanz, Allyson L Hanson, Sara A Hart, Jessica A R Logan\",\"doi\":\"10.1177/23328584251352814\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In this manuscript, we discuss the importance of data sharing in educational and psychological research, emphasizing the historical context of data sharing, the current open science movement, and the so-called replication crisis. We additionally explore the barriers to data sharing, particularly the fear of incorrectly deidentifying data or accidentally including private information. We then highlight the importance of deidentifying data for data sharing. Finally, we present specific techniques for data deidentification, namely non-perturbative and perturbative methods, and make recommendations for which techniques are relevant for specific types of variables. To assist readers in implementing the material from this study, we have additionally created an interactive tutorial as a Shiny web application, which is publicly available and free to use.</p>\",\"PeriodicalId\":31132,\"journal\":{\"name\":\"Aera Open\",\"volume\":\"11 \",\"pages\":\"\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12700633/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aera Open\",\"FirstCategoryId\":\"95\",\"ListUrlMain\":\"https://doi.org/10.1177/23328584251352814\",\"RegionNum\":3,\"RegionCategory\":\"教育学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/7/4 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"EDUCATION & EDUCATIONAL RESEARCH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aera Open","FirstCategoryId":"95","ListUrlMain":"https://doi.org/10.1177/23328584251352814","RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
Data Deidentification for Data Sharing in Educational and Psychological Research: Importance, Barriers, and Techniques.
In this manuscript, we discuss the importance of data sharing in educational and psychological research, emphasizing the historical context of data sharing, the current open science movement, and the so-called replication crisis. We additionally explore the barriers to data sharing, particularly the fear of incorrectly deidentifying data or accidentally including private information. We then highlight the importance of deidentifying data for data sharing. Finally, we present specific techniques for data deidentification, namely non-perturbative and perturbative methods, and make recommendations for which techniques are relevant for specific types of variables. To assist readers in implementing the material from this study, we have additionally created an interactive tutorial as a Shiny web application, which is publicly available and free to use.