Enabling FAIR data stewardship in complex international multi-site studies: Data Operations for the Accelerating Medicines Partnership® Schizophrenia Program.
Tashrif Billah, Kang Ik K Cho, Owen Borders, Yoonho Chung, Michaela Ennis, Grace R Jacobs, Einat Liebenthal, Daniel H Mathalon, Dheshan Mohandass, Spero C Nicholas, Ofer Pasternak, Nora Penzel, Habiballah Rahimi Eichi, Phillip Wolff, Alan Anticevic, Kristen Laulette, Angela R Nunez, Zailyn Tamayo, Kate Buccilli, Beau-Luke Colton, Dominic B Dwyer, Larry Hendricks, Hok Pan Yuen, Jessica Spark, Sophie Tod, Holly Carrington, Justine T Chen, Michael J Coleman, Cheryl M Corcoran, Anastasia Haidar, Omar John, Sinead Kelly, Patricia J Marcy, Priya Matneja, Alessia McGowan, Susan E Ray, Simone Veale, Inge Winter-Van Rossum, Jean Addington, Kelly A Allott, Monica E Calkins, Scott R Clark, Ruben C Gur, Michael P Harms, Diana O Perkins, Kosha Ruparel, William S Stone, John Torous, Alison R Yung, Eirini Zoupou, Paolo Fusar-Poli, Vijay A Mittal, Jai L Shah, Daniel H Wolf, Guillermo Cecchi, Tina Kapur, Marek Kubicki, Kathryn Eve Lewandowski, Carrie E Bearden, Patrick D McGorry, René S Kahn, John M Kane, Barnaby Nelson, Scott W Woods, Martha E Shenton, Justin T Baker, Sylvain Bouix
{"title":"Enabling FAIR data stewardship in complex international multi-site studies: Data Operations for the Accelerating Medicines Partnership® Schizophrenia Program.","authors":"Tashrif Billah, Kang Ik K Cho, Owen Borders, Yoonho Chung, Michaela Ennis, Grace R Jacobs, Einat Liebenthal, Daniel H Mathalon, Dheshan Mohandass, Spero C Nicholas, Ofer Pasternak, Nora Penzel, Habiballah Rahimi Eichi, Phillip Wolff, Alan Anticevic, Kristen Laulette, Angela R Nunez, Zailyn Tamayo, Kate Buccilli, Beau-Luke Colton, Dominic B Dwyer, Larry Hendricks, Hok Pan Yuen, Jessica Spark, Sophie Tod, Holly Carrington, Justine T Chen, Michael J Coleman, Cheryl M Corcoran, Anastasia Haidar, Omar John, Sinead Kelly, Patricia J Marcy, Priya Matneja, Alessia McGowan, Susan E Ray, Simone Veale, Inge Winter-Van Rossum, Jean Addington, Kelly A Allott, Monica E Calkins, Scott R Clark, Ruben C Gur, Michael P Harms, Diana O Perkins, Kosha Ruparel, William S Stone, John Torous, Alison R Yung, Eirini Zoupou, Paolo Fusar-Poli, Vijay A Mittal, Jai L Shah, Daniel H Wolf, Guillermo Cecchi, Tina Kapur, Marek Kubicki, Kathryn Eve Lewandowski, Carrie E Bearden, Patrick D McGorry, René S Kahn, John M Kane, Barnaby Nelson, Scott W Woods, Martha E Shenton, Justin T Baker, Sylvain Bouix","doi":"10.1038/s41537-025-00560-x","DOIUrl":null,"url":null,"abstract":"<p><p>Modern research management, particularly for publicly funded studies, assumes a data governance model in which grantees are considered stewards rather than owners of important data sets. Thus, there is an expectation that collected data are shared as widely as possible with the general research community. This presents problems in complex studies that involve sensitive health information. The latter requires balancing participant privacy with the needs of the research community. Here, we report on the data operation ecosystem crafted for the Accelerating Medicines Partnership® Schizophrenia project, an international observational study of young individuals at clinical high risk for developing a psychotic disorder. We review data capture systems, data dictionaries, organization principles, data flow, security, quality control protocols, data visualization, monitoring, and dissemination through the NIMH Data Archive platform. We focus on the interconnectedness of these steps, where our goal is to design a seamless data flow and an alignment with the FAIR (Findability, Accessibility, Interoperability, and Reusability) principles while balancing local regulatory and ethical considerations. This process-oriented approach leverages automated pipelines for data flow to enhance data quality, speed, and collaboration, underscoring the project's contribution to advancing research practices involving multisite studies of sensitive mental health conditions. An important feature is the data's close-to-real-time quality assessment (QA) and quality control (QC). The focus on close-to-real-time QA/QC makes it possible for a subject to redo a testing session, as well as facilitate course corrections to prevent repeating errors in future data acquisition. Watch Dr. Sylvain Bouix discuss his work and this article: https://vimeo.com/1025555648 .</p>","PeriodicalId":74758,"journal":{"name":"Schizophrenia (Heidelberg, Germany)","volume":"11 1","pages":"55"},"PeriodicalIF":3.0000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11968805/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Schizophrenia (Heidelberg, Germany)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s41537-025-00560-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0
Abstract
Modern research management, particularly for publicly funded studies, assumes a data governance model in which grantees are considered stewards rather than owners of important data sets. Thus, there is an expectation that collected data are shared as widely as possible with the general research community. This presents problems in complex studies that involve sensitive health information. The latter requires balancing participant privacy with the needs of the research community. Here, we report on the data operation ecosystem crafted for the Accelerating Medicines Partnership® Schizophrenia project, an international observational study of young individuals at clinical high risk for developing a psychotic disorder. We review data capture systems, data dictionaries, organization principles, data flow, security, quality control protocols, data visualization, monitoring, and dissemination through the NIMH Data Archive platform. We focus on the interconnectedness of these steps, where our goal is to design a seamless data flow and an alignment with the FAIR (Findability, Accessibility, Interoperability, and Reusability) principles while balancing local regulatory and ethical considerations. This process-oriented approach leverages automated pipelines for data flow to enhance data quality, speed, and collaboration, underscoring the project's contribution to advancing research practices involving multisite studies of sensitive mental health conditions. An important feature is the data's close-to-real-time quality assessment (QA) and quality control (QC). The focus on close-to-real-time QA/QC makes it possible for a subject to redo a testing session, as well as facilitate course corrections to prevent repeating errors in future data acquisition. Watch Dr. Sylvain Bouix discuss his work and this article: https://vimeo.com/1025555648 .