Julia M Kelliher, Leah Y D Johnson, Francisca E Rodriguez, Jaclyn K Saunders, Marie E Kroeger, Buck Hanson, Aaron J Robinson, Winston E Anthony, Marc W Van Goethem, Anders Kiledal, Ahmed A Shibl, Amanda Araujo Serrao de Andrade, Cassandra L Ettinger, Chhedi Lal Gupta, Chris R P Robinson, Cristal Zuniga, Daniel Sprockett, Douglas Terra Machado, Emilie J Skoog, Iyanu Oduwole, Jason A Rothman, Kaelan Prime, Katherine R Lane, Leandro Nascimento Lemos, Lisa Karstens, Mark McCauley, Mitiku Mihiret Seyoum, Moamen M Elmassry, Mustafa Guzel, Reid Longley, Simon Roux, Thomas M Pitot, Emiley A Eloe-Fadrosh
{"title":"A cost and community perspective on the barriers to microbiome data reuse.","authors":"Julia M Kelliher, Leah Y D Johnson, Francisca E Rodriguez, Jaclyn K Saunders, Marie E Kroeger, Buck Hanson, Aaron J Robinson, Winston E Anthony, Marc W Van Goethem, Anders Kiledal, Ahmed A Shibl, Amanda Araujo Serrao de Andrade, Cassandra L Ettinger, Chhedi Lal Gupta, Chris R P Robinson, Cristal Zuniga, Daniel Sprockett, Douglas Terra Machado, Emilie J Skoog, Iyanu Oduwole, Jason A Rothman, Kaelan Prime, Katherine R Lane, Leandro Nascimento Lemos, Lisa Karstens, Mark McCauley, Mitiku Mihiret Seyoum, Moamen M Elmassry, Mustafa Guzel, Reid Longley, Simon Roux, Thomas M Pitot, Emiley A Eloe-Fadrosh","doi":"10.3389/fbinf.2025.1585717","DOIUrl":null,"url":null,"abstract":"<p><p>Microbiome research is becoming a mature field with a wealth of data amassed from diverse ecosystems, yet the ability to fully leverage multi-omics data for reuse remains challenging. To provide a view into researchers' behavior and attitudes towards data reuse, we surveyed over 700 microbiome researchers to evaluate data sharing and reuse challenges. We found that many researchers are impeded by difficulties with metadata records, challenges with processing and bioinformatics, and problems with data repository submissions. We also explored the cost constraints of data reuse at each step of the data reuse process to better understand \"pain points\" and to provide a more quantitative perspective from sixteen active researchers. The bioinformatics and data processing step was estimated to be the most time consuming, which aligns with some of the most frequently reported challenges from the community survey. From these two approaches, we present evidence-based recommendations for how to address data sharing and reuse challenges with concrete actions for future work.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1585717"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12015674/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2025.1585717","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Microbiome research is becoming a mature field with a wealth of data amassed from diverse ecosystems, yet the ability to fully leverage multi-omics data for reuse remains challenging. To provide a view into researchers' behavior and attitudes towards data reuse, we surveyed over 700 microbiome researchers to evaluate data sharing and reuse challenges. We found that many researchers are impeded by difficulties with metadata records, challenges with processing and bioinformatics, and problems with data repository submissions. We also explored the cost constraints of data reuse at each step of the data reuse process to better understand "pain points" and to provide a more quantitative perspective from sixteen active researchers. The bioinformatics and data processing step was estimated to be the most time consuming, which aligns with some of the most frequently reported challenges from the community survey. From these two approaches, we present evidence-based recommendations for how to address data sharing and reuse challenges with concrete actions for future work.