Ronald Domi, Falko Noé, Peter Leary, Hubert Rehrauer
{"title":"GEO uploader: simplifying the data deposition in the GEO repository.","authors":"Ronald Domi, Falko Noé, Peter Leary, Hubert Rehrauer","doi":"10.1186/s12859-026-06466-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The Gene Expression Omnibus (GEO) (Clough and Barrett in: methods in molecular biology, Clifton, 2016) repository requires complex multistep submissions involving metadata preparation, FTP uploads, and MD5 validation. Current manual processes are error-prone, time-consuming, and require significant bioinformatics expertise, creating barriers for many researchers.</p><p><strong>Results: </strong>We present GEO Uploader, a web-based tool that automates the entire GEO submission workflow through an intuitive interface. The application reduces the submission initiation time from 2-3 h to under 20 s by automating file uploads, MD5 calculations, and metadata template population. Key features include parallel processing of uploads and checksum calculations, automated error prevention through template-based metadata completion, real-time progress tracking, and support for complex submission structures. Deployment across 30 + users with 50 + upload sessions, including datasets exceeding hundreds of gigabytes, demonstrates practical utility and reliability in research environments.</p><p><strong>Conclusion: </strong>GEO Uploader significantly reduces the technical barrier for GEO submissions while minimizing errors through comprehensive automation. The tool supports data sharing by enabling researchers without specialized bioinformatics expertise to complete submissions independently. Available as open-source software with multiuser deployment capabilities, GEO Uploader represents a substantial improvement in research data sharing accessibility and supports broader adoption of open science practices in the genomics community.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2026-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-026-06466-4","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The Gene Expression Omnibus (GEO) (Clough and Barrett in: methods in molecular biology, Clifton, 2016) repository requires complex multistep submissions involving metadata preparation, FTP uploads, and MD5 validation. Current manual processes are error-prone, time-consuming, and require significant bioinformatics expertise, creating barriers for many researchers.
Results: We present GEO Uploader, a web-based tool that automates the entire GEO submission workflow through an intuitive interface. The application reduces the submission initiation time from 2-3 h to under 20 s by automating file uploads, MD5 calculations, and metadata template population. Key features include parallel processing of uploads and checksum calculations, automated error prevention through template-based metadata completion, real-time progress tracking, and support for complex submission structures. Deployment across 30 + users with 50 + upload sessions, including datasets exceeding hundreds of gigabytes, demonstrates practical utility and reliability in research environments.
Conclusion: GEO Uploader significantly reduces the technical barrier for GEO submissions while minimizing errors through comprehensive automation. The tool supports data sharing by enabling researchers without specialized bioinformatics expertise to complete submissions independently. Available as open-source software with multiuser deployment capabilities, GEO Uploader represents a substantial improvement in research data sharing accessibility and supports broader adoption of open science practices in the genomics community.
期刊介绍:
BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology.
BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.