Zack GoldblumUniversity of Pennsylvania, Zhongchuan XuUniversity of Pennsylvania, Haoer ShiUniversity of Pennsylvania, Patryk OrzechowskiUniversity of PennsylvaniaAGH University of Krakow, Jamaal SpenceUniversity of Pennsylvania, Kathryn A DavisUniversity of Pennsylvania, Brian LittUniversity of Pennsylvania, Nishant SinhaUniversity of Pennsylvania, Joost WagenaarUniversity of Pennsylvania
{"title":"Pennsieve - A Collaborative Platform for Translational Neuroscience and Beyond","authors":"Zack GoldblumUniversity of Pennsylvania, Zhongchuan XuUniversity of Pennsylvania, Haoer ShiUniversity of Pennsylvania, Patryk OrzechowskiUniversity of PennsylvaniaAGH University of Krakow, Jamaal SpenceUniversity of Pennsylvania, Kathryn A DavisUniversity of Pennsylvania, Brian LittUniversity of Pennsylvania, Nishant SinhaUniversity of Pennsylvania, Joost WagenaarUniversity of Pennsylvania","doi":"arxiv-2409.10509","DOIUrl":null,"url":null,"abstract":"The exponential growth of neuroscientific data necessitates platforms that\nfacilitate data management and multidisciplinary collaboration. In this paper,\nwe introduce Pennsieve - an open-source, cloud-based scientific data management\nplatform built to meet these needs. Pennsieve supports complex multimodal\ndatasets and provides tools for data visualization and analyses. It takes a\ncomprehensive approach to data integration, enabling researchers to define\ncustom metadata schemas and utilize advanced tools to filter and query their\ndata. Pennsieve's modular architecture allows external applications to extend\nits capabilities, and collaborative workspaces with peer-reviewed data\npublishing mechanisms promote high-quality datasets optimized for downstream\nanalysis, both in the cloud and on-premises. Pennsieve forms the core for major neuroscience research programs including\nthe NIH SPARC Initiative, NIH HEAL Initiative's PRECISION Human Pain Network,\nand NIH HEAL RE-JOIN Initiative. It serves more than 80 research groups\nworldwide, along with several large-scale, inter-institutional projects at\nclinical sites through the University of Pennsylvania. Underpinning the\nSPARC.Science, Epilepsy.Science, and Pennsieve Discover portals, Pennsieve\nstores over 125 TB of scientific data, with 35 TB of data publicly available\nacross more than 350 high-impact datasets. It adheres to the findable,\naccessible, interoperable, and reusable (FAIR) principles of data sharing and\nis recognized as one of the NIH-approved Data Repositories. By facilitating\nscientific data management, discovery, and analysis, Pennsieve fosters a robust\nand collaborative research ecosystem for neuroscience and beyond.","PeriodicalId":501168,"journal":{"name":"arXiv - CS - Emerging Technologies","volume":"18 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Emerging Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The exponential growth of neuroscientific data necessitates platforms that
facilitate data management and multidisciplinary collaboration. In this paper,
we introduce Pennsieve - an open-source, cloud-based scientific data management
platform built to meet these needs. Pennsieve supports complex multimodal
datasets and provides tools for data visualization and analyses. It takes a
comprehensive approach to data integration, enabling researchers to define
custom metadata schemas and utilize advanced tools to filter and query their
data. Pennsieve's modular architecture allows external applications to extend
its capabilities, and collaborative workspaces with peer-reviewed data
publishing mechanisms promote high-quality datasets optimized for downstream
analysis, both in the cloud and on-premises. Pennsieve forms the core for major neuroscience research programs including
the NIH SPARC Initiative, NIH HEAL Initiative's PRECISION Human Pain Network,
and NIH HEAL RE-JOIN Initiative. It serves more than 80 research groups
worldwide, along with several large-scale, inter-institutional projects at
clinical sites through the University of Pennsylvania. Underpinning the
SPARC.Science, Epilepsy.Science, and Pennsieve Discover portals, Pennsieve
stores over 125 TB of scientific data, with 35 TB of data publicly available
across more than 350 high-impact datasets. It adheres to the findable,
accessible, interoperable, and reusable (FAIR) principles of data sharing and
is recognized as one of the NIH-approved Data Repositories. By facilitating
scientific data management, discovery, and analysis, Pennsieve fosters a robust
and collaborative research ecosystem for neuroscience and beyond.