Scientific DataPub Date : 2025-10-09DOI: 10.1038/s41597-025-05920-x
Fengwei Hung, Davide Danilo Chiarelli, James S Famiglietti, Marc F Müller
{"title":"Downscaled global 60-meter resolution estimates of irrigation water sources (2000-2015).","authors":"Fengwei Hung, Davide Danilo Chiarelli, James S Famiglietti, Marc F Müller","doi":"10.1038/s41597-025-05920-x","DOIUrl":"https://doi.org/10.1038/s41597-025-05920-x","url":null,"abstract":"<p><p>This dataset provides high-resolution (60 m) global irrigation maps to support water resource and agricultural management. It identifies the likely irrigation status (rainfed or irrigated) and water source (groundwater or surface water) of croplands for 2000, 2005, 2010, and 2015. We downscaled a 10-km irrigation dataset derived from national and subnational statistics (GMIA) using (i) spatial patterns between high-resolution (30 m) cropland and nearby surface water, and (ii) irrigation water requirements from a global crop model. Validation used household agriculture surveys in India (N = 8,355) and a U.S. well database (N = 1,505,371). In the U.S., our method achieved 85% accuracy in distinguishing groundwater use within 2 km of wells - substantially higher than GMIA (25%). In India's groundwater-dominated regions, our estimates performed comparably to GMIA (73% vs. 72%). These results suggest our dataset offers a more accurate and spatially detailed representation of irrigation water sources, enabling improved analysis of agricultural water use.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1632"},"PeriodicalIF":6.9,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145259043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-09DOI: 10.1038/s41597-025-06071-9
Panagiota Fragkou, Ioannis Martakos, Georgia Rouni, Demetrios Vasilakos, Evangelos Koutsoukos, Alesssio Saviane, Silvia Cappellozza, Nikolaos S Thomaidis, Marios G Kostakis, Martina Samiotaki, Sotiris Kotsiantis, Mariana Barcenas, Skarlatos G Dedos
{"title":"Multiomics analysis of the Silkworm cocoon shell.","authors":"Panagiota Fragkou, Ioannis Martakos, Georgia Rouni, Demetrios Vasilakos, Evangelos Koutsoukos, Alesssio Saviane, Silvia Cappellozza, Nikolaos S Thomaidis, Marios G Kostakis, Martina Samiotaki, Sotiris Kotsiantis, Mariana Barcenas, Skarlatos G Dedos","doi":"10.1038/s41597-025-06071-9","DOIUrl":"https://doi.org/10.1038/s41597-025-06071-9","url":null,"abstract":"<p><p>In this study we combined phenomics, proteomics, metabolomics and lipidomics analyses of historical and contemporary Bombyx mori cocoon shells to case-study the human-driven introduction and diversification of this species in Europe. Prompted by recent findings on the genomic variability that underlies the ancestry and cocoon shell colour of this species, we carried out optical and fluorescence imaging analysis of 148 cocoons shells to identify overt and covert phenotypic traits and employed LC-MS/MS analyses protocols for 80 cocoon shell samples to identify that the cocoon shell of this species contains on average 98 ± 13 (Mean ± SD) proteins, while we identified 141 metabolites and 981 lipids. We validated these generated datasets through multiple validation protocols, through a series of dimensionality reduction methods and clustering algorithms and through narratives from historical archives and manuscripts. Our multiomics datasets provide a valuable foundation for advancing further exploitations of silkworm cocoon shells in multiple scientific perspectives.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1630"},"PeriodicalIF":6.9,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145259034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-09DOI: 10.1038/s41597-025-05915-8
Lin Chen, Ziyi Huang, Xiaoli Liu, Guodong He, Binbin Ren, Fan Yang, Fengfeng Jiang, Qi Tu, Dandan Cai, Kai Zhang, Zhongheng Zhang, Gensheng Zhang, Minfeng Tong
{"title":"JinhuaNSICU, an open accessible Neurosurgical Intensive Care Database.","authors":"Lin Chen, Ziyi Huang, Xiaoli Liu, Guodong He, Binbin Ren, Fan Yang, Fengfeng Jiang, Qi Tu, Dandan Cai, Kai Zhang, Zhongheng Zhang, Gensheng Zhang, Minfeng Tong","doi":"10.1038/s41597-025-05915-8","DOIUrl":"https://doi.org/10.1038/s41597-025-05915-8","url":null,"abstract":"<p><p>The Jinhua Central Hospital Neurosurgical Intensive Care Unit Database (JinhuaNSICU) is a single-center, bilingual (Chinese and English), and open public database focused on neurosurgery patients. This database has been filtered, cleaned, and de-identified, recording a variety of rich clinical information throughout the entire hospital stay of patients admitted to the NSICU. It contains 8,773 individuals with 20,530 episodes' demographics, laboratory tests, vital signs, microbiology culture, examinations with reports, medications with extra special chinese herbal, orders, operations, and surgery with anesthesia. More than 26% of patients have records of more than three hospitalizations. Creating this neurosurgery database will bridge the existing gap in neurosurgical data and inspire more researchers to participate in neurosurgical research, which will further promote the follow-up studies, development of clinical decision support tools and sharing of data for international observation analysis.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1633"},"PeriodicalIF":6.9,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145259050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comprehensive images and serum biomarkers for biliary atresia and other cholestasis in pediatrics.","authors":"Shuyi Liu, Rui Zhang, Yongshang Yu, Chunyixue Geng, Hongbin Ma, Yuyun Liu, Wenhua Qin, Yue Zhang, Qiqiao Zhang, Wenjing Gao","doi":"10.1038/s41597-025-05914-9","DOIUrl":"https://doi.org/10.1038/s41597-025-05914-9","url":null,"abstract":"<p><p>Biliary atresia (BA) and other pediatric cholestatic diseases are rare but serious conditions that require early and accurate diagnosis. Ultrasound has become the primary diagnostic method for BA, particularly for evaluating gallbladder contractility through fasting and postprandial comparisons. However, there is currently a lack of publicly available datasets containing paired fasting-postprandial gallbladder ultrasound images, other key sonographic features, and relevant serum biochemical indicators. The dataset consisted of 2,759 ultrasound images obtained from 1,019 infants diagnosed with cholestatic diseases. This included 377 BA cases (664 fasting and 328 post-prandial images) and 642 non-BA cases (1,004 fasting and 580 post-prandial images). Additionally, 183 images documenting abnormal ultrasonographic findings such as triangular cord sign, dilated hepatic artery, and hilar hepatic cysts, and relevant hepatic biochemical indicators were also collected. The study implemented nnU-Net to perform automated gallbladder segmentation across both fasting and postprandial ultrasound examinations. The dataset represents a valuable resource for enhancing the diagnostic accuracy of BA and facilitating the development of clinical decision-support system.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1634"},"PeriodicalIF":6.9,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145259083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Aerodynamic dataset for selected doubly curved membrane canopy structures.","authors":"Anoop Kodakkal, Kimberly Adamek, Ann-Kathrin Goldbach, Tibebu Birhane, Rodrigo Castedo-Hernandez, Guillermo Martínez-López, Máté Péntek, Kai-Uwe Bletzinger, Roland Wüchner, Girma Bitsuamlak","doi":"10.1038/s41597-025-06046-w","DOIUrl":"https://doi.org/10.1038/s41597-025-06046-w","url":null,"abstract":"<p><p>A dataset of aerodynamic measurements is collected for doubly curved membrane structures as part of a comprehensive experimental testing campaign on wind effects on structural membranes conducted at the WindEEE Dome at Western University, Canada. Common doubly curved membrane geometries - the hypar, ridge valley, arch supported, cone, and umbrella - were tested in isolated instances. The cone geometry was also tested in both a 1 × 3 row and a 3 × 3 group arrangement. All models were tested at a 1:25 scale under atmospheric boundary layer (ABL) flow at angles of attack ranging from 0° to 180° in 10° increments (and 45°, and 135° -depending on the line of symmetry). In addition to ABL, the hypar geometry was subjected to two other distinct flow scenarios: tornado, and downburst. Pressure time series at various tap locations are included in the data. In total, approximately 425 tests were conducted, providing a comprehensive dataset on the aerodynamic behavior of doubly curved structures under wind loads. This experimental data set offers valuable insights for the design and analysis of such structures in architectural and engineering applications and future design guideline developments. Data is available on an open-source dataset Zenodo as part of the ERIES-WENSS project.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1627"},"PeriodicalIF":6.9,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145252436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-08DOI: 10.1038/s41597-025-05979-6
Maksim Kukushkin, Martin Bogdan, Simon Goertz, Jan-Ole Callsen, Eric Oldenburg, Matthias Enders, Thomas Schmid
{"title":"A bimodal image dataset for seed classification from the visible and near-infrared spectrum.","authors":"Maksim Kukushkin, Martin Bogdan, Simon Goertz, Jan-Ole Callsen, Eric Oldenburg, Matthias Enders, Thomas Schmid","doi":"10.1038/s41597-025-05979-6","DOIUrl":"https://doi.org/10.1038/s41597-025-05979-6","url":null,"abstract":"<p><p>The success of deep learning in image classification has been largely underpinned by large-scale datasets, such as ImageNet, which have significantly advanced multi-class classification for RGB and grayscale images. However, datasets that capture spectral information beyond the visible spectrum remain scarce, despite their high potential, especially in agriculture, medicine and remote sensing. To address this gap in the agricultural domain, we present a thoroughly curated bimodal seed image dataset comprising paired RGB and hyperspectral images for 10 plant species, making it one of the largest bimodal seed datasets available. We describe the methodology for data collection and preprocessing and benchmark several deep learning models on the dataset to evaluate their multi-class classification performance. By contributing a high-quality dataset, our manuscript offers a valuable resource for studying spectral, spatial and morphological properties of seeds, thereby opening new avenues for research and applications.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1629"},"PeriodicalIF":6.9,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145252472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-08DOI: 10.1038/s41597-025-05916-7
Jing Fang, Mei Wu, Xinyi Zhang, Yang Chen, Lingjuan Tou, Jinjing Xu, Xiangjun Kong, Yingxiong Qiu
{"title":"Chromosomal-level genome assembly of the autotetraploid Anoectochilus roxburghii (Jinxianlian, Orchidaceae).","authors":"Jing Fang, Mei Wu, Xinyi Zhang, Yang Chen, Lingjuan Tou, Jinjing Xu, Xiangjun Kong, Yingxiong Qiu","doi":"10.1038/s41597-025-05916-7","DOIUrl":"https://doi.org/10.1038/s41597-025-05916-7","url":null,"abstract":"<p><p>Anoectochilus roxburghii (Orchidaceae), commonly known as Jinxianlian, is a highly valued traditional Chinese herbal medicine. Here, we present a high-quality chromosome-level genome assembly of the cultivar 'Jinkang No.1'. Genome survey analyses suggest that the cultivar is an autotetraploid. The assembled genome spans 5.17 Gb, with a contig N50 of 24.90 Mb, and 93.4% (4.82 Gb) is anchored onto 80 pseudo-chromosomes across 20 homologous groups. Annotation of the genome assembly identifies 76.42% repetitive elements and 88,106 protein-coding genes, with 94.6% of these genes functionally annotated. Ks analysis of collinear gene pairs indicates a recent species-specific polyploidization event, likely resulting in autotetraploidization. This reference genome provides a valuable resource for functional genomics, evolutionary biology, and molecular breeding studies of A. roxburghii and its related species.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1623"},"PeriodicalIF":6.9,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145252408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-08DOI: 10.1038/s41597-025-06052-y
Boris Thome, Friederike Hertweck, Serife Yasar, Lukas Jonas, Stefan Conrad
{"title":"A dataset of study program availability in German higher education between 1971 and 1996.","authors":"Boris Thome, Friederike Hertweck, Serife Yasar, Lukas Jonas, Stefan Conrad","doi":"10.1038/s41597-025-06052-y","DOIUrl":"https://doi.org/10.1038/s41597-025-06052-y","url":null,"abstract":"<p><p>Educational systems are dynamic. They shape human capital, technological and societal progress, and also economic growth. Higher education, in particular, fosters innovation, with varying fields of study contributing differently to this process. Yet, despite its importance, no dataset has previously documented the evolution of academic fields across higher education institutions in a specific country. Addressing this gap, we present the RWI-UNI-SUBJECTS<sup>1</sup> dataset, the first extensive collection of study opportunities across German higher education institutions between 1971 and 1996. The dataset originates from annual study guides by the German Federal Employment Agency for high school students. To extract the data, a custom-developed computer vision algorithm was used. We further enriched the dataset with administrative codes for fields, institutions, and districts, enabling seamless integration with additional datasets, such as social security data, official student statistics, or the National Educational Panel Study (NEPS). Covering a total of 105,307 study programs between 1971 and 1996, RWI-UNI-SUBJECTS<sup>1</sup> offers a valuable foundation for interdisciplinary research on education, innovation, and economic development.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1626"},"PeriodicalIF":6.9,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145252445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-08DOI: 10.1038/s41597-025-05912-x
Bjarne Steffen, Florian Egli, Anurag Gumber, Mak Ðukan, Paul Waidelich
{"title":"A global dataset of the cost of capital for renewable energy projects.","authors":"Bjarne Steffen, Florian Egli, Anurag Gumber, Mak Ðukan, Paul Waidelich","doi":"10.1038/s41597-025-05912-x","DOIUrl":"https://doi.org/10.1038/s41597-025-05912-x","url":null,"abstract":"<p><p>The cost of capital (CoC) critically influences the levelized cost of renewable energy and, by extension, the global low-carbon transition. However, reliable and consistent CoC data remain scarce, limiting an appropriate reflection of CoC differences in energy system and integrated assessment models. We present a global dataset of CoC for renewable energy projects, covering 68 countries from 2010 to 2022 and focusing on three key technologies: utility-scale solar photovoltaics, onshore wind, and offshore wind. We systematically compile and standardize data from academic literature and international organizations, ensuring methodological comparability. Our dataset includes 1,429 data points, of which 366 provide nominal, after-tax weighted average cost of capital values. We conduct technical validation through cross-technology comparisons, temporal consistency checks, and source triangulation. By addressing a key data gap, this dataset aims to support evidence-based energy policy analysis and advance the understanding of how financing conditions impact renewable energy costs globally.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1624"},"PeriodicalIF":6.9,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145252423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}