{"title":"A benchmark dataset for class-wise segmentation of construction and demolition waste in cluttered environments.","authors":"Diani Sirimewan, Sanuwani Dayarathna, Sudharshan Raman, Yu Bai, Mehrdad Arashpour","doi":"10.1038/s41597-025-05243-x","DOIUrl":"https://doi.org/10.1038/s41597-025-05243-x","url":null,"abstract":"<p><p>Efficient management of construction and demolition waste (CDW) is essential for enhancing resource recovery. The lack of publicly available, high-quality datasets for waste recognition limits the development and adoption of automated waste handling solutions. To facilitate data sharing and reuse, this study introduces 'CDW-Seg', a benchmark dataset for class-wise segmentation of CDW. The dataset comprises high-resolution images captured at authentic construction sites, featuring skip bins filled with a diverse mixture of CDW materials in-the-wild. It includes 5,413 manually annotated objects across ten categories: concrete, fill dirt, timber, hard plastic, soft plastic, steel, fabric, cardboard, plasterboard, and the skip bin, representing a total of 2,492,021,189 pixels. Each object was meticulously annotated through semantic segmentation, providing reliable ground-truth labels. To demonstrate the applicability of the dataset, an adapter-based fine-tuning approach was implemented using a hierarchical Vision Transformer, ensuring computational efficiency suitable for deployment in automated waste handling scenarios. The CDW-Seg has been made publicly accessible to promote data sharing, facilitate further research, and support the development of automated solutions for resource recovery.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"885"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-05-28DOI: 10.1038/s41597-025-05097-3
Cameron Bracken, Youngjun Son, Daniel Broman, Nathalie Voisin
{"title":"GODEEEP-hydro: Historical and projected power system ready hydropower data for the United States.","authors":"Cameron Bracken, Youngjun Son, Daniel Broman, Nathalie Voisin","doi":"10.1038/s41597-025-05097-3","DOIUrl":"https://doi.org/10.1038/s41597-025-05097-3","url":null,"abstract":"<p><p>Hydropower is a critical electricity resource in the United States which, in addition to low-cost electricity generation, provides valuable ancillary grid services, and supports the integration of nondispatchable weather-dependent resources (e.g., wind and solar). Despite its value to the grid, there are very few comprehensive datasets available from which to study both historical and future impacts of climate, weather driven energy droughts, and integration of other weather driven generation. In this paper, we present a hydropower generation dataset covering 1,452 hydroelectric plants in the contiguous U.S. The dataset contains monthly and weekly hydropower generation estimates for both historical (1982-2019) and future (2020-2099) periods which includes 4 future climate scenarios. In addition, this dataset provides weekly and monthly constraints such as minimum and maximum power which are particularly useful in power system models which are used to study grid reliability, transmission planning and capacity expansion.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"875"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-05-28DOI: 10.1038/s41597-025-05208-0
Alexander Arkowitz, Scott M Ritter, Matthew P Thompson, Jesse D Young, Brad Pietruszka, David E Calkin
{"title":"Quality assured spatial dataset of wildfire containment firelines and engagement outcomes 2017 to 2024.","authors":"Alexander Arkowitz, Scott M Ritter, Matthew P Thompson, Jesse D Young, Brad Pietruszka, David E Calkin","doi":"10.1038/s41597-025-05208-0","DOIUrl":"https://doi.org/10.1038/s41597-025-05208-0","url":null,"abstract":"<p><p>The escalation of wildfires in the USA, coupled with rising firefighting costs and decreasing workforce capacity, underscores the critical need to evaluate the efficiency and effectiveness of containment measures. However, the existing spatial data that records the locations and types of containment measures and wildfire perimeters contains numerous errors and redundancies. This paper presents a comprehensive fireline Quality Assurance and Quality Control dataset developed from the wildland firefighting operations data reported in the National Interagency Fire Center National Incident Feature Service. This improved dataset contains reliable spatial locations for fireline built during suppression operations, the associated verified fire perimeters, and identifies where containment was success or failure for fires greater than 1000 acres from 2017-2024. The improved final dataset represents critical information that was previously unavailable for assessing the success of fireline operations and incident management resource-use efficiency. The lessons learned from analyses utilizing this dataset are critical for improving the efficiency and effectiveness of the United States wildfire management system.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"897"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-05-28DOI: 10.1038/s41597-025-05234-y
Brian C Weeks, Zhizhuo Zhou, Charlotte M Probst, Jacob S Berv, Bruce O'Brien, Brett W Benz, Heather R Skeen, Mark Ziebell, Louise Bodt, David F Fouhey
{"title":"Skeletal trait measurements for thousands of bird species.","authors":"Brian C Weeks, Zhizhuo Zhou, Charlotte M Probst, Jacob S Berv, Bruce O'Brien, Brett W Benz, Heather R Skeen, Mark Ziebell, Louise Bodt, David F Fouhey","doi":"10.1038/s41597-025-05234-y","DOIUrl":"https://doi.org/10.1038/s41597-025-05234-y","url":null,"abstract":"<p><p>Large comparative datasets of avian functional traits have been used to address a wide range of questions in ecology and evolution. To date, this work has been constrained by the limited availability of skeletal trait datasets that include extensive inter- and intra-specific sampling. We use computer vision to identify and measure bones from photographs of museum skeletal specimens to assemble an extensive dataset of functionally important skeletal elements in birds. The dataset spans 2,057 species of birds (Aves: Passeriformes) and includes measurements of 12 skeletal elements from 14,419 individuals. In addition to the trait values directly measured from photographs, we leverage the multi-dimensional nature of our dataset and known phylogenetic relationships of the species to impute missing data under an evolutionary model. To facilitate use of the dataset, the taxonomy has been reconciled with an existing comprehensive avian phylogeny and an additional dataset of external functional traits for all birds.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"884"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-05-28DOI: 10.1038/s41597-025-05152-z
Sangjun Park, Dejiang Zheng, Uichin Lee
{"title":"A PPG Signal Dataset Collected in Semi-Naturalistic Settings Using Galaxy Watch.","authors":"Sangjun Park, Dejiang Zheng, Uichin Lee","doi":"10.1038/s41597-025-05152-z","DOIUrl":"https://doi.org/10.1038/s41597-025-05152-z","url":null,"abstract":"<p><p>The widespread adoption of consumer-grade wearable devices, such as Galaxy Watch, has revolutionized personal health monitoring as they enable continuous and non-invasive measurement of key cardiovascular indicators through photoplethysmography (PPG) sensors. However, existing datasets primarily rely on research-grade devices, limiting the applicability of consumer-grade wearables in real-world conditions. To address this gap, this study presents GalaxyPPG, a dataset collected from 24 participants that includes wrist-worn PPG signals from a Galaxy Watch 5 and an Empatica E4, alongside chest-worn ECG data from a Polar H10. Data were captured during diverse activities in a semi-naturalistic setting, providing insights into the sensing performance of consumer-grade wearables under motion- or stress-inducing activities. This dataset is designed to advance applications of PPG signals, such as HR tracking with diverse physical activities and HRV monitoring for stress detection. Additionally, we offer an open-source toolkit for data collection and analysis using Samsung Galaxy Watch, fostering reproducibility and further research leveraging this toolkit.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"892"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Transcriptome analysis of testes gene expression to explore genetic diversity of Mangalica and Camborough boars.","authors":"Keigo Yamane, Sangwoo Kim, Akari Koide, Erina Yoneda, Mitsunori Kayano, Yuki Muranishi","doi":"10.1038/s41597-025-05196-1","DOIUrl":"https://doi.org/10.1038/s41597-025-05196-1","url":null,"abstract":"<p><p>The testis, an important reproductive organ, is involved in spermatogenesis and steroid hormone secretion and has been the subject of a wide variety of studies. Pigs are often used as model animals for studies on human physiology and disease, and studies on the testicular development of pigs could shed light on human reproduction. Mangalica, an indigenous Hungarian pig breed, has reproductive traits that are different from those of commercial pig breeds. This specificity could reveal important differences in the cascades and reproductive genes between humans and other animals. In this study, we conducted RNA-sequencing analysis of the testes of 14 days old Mangalica and Camborough boars. We also performed clustering and pathway analysis of differentially expressed genes. These datasets and analyses are expected to provide important gene sets for pig testis development that can be applied in future studies on human reproductive mechanisms.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"888"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Chromosome-level genome assembly of scalloped spiny lobster Panulirus homarus homarus.","authors":"Dongfang Sun, Jianjian Lv, Baoquan Gao, Shaoting Jia, Ping Liu, Jian Li, Jitao Li, Xianyun Ren","doi":"10.1038/s41597-025-05253-9","DOIUrl":"https://doi.org/10.1038/s41597-025-05253-9","url":null,"abstract":"<p><p>Lobsters, aquatic organisms of significant economic value, hold an important position in the global aquaculture and fisheries industries. However, due to overfishing and ecological change, the populations of certain lobster species have declined dramatically, prompting conservation efforts in various countries. However, limited genomics research has restricted our capacity to conserve and exploit lobster germplasm resources. Here, we present a chromosome-level reference genome for Panulirus homarus homarus constructed using PacBio long-read sequencing and Hi-C data. The genome assembly size was 2.61 Gb, with a contig N50 of 5.43 Mb, and a scaffold N50 of 36.69 Mb. The assembled sequences were anchored to 73 chromosomes, covering 96.05% of the total genome. A total of 25,580 protein-coding genes were predicted, and 99.98% of the genes were functionally annotated using various protein databases. The high-quality genome assembly provides a valuable resource for studying the biology and evolutionary history of P. h. homarus, and could facilitate sustainable resource management, aquaculture, and conservation of the species.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"900"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Chromosome-level genome assembly of Sambus kanssuensis (Coleoptera: Buprestidae).","authors":"Zhonghua Wei, Yunchun Li, Yingying Li, Jiuzhou Liu, Shuangmei Ding, Xulong Chen, Aimin Shi, Ding Yang","doi":"10.1038/s41597-025-05271-7","DOIUrl":"https://doi.org/10.1038/s41597-025-05271-7","url":null,"abstract":"<p><p>Sambus kanssuensis Ganglbauer, 1890 (Coleoptera: Buprestidae), distributed in Gansu and Sichuan Provinces of China, is a phytophagous pest that feeds on the toxic plant Buddleja. However, the genomic resources of this beetle remain unknown, which impedes the understanding of its ecological adaptations. Consequently, this study presents a complete, well-assembled, and annotated genome of S. kanssuensis. The assembled results indicate a genome size of 312.42 Mb, comprising 206 scaffolds, with an N50 of 34.04 Mb; 98.68% of the assembly sequences were anchored to 11 chromosomes, including one sex chromosome. The genome contains 12,723 protein-coding genes, of which 11,977 have been annotated. BUSCO analysis revealed that the completeness of the chromosome-level genome is 97.9%. This chromosome-level genome provides valuable data for further investigations into detoxification mechanisms, ecological adaptations, population genetics, and the evolution of Buprestidae.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"895"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-05-28DOI: 10.1038/s41597-025-05164-9
Xinzhuo Zhao, Jintu Xu, Junjie Yang, Jin Duan
{"title":"A global urban road network self-adaptive simplification workflow from traffic to spatial representation.","authors":"Xinzhuo Zhao, Jintu Xu, Junjie Yang, Jin Duan","doi":"10.1038/s41597-025-05164-9","DOIUrl":"https://doi.org/10.1038/s41597-025-05164-9","url":null,"abstract":"<p><p>Urban road network is crucial for understanding and revealing the spatial logic of urban organization and evolution. However, existing urban road network datasets like OpenStreetMap are designed for traffic studies, treating each lane as a distinct spatial unit of mobility, which may not align with urban studies considering each road as an integration space for social and cultural dynamics. This study established a novel workflow to self-adaptively transform the global urban road network from traffic representation to spatial representation and provides simplified urban road network data of 35 globally representative cities. Our workflow, comprising six critical stages, is anchored on the segment divergence from their surroundings to guide aggregation decisions, effectively mitigating the risks of over-aggregation and under-aggregation against the diversity of global urban backgrounds. This workflow significantly reduces the duplicated segments of roads from an average of 31.2% to 3.6% in total, performing consistently across diverse countries and continents. This dataset is expected to become a robust data layer for urban socio-economic modelling and GeoAI development.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"883"},"PeriodicalIF":5.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144174740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}