Scientific Data, Pub Date: 2024-12-19, DOI: 10.1038/s41597-024-04266-0
Xuanguang Liang, Wenhao Wang, Junrou Huang, Mingfei Luo, Nima Wangdui, Caiyun Sun, Jianguo Lu
{"title":"A chromosome-level genome assembly of big-barbel schizothorcin, Schizothorax macropogon.","authors":"Xuanguang Liang, Wenhao Wang, Junrou Huang, Mingfei Luo, Nima Wangdui, Caiyun Sun, Jianguo Lu","doi":"10.1038/s41597-024-04266-0","DOIUrl":"https://doi.org/10.1038/s41597-024-04266-0","url":null,"abstract":"<p><p>Big-barbel schizothorcin (Schizothorax macropogon), a vulnerable species endemic to the mid-reaches of the Yarlung Zangbo River, epitomizes survival in harsh conditions yet suffers significant population contractions due to human activities. This species was the subject of our study in which we leveraged PacBio, MGI-Seq, and Hi-C data to assemble a chromosome-scale genome. This assembly comprises 25 pseudo-chromosomes, yielding a genome size of 1.42 Gb with a scaffold N50 length of 59.4 Mb, indicative of a highly contiguous assembly. A BUSCO assessment ascertained the comprehensiveness of the genome at 97.9%. Annotation efforts identified 46,246 putative protein-coding genes, with 49.61% of the assembled genome annotated as repetitive sequences. This genome assembly is instrumental for advancing conservation of the giant whiskered schizothoracines and related species, and for illuminating the evolution and ecology of schizothoracine fishes in the Qinghai-Tibet Plateau.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1402"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific Data, Pub Date: 2024-12-19, DOI: 10.1038/s41597-024-04178-z
Alejandro Garcia-Moya, Carlos Manuel Alonso-Hernández, Ricardo Sánchez-Murillo, Yasser Morera-Gómez, Minerva Sánchez-Llull, Oscar Díaz Rizo, Osvaldo Cuesta Santos, Rosemery López Lee, Osvaldo Brígido Flores, Enma Odalys Ramos Viltre, Lucia Ortega
{"title":"Spatiotemporal characterization of the isotopic composition of meteoric waters in Cuba.","authors":"Alejandro Garcia-Moya, Carlos Manuel Alonso-Hernández, Ricardo Sánchez-Murillo, Yasser Morera-Gómez, Minerva Sánchez-Llull, Oscar Díaz Rizo, Osvaldo Cuesta Santos, Rosemery López Lee, Osvaldo Brígido Flores, Enma Odalys Ramos Viltre, Lucia Ortega","doi":"10.1038/s41597-024-04178-z","DOIUrl":"https://doi.org/10.1038/s41597-024-04178-z","url":null,"abstract":"<p><p>The stable isotope composition of meteoric water has been widely used to understand hydrological processes worldwide. We present a unique dataset, with the isotopic composition (δ<sup>18</sup>O and δ<sup>2</sup>H) of meteoric waters, derived from a nationwide study in Cuba. It includes monthly composite and event-based precipitations, from January 2017 to December 2021 (N = 526 and N = 111 respectively). Monthly data showed minor seasonal trends (dry vs. rainy), with a notable influence of tropical cyclones. Event-based data demonstrated that precipitation associated with tropical cyclones exhibited lower isotopic compositions. The analysis of potential factors influencing the isotopic composition of precipitation showed a minor influence of the rainfall amount, but negligible influence of factors such as relative humidity, elevation, and air temperature. This data set can be used as a tool not only to understand hydrological processes at the country scale, but also to further improve and develop isotope-enabled modelling for assessing water balances and fluxes, understanding the impact of extreme events, and paleoreconstruction in the Intra-Americas Sea.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1398"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Chromosome-level genome assembly and annotation of a sea toad (Chaunax sp.).","authors":"Zifeng Zhan, Yanshuo Liang, Jiehong Wei, Jing Liu, Kuidong Xu","doi":"10.1038/s41597-024-04245-5","DOIUrl":"https://doi.org/10.1038/s41597-024-04245-5","url":null,"abstract":"<p><p>The sea toad genus Chaunax is a group of small benthic fishes that predominantly inhabit the deep seas of the Atlantic, Indian, and Pacific Oceans. Although they have the potential to make excellent systems for studies of evolutionary adaptation to deep-sea environments, genomic research on Chaunax has been hindered by a scarcity of high-quality genomic resources. We present a chromosome-scale genome assembly of a Chaunax specimen generated using PacBio long-read sequencing and high-throughput chromosome conformation capture technology. The size of the assembled genome was 706.94 Mb, with a contig N50 of 15.24 Mb and scaffold N50 of 29.42 Mb. Approximately 96.11% of assembled sequences were anchored and oriented onto 24 pseudo-chromosomes. The genome contained 213.47 Mb repetitive sequences, 25,280 protein-coding genes, and 5,090 non-coding RNAs. The high ratio of complete BUSCO genes (97.20%) indicates high quality of genome assembly. The chromosomal-level reference genome of Chaunax sp. provides a preliminary molecular basis for understanding deep-sea adaptation and phenotypic evolution as well as an important reference for whole-genome sequencing of related species.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1397"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific Data, Pub Date: 2024-12-19, DOI: 10.1038/s41597-024-04233-9
Axin Fan, Tingfa Xu, Geer Teng, Xi Wang, Chang Xu, Yuhan Zhang, Jianan Li
{"title":"Multi-angle and full-Stokes polarization multispectral images using quarter-wave plate and tunable filter.","authors":"Axin Fan, Tingfa Xu, Geer Teng, Xi Wang, Chang Xu, Yuhan Zhang, Jianan Li","doi":"10.1038/s41597-024-04233-9","DOIUrl":"https://doi.org/10.1038/s41597-024-04233-9","url":null,"abstract":"<p><p>Polarization multispectral imaging has advanced significantly due to its robust information representation capability. Imaging application requires rigorous simulation evaluation and experimental validation using standardized datasets. However, the current full-Stokes polarization multispectral images (FSPMI) dataset, while providing simulation data, is limited by image drift and spectral bands. To overcome these limitations and supplement experimental data, this paper introduces the multi-angle and full-Stokes polarization multispectral images (MAFS-PMI) dataset. The imaging system utilizes a rotatable quarter-wave plate (QWP) and a fixed liquid crystal tunable filter (LCTF) to modulate polarization information. Meanwhile, the LCTF allows switching between multiple spectral bands. The acquired multi-angle polarization multispectral images facilitate the experimental validation of encoding strategies and reconstruction algorithms. Additionally, the derived full-Stokes polarization multispectral images enable the simulation evaluation of imaging methods. The MAFS-PMI dataset involves 73 fast axis angles (0° to 180°), four Stokes parameters, five polarization parameters, 35 spectral bands (520 nm to 690 nm), 400 × 400 pixels, and 12 distinct objects. This dataset offers a valuable resource for developing advanced imaging methods.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1401"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific Data, Pub Date: 2024-12-19, DOI: 10.1038/s41597-024-04277-x
Diego Moya, Dennis Copara, Sara Giarola, Adam Hawkes
{"title":"Global datasets of geospatial-AI-resolved energy consumers including climate-driven energy demands, geographical and socioeconomic realities for a transition reset.","authors":"Diego Moya, Dennis Copara, Sara Giarola, Adam Hawkes","doi":"10.1038/s41597-024-04277-x","DOIUrl":"https://doi.org/10.1038/s41597-024-04277-x","url":null,"abstract":"<p><p>Traditional models deliberately simplify millions of consumers into a single, homogeneous, representative agent with perfect market knowledge and rational expectations, limiting their capacity to capture real-world complexities. To address this limitation in mainstream models, this article provides global datasets to parametrise energy consumers within climate-energy-economy models considering climate-driven energy demand, socioeconomic and demographic factors. The datasets emerge from applying geospatial artificial intelligence, machine learning and big data analytics on a range of geospatial parameters at 1 km<sup>2</sup> resolution. Twenty distinctive energy consumers are defined using three heterogeneous geospatial features, eight diverse and two evolving parameters. This parametrisation of consumers strengthens the applicability of climate-energy-economy models to guide effective, equitable and just climate policy design. This comprehensive analysis of complex interactions between climate, socioeconomic and demographic factors supports more realistic decision-making for a sustainable transition reset. This research emphasises the geospatial distribution of energy consumers to enhance technoeconomic assessment, understanding consumer dynamics for consumer-led resource allocation and informed policy implementation. These datasets can be used in climate-energy-economy models to parametrise consumers beyond traditional approaches.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1408"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Genome sequencing and assembly of near threatened Clarias dussumieri (Valenciennes, 1840), an endemic catfish of peninsular India.","authors":"Vindhya Mohindra, Labrechai Mog Chowdhury, Ravi Charan, Valaparambil Saidumohammad Basheer, Joykrushna Jena","doi":"10.1038/s41597-024-04272-2","DOIUrl":"https://doi.org/10.1038/s41597-024-04272-2","url":null,"abstract":"<p><p>Clarias dussumieri, a near threatened freshwater catfish, is endemic to peninsular India and has aquaculture potential. Unlike its sister species, C. magur, the male fish need not be sacrificed during captive breeding. Thus, the generation of genomic information of this species becomes significant for effective genome mining through the bioprospecting of novel genes for important production traits. In this study, the genome assembly was undertaken to address this gap by generating a high-quality chromosome-level genome assembly using PacBio long reads and Hi-C scaffolding. The total assembled genome was found to be 918.72 Mb in size and showed 95.23% completeness. Its characterization exhibited 41.46% repeats, 1,174,725 SSRs and 25,369 predicted genes with functional annotations. The single-copy ortholog analysis placed C. dussumieri in a distinct position with C. magur. The comprehensive genomic information offers resources for comparative genomics with other Clarias species for improvement of economic traits.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1406"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific Data, Pub Date: 2024-12-19, DOI: 10.1038/s41597-024-04234-8
Songbin Wu, Zi Shao, Robbie M Andrew, Longfei Bing, Jiaoyue Wang, Le Niu, Zhu Liu, Fengming Xi
{"title":"Global CO<sub>2</sub> uptake by cement materials accounts 1930-2023.","authors":"Songbin Wu, Zi Shao, Robbie M Andrew, Longfei Bing, Jiaoyue Wang, Le Niu, Zhu Liu, Fengming Xi","doi":"10.1038/s41597-024-04234-8","DOIUrl":"https://doi.org/10.1038/s41597-024-04234-8","url":null,"abstract":"<p><p>The majority of the carbon footprint of the cement industry originates from the decomposition of alkaline carbonates during clinker production. Recent studies have demonstrated that calcium oxides and other alkaline oxides in cement materials can sequester CO<sub>2</sub> through the carbonation process and partially offset the carbon emissions generated during cement production. This study employs a comprehensive analytical model to estimate the CO<sub>2</sub> uptake via hydrated cement carbonation, including concrete, mortar, construction waste, and cement kiln dust (CKD), covering major cement production and consumption regions worldwide from 1930 to 2023. In 2023, the global annual cement CO<sub>2</sub> uptake reached 0.93 Gt/yr (95% CI: 0.80-1.13 Gt/yr). From 1930 to 2023, the global cumulative cement CO<sub>2</sub> absorption reached 23.89 Gt (95% CI: 20.47-28.74 Gt), equivalent to 52.32% of the CO<sub>2</sub> process emissions from cement production during the same period. Our system for estimating cement emissions and uptake is updated annually, providing consistent and accurate data for the cement industry and carbon cycle studies. This data supports improved adaptation to future challenges.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1409"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific Data, Pub Date: 2024-12-19, DOI: 10.1038/s41597-024-04268-y
Chunlin He, Xinhui Zhang, Zhengyong Wen, Qiong Shi, Zhaobin Song
{"title":"A chromosome-scale reference genome assembly for Triplophysa lixianensis.","authors":"Chunlin He, Xinhui Zhang, Zhengyong Wen, Qiong Shi, Zhaobin Song","doi":"10.1038/s41597-024-04268-y","DOIUrl":"https://doi.org/10.1038/s41597-024-04268-y","url":null,"abstract":"<p><p>In this study, we constructed a chromosome-scale reference genome assembly for Lixian plateau loach, Triplophysa lixianensis, by integration of MGI short-read, PacBio HiFi long-read and Hi-C sequencing technologies. A 668-Mb haplotypic genome assembly was obtained for a female T. lixianensis, and 98.91% of the assembled sequences were anchored into 25 chromosomes. This assembly has a moderate repeat content (35.63%) and an annotation of 23,774 protein-coding genes, of which 94.15% were assigned predicted functions. The assembled genome of T. lixianensis shared a good syntenic relationship with previously published data of its relative T. dalaica. Taken together, our genome data presented here provide a valuable genetic resource for in-depth evolutionary and functional studies, as well as molecular breeding and conservation of this valuable fish species to elevate its ecological and economic value.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1404"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific Data, Pub Date: 2024-12-19, DOI: 10.1038/s41597-024-04253-5
Santiago Bogarra, Manuel Moreno-Eguilaz, Juan Antonio Ortega-Redondo, Jordi-Roger Riba
{"title":"A dataset of voltage and current waveforms in an electric arc under low pressure for aircraft power systems.","authors":"Santiago Bogarra, Manuel Moreno-Eguilaz, Juan Antonio Ortega-Redondo, Jordi-Roger Riba","doi":"10.1038/s41597-024-04253-5","DOIUrl":"https://doi.org/10.1038/s41597-024-04253-5","url":null,"abstract":"<p><p>This paper presents an experimental dataset developed for the detection of parallel arc faults in aircraft electrical systems. This dataset is based on a total of 960 experiments performed in a low-pressure chamber under different conditions using two electrodes placed on the surface of an insulating material. These experiments correspond to 2 insulating materials, 12 electrode distances, and 10 pressure conditions representative of aircraft environments. Each experimental condition was repeated four times, resulting in 960 experimental recordings, each containing one million samples of time, current, and voltage signals of the electric arc induced on the surface of the insulating material. The dataset can be used to model arc behavior under different pressure conditions, to identify patterns that indicate the presence of an arc, and to accelerate the improvement of arc identification. This dataset has the potential to be used to develop arc fault detection and identification methods for more electric and all-electric aircraft and other electric vehicles.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1396"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A chromosome-level genome assembly of skipjack tuna, Katsuwonus pelamis (Perciformes: Scombridae).","authors":"Xuanguang Liang, Junrou Huang, Bilin Liu, Feng Wu, Jian Liu, Jianguo Lu","doi":"10.1038/s41597-024-04280-2","DOIUrl":"https://doi.org/10.1038/s41597-024-04280-2","url":null,"abstract":"<p><p>Skipjack tuna (Katsuwonus pelamis), a highly migratory pelagic species widely distributed in tropical and subtropical oceanic regions, has consistently ranked third in global fishery landings from 2015 to 2022 and holds substantial economic significance for the coastal fisheries of Pacific Rim countries. Integrating PacBio and Hi-C data, a chromosome-level assembly of its genome was accomplished. This assembly comprises 24 pseudo-chromosomes, yielding a genome size of 827.9 Mb with a scaffold N50 length of 32.7 Mb, indicative of a highly contiguous assembly. A BUSCO assessment ascertained the comprehensiveness of the genome at 98.7%, indicative of comprehensive genomic representation. A total of 32,001 protein-coding genes were predicted with 31,993 genes (99.98%) annotated. The chromosome-level genome assembly of K. pelamis is key to understanding its evolution and genetics, facilitating targeted conservation and sustainable fishing practices for this economically important species.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1405"},"PeriodicalIF":5.8,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}