{"title":"Lidar-derived digital surface model of Stromboli island updated to August 2023.","authors":"Marina Bisson, Roberto Gianardi, Riccardo Civico, Paolo Madonia, Tullio Ricci, Claudia Spinetti","doi":"10.1038/s41597-025-04856-6","DOIUrl":"https://doi.org/10.1038/s41597-025-04856-6","url":null,"abstract":"<p><p>Digital surface models reproduce the 3D topography of a territory at different spatial resolutions depending on the acquisition technique of source data. In active and densely populated volcanic areas, updated digital topographies are fundamental for mapping and quantifying the morphological changes generated by the eruptive events and play a key role in modelling volcanic phenomena and related hazards. This work presents the high-resolution Digital Surface Model of Stromboli Island, Italy, updated to 4<sup>th</sup> August 2023. The model, obtained by elaborating more than 109 × 10<sup>6</sup> Airborne Lidar points (x,y,z), reconstructs the volcano's surface through an elevation matrix at a spatial resolution of 50 cm, reproducing both natural and anthropic elements. The model has been validated by using Ground Control Points and the vertical accuracy results in 8 cm. Nowadays, this model represents the most updated and accurate digital 3D topography of the entire island and, for this reason, can be considered a relevant data not only for multi-temporal morphological and volcanological analyses but also for hazard assessment studies.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"522"},"PeriodicalIF":5.8,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11953410/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143743517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-28DOI: 10.1038/s41597-025-04884-2
Anna Kazarina, Hallie Wiechman, Soumyadev Sarkar, Tanner Richie, Sonny T M Lee
{"title":"Recovery of 679 metagenome-assembled genomes from different soil depths along a precipitation gradient.","authors":"Anna Kazarina, Hallie Wiechman, Soumyadev Sarkar, Tanner Richie, Sonny T M Lee","doi":"10.1038/s41597-025-04884-2","DOIUrl":"10.1038/s41597-025-04884-2","url":null,"abstract":"<p><p>Soil contains a diverse community of organisms; these can include archaea, fungi, viruses, and bacteria. In situ identification of soil microorganisms is challenging. The use of genome-centric metagenomics enables the assembly and identification of microbial populations, allowing the categorization and exploration of potential functions living in the complex soil environment. However, the heterogeneity of the soil-inhabiting microbes poses a tremendous challenge, with their functions left unknown, and difficult to culture in lab settings. In this study, using genome assembling strategies from both field core samples and enriched monolith samples, we assembled 679 highly complete metagenome-assembled genomes (MAGs). The ability to identify these MAGs from samples across a precipitation gradient in the state of Kansas (USA) provided insights into the impact of precipitation levels on soil microbial populations. Metabolite modeling of the MAGs revealed that more than 80% of the microbial populations possessed carbohydrate-active enzymes, capable of breaking down chitin and starch.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"521"},"PeriodicalIF":5.8,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11953352/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143743532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-27DOI: 10.1038/s41597-025-04759-6
Siya Li, Quansheng Ge, Fubao Sun, Qiulei Ji, Wenbin Liu, Ronggao Liu, Duanyang Xu, Zexing Tao
{"title":"Annual 30 m land cover dataset on the Tibetan Plateau from 1990 to 2023.","authors":"Siya Li, Quansheng Ge, Fubao Sun, Qiulei Ji, Wenbin Liu, Ronggao Liu, Duanyang Xu, Zexing Tao","doi":"10.1038/s41597-025-04759-6","DOIUrl":"10.1038/s41597-025-04759-6","url":null,"abstract":"<p><p>Accurate land cover data was fundamental for formulating sound land planning and sustainable development strategies. This study focused on the Tibetan Plateau (TP), a globally sensitive ecological area, and developed a locally tailored annual 30 m resolution land cover dataset from 1990 to 2023 (TPLCD). Leveraging the Google Earth Engine (GEE) platform for Landsat data processing, LandTrendr was employed to generate robust, high-precision training samples. Subsequently, random forest classification and spatiotemporal smoothing strategies were applied to precisely map the land cover dynamics of the TP. Rigorous validation through visual interpretation, authoritative third-party datasets (Geo-Wiki and GLCVSS), and thematic dataset cross-comparisons, revealed an overall accuracy of 84.8%, and a Kappa coefficient of 0.78, fully affirming the dataset's high reliability. This dataset provided invaluable empirical evidence for understanding the vulnerability and adaptability of the TP's ecosystem.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"510"},"PeriodicalIF":5.8,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11950319/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143731154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-27DOI: 10.1038/s41597-025-04743-0
Eberechi Wogu, Patrick Filima, Bradley Caron, Daniel Deabler, Peer Herholz, Catherine Leal, Mohammed F Mehboob, Sohmee Kim, Ananya Gosain, Alisha Flexwala, Soichi Hayashi, Simisola Akintoye, George Ogoh, Tawe Godwin, Damian Eke, Franco Pestilli
{"title":"A labeled Clinical-MRI dataset of Nigerian brains.","authors":"Eberechi Wogu, Patrick Filima, Bradley Caron, Daniel Deabler, Peer Herholz, Catherine Leal, Mohammed F Mehboob, Sohmee Kim, Ananya Gosain, Alisha Flexwala, Soichi Hayashi, Simisola Akintoye, George Ogoh, Tawe Godwin, Damian Eke, Franco Pestilli","doi":"10.1038/s41597-025-04743-0","DOIUrl":"10.1038/s41597-025-04743-0","url":null,"abstract":"<p><p>There is currently a paucity of neuroimaging data from the African continent, limiting the diversity of data from a significant proportion of the global population. This in turn diminishes global health research and innovation. To address this issue, we present and describe the first Magnetic Resonance Imaging (MRI) dataset from individuals in the African nation of Nigeria. This dataset contains pseudonymized structural MRI (T1w, T2w, FLAIR) data of clinical quality, with 35 images from healthy control subjects, 31 images from individuals diagnosed with age-related dementia, and 22 from individuals with Parkinson's Disease. Given the potential for Africa to contribute to the global neuroscience community, this unique MRI dataset represents both an opportunity and benchmark for future studies to share data from the African continent.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"518"},"PeriodicalIF":5.8,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11950411/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143731152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-27DOI: 10.1038/s41597-025-04886-0
Taylor A Berger, Miles Wischnewski, Alexander Opitz, Ivan Alekseichuk
{"title":"Human head models and populational framework for simulating brain stimulations.","authors":"Taylor A Berger, Miles Wischnewski, Alexander Opitz, Ivan Alekseichuk","doi":"10.1038/s41597-025-04886-0","DOIUrl":"10.1038/s41597-025-04886-0","url":null,"abstract":"<p><p>Noninvasive brain stimulation (NIBS) is pivotal in studying human brain-behavior relations and treating brain disorders. NIBS effectiveness relies on informed targeting of specific brain regions, a challenge due to anatomical differences between humans. Computational volumetric head modeling can capture individual effects and enable comparison across a population. However, most studies implementing modeling use a single-head model, ignoring morphological variability, potentially skewing interpretation, and realistic precision. We present a comprehensive dataset of 100 realistic head models with variable tissue conductivity values, lead-field matrices, standard-space co-registrations, and quality-assured tissue segmentations to provide a large sample of healthy adult head models with anatomical and tissue variance. Leveraging the Human Connectome Project s1200 release, this dataset powers population head modeling for stimulation target optimization, MEEG source modeling simulations, and advanced meta-analysis of brain stimulation studies. We performed a quality assessment for each head mesh, which included a semi-manual segmentation accuracy correction and finite-element analysis quality measures. This dataset will facilitate brain stimulation developments in academic and clinical research.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"516"},"PeriodicalIF":5.8,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11950330/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143731131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-27DOI: 10.1038/s41597-025-04788-1
Sarah Matej, Florian Weidinger, Lisa Kaufmann, Nicolas Roux, Simone Gingrich, Helmut Haberl, Fridolin Krausmann, Karl-Heinz Erb
{"title":"A global land-use data cube 1992-2020 based on the Human Appropriation of Net Primary Production.","authors":"Sarah Matej, Florian Weidinger, Lisa Kaufmann, Nicolas Roux, Simone Gingrich, Helmut Haberl, Fridolin Krausmann, Karl-Heinz Erb","doi":"10.1038/s41597-025-04788-1","DOIUrl":"10.1038/s41597-025-04788-1","url":null,"abstract":"<p><p>Land use is intimately linked to key components of the Earth system, including the climate system, biodiversity and biogeochemical cycles. Advanced understanding of patterns and dynamics of land use is vital for assessing impacts on these system components and for developing strategies to ensure sustainability. However, thematically detailed data that enable the analyses of spatiotemporal dynamics of land use, including land-use intensity, are currently lacking. This study presents a comprehensive land-use data cube (LUIcube) that traces global land-use area and intensity developments between 1992 and 2020 annually at 30 arcsecond spatial resolution. It discerns 32 land-use classes that can be aggregated to cropland, grazing land, forestry, built-up land and wilderness. Land-use intensity is represented through the framework of Human Appropriation of Net Primary Production, which allows to quantify changes in NPP, respectively biomass flows, induced by land conversion and land-management. The LUIcube provides the necessary database for analyzing the role of natural and socioeconomic drivers of land-use change and its ecological impacts to inform strategies for sustainable land management.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"511"},"PeriodicalIF":5.8,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11950351/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143730644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-27DOI: 10.1038/s41597-025-04888-y
Andrea Martín-Díaz, Clara de Vega, Sara Martín-Hernanz, Abelardo Aparicio, Rafael G Albaladejo
{"title":"De novo transcriptome assembly of the plant Helianthemum marifolium for the study of adaptive mechanisms.","authors":"Andrea Martín-Díaz, Clara de Vega, Sara Martín-Hernanz, Abelardo Aparicio, Rafael G Albaladejo","doi":"10.1038/s41597-025-04888-y","DOIUrl":"10.1038/s41597-025-04888-y","url":null,"abstract":"<p><p>The genus Helianthemum, commonly known as rockroses, encompasses 140 species primarily distributed in the Palearctic region, with notable diversification driven by climatic and geological changes. These plants are valuable for studying speciation processes and ecological divergence. The chemical properties of the leaves have also been investigated for containing valuable bioactive compounds with several therapeutic properties. However, the availability of genomic resources for species in this genus are almost entirely lacking. Here, we assembled and annotated the first reference transcriptome of Helianthemum marifolium, a species with wide morphological variability and infraspecific diversity. Illumina paired-end RNA sequences were generated using leaves from 16 individuals, representing the four recognized subspecies, all cultivated in a greenhouse. RNA reads were assembled with Trinity and Oases, and EvidentialGene produced a transcriptome with 122,002 transcripts. The transcriptome showed 59524 hits on the UniProtBK database through BLASTx. This transcriptome will be an invaluable resource for transcriptome-level population studies, conservation genetics of the many endangered species within the genus, and for deepen into the metabolic pathways of leaf-derived compounds.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"515"},"PeriodicalIF":5.8,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11950303/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143731083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-27DOI: 10.1038/s41597-025-04862-8
Mario Klanfar, Tomislav Korman, Dubravko Domitrović, Vjekoslav Herceg
{"title":"2D and 3D Scans of Granular Rock Material Heaps and Calibration Models.","authors":"Mario Klanfar, Tomislav Korman, Dubravko Domitrović, Vjekoslav Herceg","doi":"10.1038/s41597-025-04862-8","DOIUrl":"10.1038/s41597-025-04862-8","url":null,"abstract":"<p><p>The angle of repose (AoR) is critical for studying granular materials in mining, geotechnical engineering, pharmaceuticals, and agriculture. Accurate AoR measurements are essential for understanding material stability and movement. The descriptor includes photographs of 86 heaps of rock materials dolomite, quartz sand, marble, and gravel. In addition, photographs of 1 ideal cone and 4 CAD drawing models are provided. These images are processed using specialized software to create detailed 2D and 3D models of the heaps. The generated STL and point-cloud models can be used to calculate the AoR with high accuracy, taking into account the complex surface morphology that traditional methods may overlook. Additionally, these models enable standardized comparisons and can be used for calibration purposes. This data facilitates the development and validation of new AoR measurement methods, enhancing the reliability and consistency of granular material analysis.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"513"},"PeriodicalIF":5.8,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11950640/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143731593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-27DOI: 10.1038/s41597-025-04825-z
Harshavardhana T Gowda, Neha Kaul, Carlos Carrasco, Marcus Battraw, Safa Amer, Saniya Kotwal, Selena Lam, Zachary McNaughton, Ferdous Rahimi, Sana Shehabi, Jonathon Schofield, Lee M Miller
{"title":"A database of upper limb surface electromyogram signals from demographically diverse individuals.","authors":"Harshavardhana T Gowda, Neha Kaul, Carlos Carrasco, Marcus Battraw, Safa Amer, Saniya Kotwal, Selena Lam, Zachary McNaughton, Ferdous Rahimi, Sana Shehabi, Jonathon Schofield, Lee M Miller","doi":"10.1038/s41597-025-04825-z","DOIUrl":"10.1038/s41597-025-04825-z","url":null,"abstract":"<p><p>Upper limb based neuromuscular interfaces aim to provide a seamless way for humans to interact with technology. Among noninvasive interfaces, surface electromyogram (EMG) signals hold significant promise. However, their sensitivity to physiological and anatomical factors remains poorly understood, raising questions about how these factors influence gesture decoding across individuals or groups. To facilitate the study of signal distribution shifts across individuals or groups of individuals, we present a dataset of upper limb EMG signals and physiological measures from 91 demographically diverse adults. Participants were selected to represent a range of ages (18 to 92 years) and body mass indices (healthy, overweight, and obese). The dataset also includes measures such as skin hydration and elasticity, which may affect EMG signals. This dataset provides a basis to study demographic confounds in EMG signals and serves as a benchmark to test the development of fair and unbiased algorithms that enable accurate hand gesture decoding across demographically diverse subjects. Additionally, we validate the quality of the collected data using state-of-the-art gesture decoding techniques.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"517"},"PeriodicalIF":5.8,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11950301/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143730622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-03-27DOI: 10.1038/s41597-025-04836-w
Zhi Liu, Yao Xiao, Zhu Su, Luyao Ye, Kaili Lu, Xian Peng
{"title":"Bilingual Dialogue Dataset with Personality and Emotion Annotations for Personality Recognition in Education.","authors":"Zhi Liu, Yao Xiao, Zhu Su, Luyao Ye, Kaili Lu, Xian Peng","doi":"10.1038/s41597-025-04836-w","DOIUrl":"10.1038/s41597-025-04836-w","url":null,"abstract":"<p><p>Dialogue datasets are essential for advancing natural language processing (NLP) tasks. However, many existing datasets lack integrated annotations for personality and emotion, limiting models' ability to effectively capture these aspects and generate personalized, human-like dialogues, which ultimately impact user experience. To address this challenge, we construct bilingual dialogue datasets in Chinese and English, incorporating Big Five personality traits and emotion annotations. We utilize the AutoGen tool within a multi-agent framework to generate multi-turn question-answering dialogue datasets based on fables. By creating persona agents with diverse personalities, we effectively enhance the heterogeneity of personalities, overcoming previous limitations in personality diversity. Finally, we validate the utterance quality in the dataset and investigate the alignment between conversational utterances and speakers' personality traits. Moreover, by integrating emotional annotations for each utterance, This dataset offers significant potential for developing emotion-aware systems that automatically detect personality traits. It serves as a valuable resource for advancing emotionally intelligent dialogue systems and research in personality and affective computing.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"514"},"PeriodicalIF":5.8,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11950162/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143731082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}