{"title":"Extensive Dataset for Peristaltic Pump Accuracy Enhancement in Pharmaceutical Environments.","authors":"Davide Privitera, Alessandro Mecocci, Sandro Bartolini","doi":"10.1038/s41597-025-05902-z","DOIUrl":"10.1038/s41597-025-05902-z","url":null,"abstract":"<p><p>We publish a comprehensive dataset of peristaltic pump dosing outputs in pharmaceutical manufacturing, where accuracy is crucial for drug quality and patient safety, consisting of 149,847 measurements spanning volumes from 0.1 to 2.0 ml. An industrial filling system was used to acquire data under controlled conditions, using calibrated weighing equipment. To the best of our knowledge, this is the first dataset documenting pump behavior across such a wide range of volumes. The dataset aims to constitute a solid tool that enables investigation from short-term precision to long-term stability standpoints, providing detailed insights into peristaltic pump behavior under various operating conditions. Additionally, the dataset incorporates compensation outcomes across multiple volumes, documenting both statistical and AI-based compensation strategies, thus exemplifying how the statistical behavior of the dosing can change in response to some compensation strategies aimed to improve dosing accuracy. This resource directly addresses pharmaceutical industry needs by supporting optimization of quality control systems and validation of novel compensation strategies.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1618"},"PeriodicalIF":6.9,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12500925/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145239348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-03DOI: 10.1038/s41597-025-05835-7
Giacomo Carloni, Luca Martelli, Alberto Martini, Thomas Gusmeo, Gianluca Vignaroli, Giulio Viola
{"title":"A new 3D seismotectonic model of the Northern Apennines Pedeapenninic margin for seismic hazard assessment.","authors":"Giacomo Carloni, Luca Martelli, Alberto Martini, Thomas Gusmeo, Gianluca Vignaroli, Giulio Viola","doi":"10.1038/s41597-025-05835-7","DOIUrl":"10.1038/s41597-025-05835-7","url":null,"abstract":"<p><p>Faults have the potential to generate earthquakes causing significant damage to societal infrastructure and life losses. Innovative methodologies and multidisciplinary research approaches are essential for assessing seismic hazard in areas where the study and parametrization of earthquake-generating faults present significant challenges, for example due to the scarcity of fault exposures. This is the case of the Northern Apennines front (Italy), where active and seismogenic faults are concealed beneath thick sedimentary deposits. To study the local seismotectonic framework we present a new 3D model for a sector of the Northern Apennines Pedeapenninic margin between Parma and Bologna. The model was generated by integrating geological, geophysical and seismological datasets in the software Leapfrog Works. It assembles eight primary geological units from the surface down to ∼15 km depth and allowed for the reconstruction of 54 active faults, including 12 seismogenic faults. The main aim of this study is to introduce a 3D seismotectonic database that serves scientific, educational and practical applications and can be used for seismic hazard analytical assessment.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1616"},"PeriodicalIF":6.9,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12494688/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145225631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-03DOI: 10.1038/s41597-025-05903-y
Marco Civera, Fabrizio Aloschi, Galilea Margherita Di Maio, Juan Pablo Fierro Carrasco, Andrea Miano, Bernardino Chiaia, Andrea Prota
{"title":"Seismic resilience of urban networks: dataset for infrastructure visualization and vulnerability assessment.","authors":"Marco Civera, Fabrizio Aloschi, Galilea Margherita Di Maio, Juan Pablo Fierro Carrasco, Andrea Miano, Bernardino Chiaia, Andrea Prota","doi":"10.1038/s41597-025-05903-y","DOIUrl":"10.1038/s41597-025-05903-y","url":null,"abstract":"<p><p>We provide geographic information system (GIS) data and a multimodal dataset from a systematic infrastructure vulnerability assessment in the urban road networks of Turin and Naples, Italy. The seismic typologies of the relevant structural objects (SOs), including bridges, buildings, and roads, were evaluated using digital elevation models (DEMs) and satellite data. The presented GIS data are essential for visualizing and spatially interconnecting SOs; this enables network modeling as a complex system within the Spatial Data Infrastructure (SDI) portfolio of interest. The dataset also includes landslide characteristics from Geoportale Piemonte and the GeoNetwork catalog. Potential applications include resilience analysis, seismic risk assessment, emergency response planning, and post-disaster recovery estimations. Moreover, the dataset helps investigate the interplay between structural vulnerability and geohazards like landslides, heavy rainfall, and earthquakes. Notably, it is particularly relevant for research on urban networks as complex systems, where SDIs assess transportation efficiency and functionality in both pre- and post-event scenarios.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1614"},"PeriodicalIF":6.9,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12494711/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145225585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DenPAR: Annotated Intra-Oral Periapical Radiographs Dataset for Machine Learning.","authors":"Sumudu Rasnayaka, Dhanushka Leuke Bandara, Amali Jayasundara, Ruwan Jayasinghe, Chathura Wimalasiri, Piumal Rathnayake, Shamod Wijerathne, Roshan Ragel, Vajira Thambawita, Isuru Nawinne","doi":"10.1038/s41597-025-05906-9","DOIUrl":"10.1038/s41597-025-05906-9","url":null,"abstract":"<p><p>Dental diseases are one of the most common diseases that affect humans. Clinicians employ several techniques for diagnosing and monitoring dental diseases, with intra-oral periapical (IOPA) radiographs being among the most commonly utilized methods. The development of artificial intelligence (AI) technologies for analyzing oral radiographs is being explored across various imaging modalities. However, the limited availability of publicly accessible datasets has been a significant challenge. Although datasets of dental radiographs are available, most of these datasets contain panoramic radiographs with teeth segmentation only. This new data set includes IOPA radiographs with annotations of important landmarks along with tooth segmentation. The dataset includes 1000 images with marked landmarks, along with metadata. Researchers can leverage this resource to create AI solutions for analyzing IOPA radiographs.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1615"},"PeriodicalIF":6.9,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12494692/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145225640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Polish multichannel audio-visual child speech dataset with double-expert sigmatism diagnosis.","authors":"Michal Krecichwost, Zuzanna Miodonska, Agata Sage, Joanna Trzaskalik, Ewa Kwasniok, Pawel Badura","doi":"10.1038/s41597-025-05896-8","DOIUrl":"10.1038/s41597-025-05896-8","url":null,"abstract":"<p><p>The paper introduces PAVSig: Polish Audio-Visual child speech dataset for computer-aided diagnosis of Sigmatism (lisp). The study aimed to gather data on articulation, acoustics, and visual appearance of the articulators in different child speech patterns, particularly in sigmatism. The data was collected in 2021-2023 in six kindergarten and school facilities in Poland during the speech and language therapy examinations of 201 children aged 4-8. The diagnosis was performed simultaneously with data recording, including 15-channel spatial audio signals and a dual-camera stereovision stream of the speaker's oral region. The data record comprises audiovisual recordings of 51 words and 17 logotomes containing all 12 Polish sibilants and the corresponding speech and language therapy diagnoses from two independent speech and language therapy experts. In total, we share 66,781 audio-video segments, including 12,830 words and 53,951 phonemes (12,576 sibilants).</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1612"},"PeriodicalIF":6.9,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12491466/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145213588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Spine endoscopic atlas: an open-source dataset for surgical instrument segmentation.","authors":"Zhipeng Xu, Hong Wang, Yongxian Huang, Jianjin Zhang, Yanhong Chen, Shangjie Wu, Zhouyang Hu, Guanghui Yue, Jax Luo, Guoxin Fan, Xiang Liao","doi":"10.1038/s41597-025-05897-7","DOIUrl":"10.1038/s41597-025-05897-7","url":null,"abstract":"<p><p>Endoscopic spine surgery (ESS) is a minimally invasive procedure used for spinal nerve decompression, herniated disc removal, and spinal fusion. Despite its many advantages, its steep learning curve poses a challenge to widespread adoption. The development of artificial intelligence (AI) systems is crucial for enhancing the precision and safety of ESS. The automatic segmentation of surgical instruments is a key step towards realizing intelligent surgical assistance systems. As such, this paper has created the Spine Endoscopic Atlas (SEA) dataset, a comprehensive collection of annotated images encompassing all instruments commonly used in spinal endoscopic surgery. In total, SEA contains 48,510 images and 10,662 instrument segmentations derived from real-world ESS. This dataset is specifically designed to train deep learning models for precise instrument segmentation. Through validation of five models, we demonstrate the dataset's value in improving segmentation accuracy under complex conditions, providing a foundation for future AI advancements in ESS.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1611"},"PeriodicalIF":6.9,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12491453/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145213571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-02DOI: 10.1038/s41597-025-05901-0
Can Trong Nguyen, Davina Vačkářová, Jan Weinzettel
{"title":"Consistent global dataset on biodiversity intactness footprint of agricultural production from 2000 to 2020.","authors":"Can Trong Nguyen, Davina Vačkářová, Jan Weinzettel","doi":"10.1038/s41597-025-05901-0","DOIUrl":"10.1038/s41597-025-05901-0","url":null,"abstract":"<p><p>Global biodiversity is rapidly declining, primarily due to agricultural production driven by both domestic and transboundary consumption. This study addresses the challenges posed by inconsistent spatiotemporal biodiversity data by developing a time series of biodiversity loss footprints based on Biodiversity Intactness Index (BII). Numerous land use, land cover, and auxiliary datasets were integrated to produce a consistent time series of high-resolution harmonized land use (HHLU) maps. These maps were utilized to quantify spatial BII using linear-mixed effect models. Biodiversity intactness loss (BII footprint) was subsequently attributed to specific crops and livestock commodities. This study provides comprehensive global datasets, including HHLU and BII maps, and synthesized BII footprints across 14 biomes, 193 countries and territories, 154 crop items, and 9 livestock categories from 2000 to 2020. These datasets facilitate spatiotemporal analyses to identify trends and patterns in global biodiversity integrity and biodiversity footprints, thereby elucidating the ecological trade-offs embedded in international trade. These insights can encourage appropriate interventions to transform consumption patterns and supply chains toward the effective conservation of global biodiversity.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1613"},"PeriodicalIF":6.9,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12491505/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145213574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-01DOI: 10.1038/s41597-025-05909-6
Anna Pot, Laura L Carstensen
{"title":"A generated image repository of aging faces.","authors":"Anna Pot, Laura L Carstensen","doi":"10.1038/s41597-025-05909-6","DOIUrl":"10.1038/s41597-025-05909-6","url":null,"abstract":"<p><p>Faces are a rich source of information for humans and a substantial amount of behavioral science research uses face stimuli to assess person perception. Unfortunately, this body of research is limited by an overreliance on young, predominantly white faces normed on young adult perceivers. To address these limitations, we created an open-access database of AI-generated faces that represents the same individuals at three life stages (young adulthood, middle age, and older adulthood) including equal numbers of males and females. Using advanced generative algorithms, the approach digitally aged 62 young individuals, thus preserving identity-specific features while realistically portraying age-related changes. The resulting database comprises 186 images. Each image has been age-normed and validated for authenticity. Although the database will be useful for many research questions, the stimuli are especially well-suited for research on age comparisons because the same individuals can be presented at different ages.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1610"},"PeriodicalIF":6.9,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12489049/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145207306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scientific DataPub Date : 2025-10-01DOI: 10.1038/s41597-025-05697-z
Pamela Sánchez-Vendizú, Gideon Erkenswick, Jhakelin Reyes, Samantha López Clinton, Thalía Silvestre Espejo, Gabriela Cáceres, Zane Libke, Alejandra Arana, Jorge Mendoza-Silva, Cristian Tirapelle, Sean Williams, Varun Swamy, José Martínez-Altamirano, Juan Esteves, Juan P Barnuevo-Bullón, Jacqueline Hernández-Mejía, Xiomara Caffo, Alejandro Mendevil Malpica, Roberto Salazar-Aragón, Leticia Gutiérrez-Jiménez, Jennifer Stabile, Naija Cuzmar, Timothy D Paine, Priscila Peralta-Aguilar, Giancarlo Inga-Díaz, Jesus Lescano, Andrés Viñas-Martínez, Mary E McElroy, Daxs Coayla, Liza-María Linares R, Nicholas W Pilfold, Alexandra J Sacco, Mónica Arakaki, José Luis Mena, Mathias W Tobler, Letty Salinas, César Arana, Víctor Pacheco, Stefan Prost, Mrinalini Watsa
{"title":"Decoding the Peruvian Amazon with in situ DNA barcoding of vertebrate and plant taxa.","authors":"Pamela Sánchez-Vendizú, Gideon Erkenswick, Jhakelin Reyes, Samantha López Clinton, Thalía Silvestre Espejo, Gabriela Cáceres, Zane Libke, Alejandra Arana, Jorge Mendoza-Silva, Cristian Tirapelle, Sean Williams, Varun Swamy, José Martínez-Altamirano, Juan Esteves, Juan P Barnuevo-Bullón, Jacqueline Hernández-Mejía, Xiomara Caffo, Alejandro Mendevil Malpica, Roberto Salazar-Aragón, Leticia Gutiérrez-Jiménez, Jennifer Stabile, Naija Cuzmar, Timothy D Paine, Priscila Peralta-Aguilar, Giancarlo Inga-Díaz, Jesus Lescano, Andrés Viñas-Martínez, Mary E McElroy, Daxs Coayla, Liza-María Linares R, Nicholas W Pilfold, Alexandra J Sacco, Mónica Arakaki, José Luis Mena, Mathias W Tobler, Letty Salinas, César Arana, Víctor Pacheco, Stefan Prost, Mrinalini Watsa","doi":"10.1038/s41597-025-05697-z","DOIUrl":"10.1038/s41597-025-05697-z","url":null,"abstract":"<p><p>Species extinctions in the tropics are accelerating, outpacing documentation efforts. Meanwhile, DNA barcoding is flourishing in the Global North, backed by extensive infrastructure, allowing non-taxonomic experts to identify species from nonlethal, minimally invasive, and environmental samples. However, hyper-diverse regions like Peru make up only 0.52% (n = 93,246) of the Barcode of Life Database (BOLD). To address this, we established three decentralized laboratories with low-cost, portable nanopore sequencers. From 2018-2023, we generated 1,858 barcodes in situ using six genetic markers for 1,097 vertebrates and 76 plants from existing and new biobanks. We present the first genetic barcodes for 30 mammal and 196 bird species from Peruvian specimens, increasing the number of Peruvian mammal and bird species in BOLD by 110% and 36.5% respectively. We also report the first records of the marsupial Marmosops ocellatus and the bat Sturnira lilium for Peru. This dataset represents an effort to go from fresh or museum-preserved samples to barcodes entirely in situ, avoiding the export of samples outside the country, and facilitating local capacity in molecular biodiversity research.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1545"},"PeriodicalIF":6.9,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12488867/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145207403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Quality controlled, reliable groundwater level data with corresponding specific yield over India.","authors":"Satish Kumar Kuruva, Maya Raghunath Suryawanshi, Amin Shakya, Chethan Va, Balaram Shaw, Vandana Sukumaran, Retinder Kour, Aayushi Kochar, Shard Chander, Bhaskar R Nikam, Nagesh Kumar Dasika, Bramha Dutt Vishwakarma","doi":"10.1038/s41597-025-05899-5","DOIUrl":"10.1038/s41597-025-05899-5","url":null,"abstract":"<p><p>Groundwater is a vital resource for domestic, agricultural, and industrial use, with its demand growing due to population growth and climate change. Several studies have identified groundwater depleting in India at unsustainable rate over North-west part, but a contrasting trend is observed in the southern India. To better study groundwater dynamics quality-controlled and reliable well data is essential, which is missing. Here we process seasonal groundwater levels from 32,299 wells across India to obtain reliable well data and provide respective specific yields. Initially, wells with no data and negative values are removed. Later three-sigma method is imposed on each well to eliminate outliers. Finally, wells with at least two values per year, with no value repeating more than twice consecutively, are retained, resulting in 2,759 reliable wells. We used vectorization-based method to classify aquifer types and estimated specific yields based on hydrogeological map. We also provide open access to data and scripts so that researchers can study groundwater variations, compare GRACE and model-based groundwater estimates against in-situ well data.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1609"},"PeriodicalIF":6.9,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12489037/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145207415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}