Data in Brief最新文献

筛选
英文 中文
ShrimpDiseaseBD: An image dataset for detecting shrimp diseases in the aquaculture sector of Bangladesh ShrimpDiseaseBD:用于检测孟加拉国水产养殖部门虾类疾病的图像数据集
IF 1
Data in Brief Pub Date : 2025-04-11 DOI: 10.1016/j.dib.2025.111553
Mohammad Manzurul Islam, Anabil Sarker, Ashiquzzaman Choudhury, Noortaz Ahmed, Ahmed Abdal Shafi, Nishat Tasnim Niloy, Md Shorif Hossain, Md Sawkat Ali, Abdullahi Chowdhury, Md. Hasanul Ferdaus
{"title":"ShrimpDiseaseBD: An image dataset for detecting shrimp diseases in the aquaculture sector of Bangladesh","authors":"Mohammad Manzurul Islam,&nbsp;Anabil Sarker,&nbsp;Ashiquzzaman Choudhury,&nbsp;Noortaz Ahmed,&nbsp;Ahmed Abdal Shafi,&nbsp;Nishat Tasnim Niloy,&nbsp;Md Shorif Hossain,&nbsp;Md Sawkat Ali,&nbsp;Abdullahi Chowdhury,&nbsp;Md. Hasanul Ferdaus","doi":"10.1016/j.dib.2025.111553","DOIUrl":"10.1016/j.dib.2025.111553","url":null,"abstract":"<div><div>Shrimp farming is a significant contributor to Bangladesh's economy, providing livelihoods for millions of people in coastal areas. However, the shrimp industry faces challenges from prevalent shrimp diseases, which can disrupt the economy and harm the environment. Detecting these diseases early and effectively is crucial. To address this concern, a dataset has been developed containing images of healthy and diseased shrimp of different types. The images were collected from local shrimp farms under expert supervision using high-quality smartphone cameras. The dataset includes 1149 original images, with diseased shrimp images annotated to improve detection capabilities. This dataset is expected to be valuable for detecting shrimp diseases with precision and timing and is likely to encourage research and practical applications in automated shrimp health monitoring. It will also be a valuable resource for computer vision and aquaculture researchers.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111553"},"PeriodicalIF":1.0,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143864706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterization and genomic analysis of Lactiplantibacillus plantarum LP8 as a probiotic candidate for medical applications 植物乳杆菌LP8作为医学应用的候选益生菌的特性和基因组分析
IF 1
Data in Brief Pub Date : 2025-04-11 DOI: 10.1016/j.dib.2025.111555
Nirusna Jehma , Nattarika Chaichana , Jirasa Boonsan , Kamonnut Singkhamanan , Monwadee Wonglapsuwan , Rattanaruji Pomwised , Sarunyou Chusri , Komwit Surachat
{"title":"Characterization and genomic analysis of Lactiplantibacillus plantarum LP8 as a probiotic candidate for medical applications","authors":"Nirusna Jehma ,&nbsp;Nattarika Chaichana ,&nbsp;Jirasa Boonsan ,&nbsp;Kamonnut Singkhamanan ,&nbsp;Monwadee Wonglapsuwan ,&nbsp;Rattanaruji Pomwised ,&nbsp;Sarunyou Chusri ,&nbsp;Komwit Surachat","doi":"10.1016/j.dib.2025.111555","DOIUrl":"10.1016/j.dib.2025.111555","url":null,"abstract":"<div><div>This study presents the whole genome sequencing (WGS) and functional analysis of <em>Lactiplantibacillus plantarum</em> LP8, a promising probiotic strain, is presented in this research. The genome comprises a 3.23 Mbp circular chromosome and three plasmids including plasmid1_LP8 (58,764 bp), plasmid2_LP8 (45,003 bp), and plasmid3_LP8 (7985 bp). Functional annotation identified 3151 coding sequences (CDSs) mapped to 209 RAST subsystems, alongside biosynthesis gene clusters encoding Plantaricin J and other secondary metabolites such as RiPP-like peptides and terpenes. Safety analysis revealed no acquired antimicrobial resistance gene (AMR) or virulence genes, with only intrinsic resistance to certain antibiotics. In vitro analysis confirmed its sensitivity to antibiotics such as ampicillin, erythromycin, and tetracycline, and its hemolytic activity displayed an α-hemoly-sis pattern. These findings confirm <em>L. plantarum</em> LP8 as a safeand effective probiotic candidate with significant potential for antimicrobial and biotechnological applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111555"},"PeriodicalIF":1.0,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143887347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data on Swiss consumers’ perception of different types of sustainability levies, agriculture and willingness to choose suboptimal potatoes in different settings 瑞士消费者对不同类型的可持续税收、农业的看法以及在不同环境下选择次优土豆的意愿的数据
IF 1
Data in Brief Pub Date : 2025-04-11 DOI: 10.1016/j.dib.2025.111551
Jeanine Ammann , Gabriele Mack , Rita Saleh
{"title":"Data on Swiss consumers’ perception of different types of sustainability levies, agriculture and willingness to choose suboptimal potatoes in different settings","authors":"Jeanine Ammann ,&nbsp;Gabriele Mack ,&nbsp;Rita Saleh","doi":"10.1016/j.dib.2025.111551","DOIUrl":"10.1016/j.dib.2025.111551","url":null,"abstract":"<div><div>We present representative survey data from 481 Swiss consumers. Data were collected in the German-speaking parts of Switzerland in February and March 2024. The survey includes three independent main parts.</div><div>In a first part, we collected qualitative and quantitative data on participants’ perception of Swiss agriculture and farmers. Specifically, participants’ trust in crop and livestock production farmers and their perceived knowledge about production methods and their affect towards farmers was assessed.</div><div>In a second part, we collected quantitative data on participants’ preference for different sustainability levies. For this, six different products were used (i.e., fresh/processed vegetables, dairy, and meat). For each of these six products, participants were shown four levy options from which they had to choose the one that they found most appealing. For vegetables, the options were: (A) reduction of risks related to plant protection products, (B) more support for local farmers, (C) support for environmental sustainability, and (D) sustainability projects in general. For the animal products, option (A) was an increase in animal welfare, whilst options (B), (C) and (D) were the same as for the vegetable products.</div><div>In a third part, we collected qualitative and quantitative data on participants preferences for suboptimal or optimal potatoes. Here, a 2 × 2 experimental design (setting × information) was used. This means that participants were presented with either a supermarket or farm shop setting and with or without food waste information. Participants then chose between two potatoes: optimal potato A, suboptimal potato B, or neither. Both potatoes were equally expensive.</div><div>Further, we collected personal information about participants such as gender, age, education level and how they placed themselves regarding their political orientation on a left-right scale. We further collected behavioural data including diet, that is, milk and meat consumption frequency as well as shopping behaviour, where we asked participants where they usually did their grocery shopping. At the end of the survey, we used existing and new scales to measure participants’ perception of farmers, health consciousness and environmental attitudes. Before collecting this data, ethical approval was obtained from the Agroscope ethical commission (application EK-AGS-2024-N-01).</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111551"},"PeriodicalIF":1.0,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CMHT autonomous dataset: A multi-sensor dataset including radar and IR for autonomous driving CMHT自动驾驶数据集:用于自动驾驶的多传感器数据集,包括雷达和红外
IF 1
Data in Brief Pub Date : 2025-04-11 DOI: 10.1016/j.dib.2025.111552
Howard Zhang , Ash Liu , Saied Habibi , Martin v. Mohrenschildt , Ryan Ahmed
{"title":"CMHT autonomous dataset: A multi-sensor dataset including radar and IR for autonomous driving","authors":"Howard Zhang ,&nbsp;Ash Liu ,&nbsp;Saied Habibi ,&nbsp;Martin v. Mohrenschildt ,&nbsp;Ryan Ahmed","doi":"10.1016/j.dib.2025.111552","DOIUrl":"10.1016/j.dib.2025.111552","url":null,"abstract":"<div><div>Standardized datasets are essential for the development and evaluation of autonomous driving algorithms. As the types of sensors available to researchers increase, datasets containing a variety of temporally and spatially aligned sensors have become increasingly valuable. This paper presents a driving dataset recorded using a complete sensor suite for research on autonomous driving, perception, and sensor fusion. The dataset consists of over 9000 frames of data recorded at 10-20Hz using a complete sensor suite made up of Velodyne LiDAR, GPS/IMU, mm-wave radar, as well as color and infrared cameras. The capture scenarios include poor weather/lighting conditions, such as rain/night scenarios, and diverse traffic conditions, such as highways and cities with various objects. Both fully synchronized data and raw recordings in the form of ROS2 bags are provided, as well as 3D tracklet labels for individual objects. This paper provides technical details on the driving platform, data format, and utilities.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111552"},"PeriodicalIF":1.0,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Endophytic bacteriome data of Litchi chinensis established by metagenomic 16S rRNA gene sequencing 通过元基因组 16S rRNA 基因测序建立的荔枝内生菌群数据
IF 1
Data in Brief Pub Date : 2025-04-10 DOI: 10.1016/j.dib.2025.111544
Dinh Sy Nguyen , Dinh Minh Tran
{"title":"Endophytic bacteriome data of Litchi chinensis established by metagenomic 16S rRNA gene sequencing","authors":"Dinh Sy Nguyen ,&nbsp;Dinh Minh Tran","doi":"10.1016/j.dib.2025.111544","DOIUrl":"10.1016/j.dib.2025.111544","url":null,"abstract":"<div><div>This work reported the diversity profiling and predicted metabolic function of the endophytic bacteriome of lychee (<em>Litchi chinensis</em> S.) cultivated in Dak Lak Province of Vietnam for the first time. Roots of lychee were collected from three different fields in Krong Ana District in Dak Lak. 16S rRNA primers were used to sequence the metagenomic library. Kraken 2 was used to analyze the taxonomic distribution, while the MetaCyc database was used to predict the metabolic function. We identified 10 phyla, 14 classes, 27 orders, 30 families, and 27 genera of the endophytic bacteria from the sample. Actinomycetota was the most predominant phylum (84.49%), and biosynthesis was the bacteriome's primary function (75.42%). Data provided insight into the taxonomic distribution and metabolic function of lychee endophytic bacteria and might be helpful for the next steps concerning sustainable lychee cultivation using endophytic bacteria.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111544"},"PeriodicalIF":1.0,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143843346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Updating “A dataset on patient-individual lymph node involvement in oropharyngeal squamous cell carcinoma” with an additional dataset from a second institution 更新“口咽鳞状细胞癌患者个体淋巴结累及数据集”,使用来自第二机构的额外数据集
IF 1
Data in Brief Pub Date : 2025-04-10 DOI: 10.1016/j.dib.2025.111546
Sergi Benavente , Roman Ludwig , Panagiotis Balermpas , Jan Unkelbach
{"title":"Updating “A dataset on patient-individual lymph node involvement in oropharyngeal squamous cell carcinoma” with an additional dataset from a second institution","authors":"Sergi Benavente ,&nbsp;Roman Ludwig ,&nbsp;Panagiotis Balermpas ,&nbsp;Jan Unkelbach","doi":"10.1016/j.dib.2025.111546","DOIUrl":"10.1016/j.dib.2025.111546","url":null,"abstract":"<div><div>With this update, we add 164 patients with newly diagnosed oropharyngeal squamous cell carcinoma (OPSCC) from the University Hospital Vall d'Hebron (HVH) in Barcelona, Spain, to the previously published cohort of 287 OPSCC patients from the University Hospital Zurich (USZ). For each patient, we report the clinical involvement of lymph node levels (LNLs) I-V and VII on both sides of the neck. LNL involvement is assessed separately for the available diagnostic modalities comprising computed tomography (CT), magnetic resonance imaging (MRI), and/or <sup>18</sup>FDG-positron emission tomography (PET/CT). For 10 surgically treated patients, we also report pathological LNL involvement after neck dissection. Additionally, we report clinicopathological factors such as sex, age, alcohol and nicotine abuse, HPV status, TNM stage, tumor subsite (ICD-10 code) and tumor volume, and whether the tumor extended over the mid-sagittal plane.</div><div>The additional data is made available in the same CSV file format as the records of the initial dataset. The new data represents a valuable update to the original records that substantially increases the size of the cohort. In addition, it allows assessing differences between datasets, which provides information on potential patient biases. Due to the same data format, it is straightforward to reproduce any analysis that was done on the original data with the extended dataset.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111546"},"PeriodicalIF":1.0,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data on transgenerational memory effects of photosynthetic efficiency of twelve wheat varieties under elevated carbon dioxide concentration and reduced soil water availability 二氧化碳浓度升高和土壤水分有效性降低条件下12个小麦品种光合效率的跨代记忆效应
IF 1
Data in Brief Pub Date : 2025-04-09 DOI: 10.1016/j.dib.2025.111545
Bernd J. Berauer , Suraj Chaudhary , Lorenz Kottmann , Andreas H. Schweiger
{"title":"Data on transgenerational memory effects of photosynthetic efficiency of twelve wheat varieties under elevated carbon dioxide concentration and reduced soil water availability","authors":"Bernd J. Berauer ,&nbsp;Suraj Chaudhary ,&nbsp;Lorenz Kottmann ,&nbsp;Andreas H. Schweiger","doi":"10.1016/j.dib.2025.111545","DOIUrl":"10.1016/j.dib.2025.111545","url":null,"abstract":"&lt;div&gt;&lt;div&gt;This data [&lt;span&gt;&lt;span&gt;1&lt;/span&gt;&lt;/span&gt;] represents ACi curves of twelve winter wheat varieties, which were grown under elevated and ambient CO&lt;sub&gt;2&lt;/sub&gt; concentrations within a FACE experiment and the subsequent F1 generation was exposed to ambient and elevated CO&lt;sub&gt;2&lt;/sub&gt; concentrations in a highly controlled environment using climate chambers. The 12 winter wheat genotypes (&lt;em&gt;Triticum aestivum&lt;/em&gt; L.) were selected based on their susceptibilty to leaf rust (&lt;em&gt;Puccinia triticina&lt;/em&gt; Eriks.) and Fusarium head blight (&lt;em&gt;Fusarium graminearum&lt;/em&gt; Schwabe) according to the descriptive variety list of the German Federal Office of Plant Varietes (Beschreibende Sortenliste, Bundessortenamt 2024). The aim was to obtain a diverse set of varieties with the widest possible range of susceptibilities to leaf rust and fusarium head blight. Photosynthesis was measured using the novel Dynamic Assimilation Technique, thus not with the common steady-state approach. The individual wheat plants were measured twice, once under saturating soil water availability (θ&lt;sub&gt;FC&lt;/sub&gt;) and once under reduced soil water availability (θ&lt;sub&gt;csoil&lt;/sub&gt;). θ&lt;sub&gt;csoil&lt;/sub&gt; represents the gravimetric water content when the soil matric potential drops below the root matric potential, thus the onset of plant drought stress (&lt;em&gt;sensu&lt;/em&gt; Cai et al&lt;em&gt;.&lt;/em&gt; [2]). The photosynthesis data was used to fit ACi curves and extract the maximum Rubisco carboxylation rate [Vc&lt;sub&gt;max&lt;/sub&gt;], maximum rate of electron transport [J&lt;sub&gt;max&lt;/sub&gt;] and dark respiration [Rd]. At both measurements we determined BBCH and plant height to quantify plant morphological development, as well as leaf water potential to quantify plant ecohydrologic status. At the end of the experiment, biomass was harvested and reported. Further, we provide environmental data of the climate chambers in use.&lt;/div&gt;&lt;div&gt;Within the data repository, we provide comprehensive experimental data on the investigation of transgenerational memory effects on photosynthetic efficiency. We provide photosynthetic raw data as well as processed (merged) and derived (extracted ACi fit) data. Additionally, we provide the R-code to reproduce the calculation of the derived parameters.&lt;/div&gt;&lt;div&gt;Data on transgenerational memory effects (that is, the influence of the parental environment on offspring phenotype and performance) are scarce, i.e. on the adaptive capacity of the photosynthetic apparatus. Thus, the data provided here can contribute to closing this gap. The highly controlled environment allows to closely investigate cause-effect relationships, thereby contributing to a mechanistic understanding of the transgenerational memory effects on photosynthetic efficiency and how this is altered by reduced soil water availability. By using a recently developed methodological approach, the data contributes to further investigate the quality of the method and establish it within the field of plant ecophysiology.&lt;/div&gt;&lt;/di","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111545"},"PeriodicalIF":1.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data of vegetation structure metrics retrieved from airborne laser scanning surveys for European demonstration sites 欧洲示范点机载激光扫描植被结构度量数据检索
IF 1
Data in Brief Pub Date : 2025-04-09 DOI: 10.1016/j.dib.2025.111548
W. Daniel Kissling, Wessel Mulder, Jinhu Wang, Yifang Shi
{"title":"Data of vegetation structure metrics retrieved from airborne laser scanning surveys for European demonstration sites","authors":"W. Daniel Kissling,&nbsp;Wessel Mulder,&nbsp;Jinhu Wang,&nbsp;Yifang Shi","doi":"10.1016/j.dib.2025.111548","DOIUrl":"10.1016/j.dib.2025.111548","url":null,"abstract":"&lt;div&gt;&lt;div&gt;This dataset provides a standardized collection of rasterized Light Detection And Ranging (LiDAR) metrics in GeoTIFF format, derived from country-wide airborne laser scanning (ALS) data across seven demonstration sites in five European countries: Mols Bjerge National Park (Denmark), Reserve Naturelle Nationale du Bagnas (France), Oostvaardersplassen (Netherlands), Salisbury Plain (United Kingdom), Knepp Estate (United Kingdom), Monks Wood (United Kingdom), and the island of Comino (Malta). The sites range in areal size from 0.08 km&lt;sup&gt;2&lt;/sup&gt; to 54 km&lt;sup&gt;2&lt;/sup&gt; and include habitat types such as forests, broadleaf and conifer woodlands, small plantations, dry and wet grasslands, marshes, reedbeds, arable fields, farmland, scrublands and mediterranean garigue. A total of 35 LiDAR metrics were calculated, of which 28 represent vegetation structural attributes. These include vegetation height (seven metrics), vegetation cover (fourteen metrics), and vegetation vertical variability (seven metrics). Additionally, seven metrics describe point density (one metric), eigenvalues (three metrics), and normal vectors (three metrics). The rasterized LiDAR metrics have a spatial resolution of 10 m, with coverage and extent defined by shapefiles corresponding to each demonstration site. The raw ALS point clouds were clipped to the site boundaries and processed with the 'Laserfarm' workflow, a standardized computational workflow that includes modular pipelines for re-tiling, normalization, feature extraction, and rasterization. Laserfarm employs the feature extraction module of the open-source ‘Laserchicken’ software to compute the LiDAR metrics. The workflow was implemented using the IT services of the Dutch national facility for information and communication technology, SURF. The clipped LiDAR point clouds are available through a public repository, except for the LiDAR point clouds from Comino, Malta, which are not publicly available. The 35 rasterized LiDAR metrics (GeoTIFF files, 10 m resolution) from all sites, including Comino, as well as the corresponding site boundary shapefiles (geospatial vector format), are provided in a Zenodo repository. Additionally, the Jupyter Notebooks with Python code for executing the Laserfarm workflow are available to facilitate reproducibility and further computational applications. Users should note that the rasterized LiDAR metrics may contain zero or NA values, particularly over water surfaces, with the pulse penetration ratio metric potentially indicating false high vegetation cover over water. Users may reclassify or mask areas with zero values accordingly. Some pixels exhibit abnormal vegetation height values, which can be filtered before analysis. Certain striping patterns, likely resulting from overlapping flight lines and increased point density, are present in some metrics, though their overall impact appears minimal. This dataset enables diverse applications, including canopy height measurements, mapp","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111548"},"PeriodicalIF":1.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integration of solar flare and coronal mass ejection event data 太阳耀斑和日冕物质抛射事件数据的整合
IF 1
Data in Brief Pub Date : 2025-04-09 DOI: 10.1016/j.dib.2025.111539
Anli Ji , Manolis K. Georgoulis , Berkay Aydin
{"title":"Integration of solar flare and coronal mass ejection event data","authors":"Anli Ji ,&nbsp;Manolis K. Georgoulis ,&nbsp;Berkay Aydin","doi":"10.1016/j.dib.2025.111539","DOIUrl":"10.1016/j.dib.2025.111539","url":null,"abstract":"<div><div>Solar flares and coronal mass ejections are solar transient events that can impact our technological infrastructure in near-Earth and Earth environments. While related, not all flares generate CMEs and there are a limited number of resources that connect CMEs to flares and eventually to their source active regions. We present an integrated solar-flare-to-CME association dataset, along with a multi-step, data-driven spatiotemporal integration methodology for matching solar flares with coronal mass ejections. We perform a confidence-based scoring process that involves spatial and temporal data integration for integrating LASCO CME data to their solar sources. Such process is based on identifying the likely CME candidates with corresponding start and peak times of flares and the first detection time of CMEs. In addition, we check for the locations of flares (within 70 degrees) and CME principal angles as well as the width of each CME to generate spatial connections between instances. Furthermore, we use external association sources to implement a custom verification schema that provides further fidelity to our associations. We also provide an exploratory analysis demonstrating the number of associations generated after the integration. With this data resource, we connect the coronal mass ejections to their source active regions and generate a high-utility solar eruption labeling schema.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111539"},"PeriodicalIF":1.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143870541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balinese text-to-speech dataset as digital cultural heritage 作为数字文化遗产的巴厘文本-语音数据集
IF 1
Data in Brief Pub Date : 2025-04-09 DOI: 10.1016/j.dib.2025.111528
I Gusti Agung Gede Arya Kadyanan, Ngurah Agus Sanjaya ER, Anak Agung Istri Ngurah Eka Karyawati, I Gede Ngurah Arya Wira Putra, ⁠I Made Suma Gunawan, Ni Made Julia Budiantari, Hana Christine Octavia
{"title":"Balinese text-to-speech dataset as digital cultural heritage","authors":"I Gusti Agung Gede Arya Kadyanan,&nbsp;Ngurah Agus Sanjaya ER,&nbsp;Anak Agung Istri Ngurah Eka Karyawati,&nbsp;I Gede Ngurah Arya Wira Putra,&nbsp;⁠I Made Suma Gunawan,&nbsp;Ni Made Julia Budiantari,&nbsp;Hana Christine Octavia","doi":"10.1016/j.dib.2025.111528","DOIUrl":"10.1016/j.dib.2025.111528","url":null,"abstract":"<div><div>Balinese language has a complex and unique language level system, yet still lacks representation in speech-based technologies such as Text-to-Speech (TTS) and speech recognition. As one of the linguistically rich regional languages, Balinese language digitization efforts have not been optimally developed, limiting research in natural language processing (NLP) as well as the application of regional language-based voice technologies. The limitation of voice-based datasets in Balinese is a major challenge in the development of this technology. Therefore, this research aims to develop a dataset of Balinese native speaker audio recordings covering various language levels to support applications in Text-to-Speech (TTS) systems, speech recognition, and voice-to-text technology. The dataset was developed through a data acquisition process that involved recording the voices of native Balinese speakers of the Badung dialect. Data was collected by recording the voices of native Balinese speakers using the Badung dialect. The resulting recordings were then processed using denoising techniques to improve audio quality, before being categorized based on Balinese politeness levels (Alus Singgih, Alus Sor, Alus Mider, Mider, and Andap) as well as including additional phrases and alphabets to provide a wider variety to the dataset. The results show that this dataset consists of 1187 recordings that reflect a wide range of social variation in Balinese. By providing this resource, this research not only contributes to the development of speech-based technologies, but also plays a role in the preservation of Balinese in the digital age, as well as opening up further research opportunities in NLP for languages with limited resources.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111528"},"PeriodicalIF":1.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143833620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信