{"title":"A dataset of the representatives elected in France during the fifth republic","authors":"Noémie Févrat , Vincent Labatut , Émilie Volpi , Guillaume Marrel","doi":"10.1016/j.dib.2025.111542","DOIUrl":"10.1016/j.dib.2025.111542","url":null,"abstract":"<div><div>The electoral system is a cornerstone of democracy, shaping the structure of political competition, representation, and accountability. In the case of France, it is difficult to access data describing elected representatives, though, as they are scattered across a number of sources, including public institutions, but also academic and individual efforts. This article presents a unified relational database that aims at tackling this issue by gathering information regarding representatives elected in France over the whole Fifth Republic (1958–present). This database constitutes an unprecedented resource for analyzing the evolution of political representation in France, exploring trends in party system dynamics, gender equality, and the professionalization of politics. By providing a longitudinal view of French elected representatives, the database facilitates research on the institutional stability of the Fifth Republic, offering insights into the factors of political change.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111542"},"PeriodicalIF":1.0,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143843228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data in BriefPub Date : 2025-04-06DOI: 10.1016/j.dib.2025.111532
Mushfiqur Rahman, Md Al Mamun
{"title":"A comprehensive Malabar Spinach dataset for diseases classification","authors":"Mushfiqur Rahman, Md Al Mamun","doi":"10.1016/j.dib.2025.111532","DOIUrl":"10.1016/j.dib.2025.111532","url":null,"abstract":"<div><div>This study focuses on the urgent need to increase detection of diseases in Malabar Spinach, a valuable leaf vegetable crop which is at risk from several disease types including Anthracous leaf spot and Straw mite infestation. There is still a lack of research focused on Malabar spinach, although advances in machine vision have considerably increased the detection of largescale crop diseases. By developing and evaluating machine vision algorithms specifically designed for accurate detection of diseases in Malabar spinach, this research aims to fill this gap. To achieve this, a comprehensive dataset comprising images of both healthy and diseased Malabar Spinach plants is utilized for training, testing, and validation purposes. This study seeks to develop reliable disease detection models through the examination of different image processing techniques and deep learning algorithms such as ResNet50. In particular, the performance of these models is rigorously evaluated on the basis of a set of standardized evaluation metrics which aim to achieve an overall test accuracy of 94%. The results of this research will have a major impact on the cultivation of Malabar spinach in terms of precision farming techniques and effective crop management practices. This study will contribute to the wider objectives of agricultural sustainability and food security, through increasing crop productivity and reducing yield losses. In the end, it is intended to strengthen the resilience of farming communities dependent on Malabar Spinach crops by providing farmers and experts with efficient tools for detecting diseases.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111532"},"PeriodicalIF":1.0,"publicationDate":"2025-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143820392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data in BriefPub Date : 2025-04-06DOI: 10.1016/j.dib.2025.111547
Muhammad ‘Amir Roslan, Farah Liana Mohd Redzuan
{"title":"Data-driven thermal analysis of indoor cooling for a thermoelectric radiant panel system","authors":"Muhammad ‘Amir Roslan, Farah Liana Mohd Redzuan","doi":"10.1016/j.dib.2025.111547","DOIUrl":"10.1016/j.dib.2025.111547","url":null,"abstract":"<div><div>A thermal analysis of a thermoelectric radiant panel (TERP) system was conducted to evaluate its cooling effectiveness in maintaining indoor comfort within the temperature range recommended by ASHRAE Standard 55 and Malaysian Standard (MS) 1525 under tropical climate conditions. The study utilized ANSYS software, incorporating computational fluid dynamics (CFD) tools for detailed heat transfer simulations. The analysis examined the impact of outdoor temperature variations on indoor conditions and TERP system panel temperatures by considering heat transfer mechanisms, boundary conditions, and material properties to ensure accuracy. By accounting for these factors, the simulation provided a realistic representation of the thermal environment in a tropical climate. The simulation results were visualized as 3D thermal profiles, with color contours illustrating temperature distributions across the panel surface and the indoor environment. The collected data was then simplified into graph form for easier interpretation and analysis, including an evaluation of compliance with thermal comfort standards such as ASHRAE Standard 55 and MS 1525. The findings serve as a benchmark for assessing thermoelectric cooling performance in tropical buildings and offer valuable insights for optimizing thermoelectric cooling systems in similar climates. This study contributes to the development of sustainable cooling solutions and supports further research in thermoelectric cooling applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111547"},"PeriodicalIF":1.0,"publicationDate":"2025-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data in BriefPub Date : 2025-04-05DOI: 10.1016/j.dib.2025.111536
Seunghyeon Wang , Sungkon Moon , Ikchul Eum , Dongjin Hwang , Jaejun Kim
{"title":"A text dataset of fire door defects for pre-delivery inspections of apartments during the construction stage","authors":"Seunghyeon Wang , Sungkon Moon , Ikchul Eum , Dongjin Hwang , Jaejun Kim","doi":"10.1016/j.dib.2025.111536","DOIUrl":"10.1016/j.dib.2025.111536","url":null,"abstract":"<div><div>Defect classification from text descriptions written by inspectors during the construction stage can be highly beneficial, offering advantages such as cost savings and improved reputation of apartment complexes by allowing early identification and resolution of issues. Combining automated methods with textual data can facilitate the rapid identification and diagnosis of faults. To develop such automated methods, this research constructed a dataset from real-world data collected from three apartment complexes. This study classifies fire door defects into eight categories: frame gap, door closer adjustment defect, contamination, dent, scratch, sealing components, mechanical operation components, and others. The level of detail in this classification ensures a comprehensive understanding of fire door issues. The main contributions of this dataset to the field are twofold. First, it represents a unique dataset based on real-world fire door defect descriptions, which is currently non-existent in this domain. Second, the dataset's expert labeling adds significant value by ensuring accurate fault classification. We hope this dataset will encourage the development of robust text classification techniques suitable for real-world applications by providing a reliable benchmark.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111536"},"PeriodicalIF":1.0,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143820394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data in BriefPub Date : 2025-04-04DOI: 10.1016/j.dib.2025.111519
Koundinya Challa, Issa W. AlHmoud, Chandra Jaiswal, Anish C. Turlapaty, Balakrishna Gokaraju
{"title":"EMG features dataset for arm activity recognition","authors":"Koundinya Challa, Issa W. AlHmoud, Chandra Jaiswal, Anish C. Turlapaty, Balakrishna Gokaraju","doi":"10.1016/j.dib.2025.111519","DOIUrl":"10.1016/j.dib.2025.111519","url":null,"abstract":"<div><div>This study presents a dataset on hand gesture recognition using electromyography (EMG) signals. The data was collected from eight healthy subjects aged between 19 and 35 years, with each subject performing three distinct hand gestures (lifting, grabbing, and flexing). Surface EMG signals were recorded using the Delsys Trigno Wireless biofeedback system from four sensors placed on the dominant hand's Palm A, Palm B, Biceps, and Forearm. The signals were sampled at 2000 Hz and segmented into gesture trials for analysis. The raw EMG data were filtered and processed to extract seven time-domain features across each channel, resulting in 28 total features. These features were reduced using Principal Component Analysis (PCA) to six components, which accounted for 95 % of the variance. The dataset was then used to train and test machine learning models (Random Forest and Logistic Regression) for gesture classification. This dataset has potential reuse in developing gesture recognition algorithms, enhancing prosthetic control, or exploring human–computer interaction (HCI) applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111519"},"PeriodicalIF":1.0,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143833621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data in BriefPub Date : 2025-04-04DOI: 10.1016/j.dib.2025.111531
Mohsen Heydarzadeh, Teemu Toivola, Victor Vega-Garita, Eero Immonen
{"title":"Dataset of lithium-ion cell degradation under randomized current profiles for NMC, NCA, and LFP chemistries","authors":"Mohsen Heydarzadeh, Teemu Toivola, Victor Vega-Garita, Eero Immonen","doi":"10.1016/j.dib.2025.111531","DOIUrl":"10.1016/j.dib.2025.111531","url":null,"abstract":"<div><div>This paper describes an experimental dataset of lithium-ion cells subjected to a randomized usage profile and periodically characterized through diagnostic tests. The study involved testing eight cells from three types of chemistries (NMC, NCA, LFP) over more than 600 full charge–discharge cycles. The dataset captures the cell's degradation process, including capacity fade and power loss. The experimental procedure comprised an initial full charge/discharge cycle to activate the cells, followed by repeated cycles at varying discharging current rates ranging from 0.5C to 2C/5C. Periodic Hybrid Dynamic Pulse Power Characterization (HPPC) and Constant Current∼ (CC) discharging profiles were executed as reference performance testing (RPT) to monitor transient dynamics and overall performance changes. Furthermore, the Constant Current–Constant Voltage (CC–CV) charging protocol was implemented at 1C charging rates. The experiment entailed the collection of data on voltage, current, cell temperature, ambient temperature, and time, with a 1 Hz sampling rate, utilizing specialized equipment, Chroma 1107 system and temperature sensors. The dataset facilitates the characterization of cell aging under various usage patterns, thereby enabling the development of models and management strategies for different applications. The data was collected at the New Energy Research Center at Turku University of Applied Sciences (TUAS), Finland.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111531"},"PeriodicalIF":1.0,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143843227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A multi-class driver behavior dataset for real-time detection and road safety enhancement","authors":"Arafat Sahin Afridi, Arafath Kafy, Ms. Nazmun Nessa Moon, Md. Shahriar Shakil","doi":"10.1016/j.dib.2025.111529","DOIUrl":"10.1016/j.dib.2025.111529","url":null,"abstract":"<div><div>This paper introduces a novel dataset designed to support the development of AI-driven driver monitoring systems. The dataset captures real-world driver behaviors under diverse driving conditions, including private vehicles and public buses, in Dhaka, Bangladesh. It comprises 7286 high-resolution images categorized into five behavioral classes: Safe Driving, Talking on the Phone, Texting, Turning, and Other Distracting Behaviors. The dataset reflects natural variations in driver behavior, such as different lighting conditions, angles, and vehicle types, making it highly applicable to real-world scenarios. By providing a comprehensive and annotated dataset, we aim to support the development of intelligent transportation systems and contribute to reducing accidents caused by distracted driving. The dataset is publicly available and can be used to train and evaluate machine learning models for real-time driver behavior detection.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111529"},"PeriodicalIF":1.0,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143820393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reflected Light Microscopic Iron ore image dataset for iron ore characterization","authors":"Shama Firdaus , Shamama Anwar , Subrajeet Mohapatra , Prabodha Ranjan Sahoo","doi":"10.1016/j.dib.2025.111540","DOIUrl":"10.1016/j.dib.2025.111540","url":null,"abstract":"<div><div>The dataset contains two folders “IronOreRLM” and “Sample Images”. The folder Sample Images contains few images from each of the grades included in the study and has total of 12 images. This folder is like an abstract of the full dataset and has been created for preview purpose. The IronOreRLM folder is main dataset containing a total of 563 reflected light microscopic (RLM) images of iron ores collected from various mines across India. These RLM images are a valuable source of information about the ores, providing insights into constituent elements, ore quality, structure, and more. Various analyses can be conducted on this dataset to extract meaningful information from the images. The primary goal of acquiring this dataset is to automate the chemical-extensive tasks in mineral processing by leveraging the capabilities of computer vision. While the research work associated with the dataset has been cited in this article, it does not limit the scope of the dataset.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111540"},"PeriodicalIF":1.0,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data in BriefPub Date : 2025-04-03DOI: 10.1016/j.dib.2025.111533
Muhammad Bamoki , Shakhawan Hares Wady , Soran Badawi
{"title":"Holy Quran Kurdish Sorani translation dataset for language modelling","authors":"Muhammad Bamoki , Shakhawan Hares Wady , Soran Badawi","doi":"10.1016/j.dib.2025.111533","DOIUrl":"10.1016/j.dib.2025.111533","url":null,"abstract":"<div><div>The Holy Quran serves as a foundational text in Islamic theology and has been translated into numerous languages across the globe. This paper introduces a manual translation of the Holy Quran into the Kurdish language, specifically designed to aid natural language processing (NLP) research and linguistic analysis. The translation process employed a thorough methodology that combined advanced linguistic tools with the expertise of bilingual religious scholars, translators, and professional proofreaders over several years. Careful attention was given to maintaining both semantic accuracy and theological precision, ensuring a faithful representation of the original Arabic text. The dataset comprises two primary files: a raw translation and a refined linguistic version. We performed various statistical analyses, including the identification of the top 20 most frequent words, a comparative analysis of verse lengths between the Kurdish and Arabic versions, and an evaluation of unique word distributions in both the raw and processed texts. This Kurdish Quran translation dataset represents a significant resource for computational linguistics, particularly in the development of neural machine translation models and in linguistic research focused on under-resourced languages.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111533"},"PeriodicalIF":1.0,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143843347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data in BriefPub Date : 2025-04-03DOI: 10.1016/j.dib.2025.111538
Ashvini Gaikwad, Manoj Deshpande, Varsha Bhole
{"title":"Dataset creation of thermal images of pomegranate for internal defect detection","authors":"Ashvini Gaikwad, Manoj Deshpande, Varsha Bhole","doi":"10.1016/j.dib.2025.111538","DOIUrl":"10.1016/j.dib.2025.111538","url":null,"abstract":"<div><div>Datasets are crucial in various fields, especially in the context of machine learning, data science and research. Datasets are used to train machine learning models. A model learns patterns and relationships from the data it is exposed to. The dataset used for training a machine learning model shall be diversified and consist sufficient samples of desired categories. This paper presents various steps and its outcome in preparing the dataset of digital and thermal images of pomegranate for recognising internal defects. The defects in fruits are often categorised as surface defects and internal defects. The surface defects are recognised with digital RGB image but fails to give insight about the internal structure of the fruit in which we are often interested. The thermal images can be used to detect the internal defects in fruits. When a fruit is subjected to temperature difference as compared to the surrounding, the thermal emissions from fruit captured through a thermal camera (thermal image) gives the key information about the internal damages in the fruit. The internal defects are reflected in thermal image as variations in temperature of adjacent pixels. The k-mean segmentation is applied for identifying internal defects with thermal images in pomegranates to categorize them viz. No defect, major defect and minor defect. This information is useful for training a machine learning algorithms that are intended for bulk processing in the field of fruit defect detection and classification.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111538"},"PeriodicalIF":1.0,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143833619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}