{"title":"Dataset creation of thermal images of pomegranate for internal defect detection","authors":"Ashvini Gaikwad, Manoj Deshpande, Varsha Bhole","doi":"10.1016/j.dib.2025.111538","DOIUrl":null,"url":null,"abstract":"<div><div>Datasets are crucial in various fields, especially in the context of machine learning, data science and research. Datasets are used to train machine learning models. A model learns patterns and relationships from the data it is exposed to. The dataset used for training a machine learning model shall be diversified and consist sufficient samples of desired categories. This paper presents various steps and its outcome in preparing the dataset of digital and thermal images of pomegranate for recognising internal defects. The defects in fruits are often categorised as surface defects and internal defects. The surface defects are recognised with digital RGB image but fails to give insight about the internal structure of the fruit in which we are often interested. The thermal images can be used to detect the internal defects in fruits. When a fruit is subjected to temperature difference as compared to the surrounding, the thermal emissions from fruit captured through a thermal camera (thermal image) gives the key information about the internal damages in the fruit. The internal defects are reflected in thermal image as variations in temperature of adjacent pixels. The k-mean segmentation is applied for identifying internal defects with thermal images in pomegranates to categorize them viz. No defect, major defect and minor defect. This information is useful for training a machine learning algorithms that are intended for bulk processing in the field of fruit defect detection and classification.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111538"},"PeriodicalIF":1.0000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925002707","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Datasets are crucial in various fields, especially in the context of machine learning, data science and research. Datasets are used to train machine learning models. A model learns patterns and relationships from the data it is exposed to. The dataset used for training a machine learning model shall be diversified and consist sufficient samples of desired categories. This paper presents various steps and its outcome in preparing the dataset of digital and thermal images of pomegranate for recognising internal defects. The defects in fruits are often categorised as surface defects and internal defects. The surface defects are recognised with digital RGB image but fails to give insight about the internal structure of the fruit in which we are often interested. The thermal images can be used to detect the internal defects in fruits. When a fruit is subjected to temperature difference as compared to the surrounding, the thermal emissions from fruit captured through a thermal camera (thermal image) gives the key information about the internal damages in the fruit. The internal defects are reflected in thermal image as variations in temperature of adjacent pixels. The k-mean segmentation is applied for identifying internal defects with thermal images in pomegranates to categorize them viz. No defect, major defect and minor defect. This information is useful for training a machine learning algorithms that are intended for bulk processing in the field of fruit defect detection and classification.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.