Md Tahsin , Muhammad Ibrahim , Anika Tabassum Nafisa , Maksura Binte Rabbani Nuha , Mehrab Islam Arnab , Md. Hasanul Ferdaus , Mohammad Manzurul Islam , Mohammad Rifat Ahmmad Rashid , Taskeed Jabid , Md. Sawkat Ali , Nishat Tasnim Niloy
{"title":"PaddyVarietyBD: Classifying paddy variations of Bangladesh with a novel image dataset","authors":"Md Tahsin , Muhammad Ibrahim , Anika Tabassum Nafisa , Maksura Binte Rabbani Nuha , Mehrab Islam Arnab , Md. Hasanul Ferdaus , Mohammad Manzurul Islam , Mohammad Rifat Ahmmad Rashid , Taskeed Jabid , Md. Sawkat Ali , Nishat Tasnim Niloy","doi":"10.1016/j.dib.2025.111514","DOIUrl":null,"url":null,"abstract":"<div><div>Among countless crop varieties produced worldwide, the staple food of most of Asia, some parts of Europe, and North America is rice. Being an essential food item, rice offers an integral contribution to the economy of countries like China, India, Bangladesh, Pakistan, Indonesia, and so on. Scientists have long been working on developing new and improved rice species to battle different environmental hindrances and natural calamities. Although numerous research and studies have been conducted on this diverse crop, artificial intelligence, in particular, machine learning has not been practiced in this field with its full potential. The key factors behind this lag include the unavailability of standard and ready-to-use datasets. Intending to mitigate this drawback, this paper proposes an image dataset of paddy species to assist researchers and scientists in classifying, analyzing, and evaluating paddy classes. To the best of our knowledge, this is the first standard and open dataset of paddy varieties in Bangladesh. The rice sample was collected from two places namely – Bangladesh Institute of Nuclear Agriculture (BINA) and the Bangladesh Institute of Rice Research Institute (BRRI) where agrarian scientists work on developing new or improving existing paddy species. The dataset contains 14,000 RGB microscopic images of each paddy kernel. The enormity and inclusivity of the dataset make it useful for global research purposes. The dataset can be a useful resource not only in the area of artificial intelligence, but also in agriculture, botanical, and economic research.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111514"},"PeriodicalIF":1.0000,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235234092500246X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Among countless crop varieties produced worldwide, the staple food of most of Asia, some parts of Europe, and North America is rice. Being an essential food item, rice offers an integral contribution to the economy of countries like China, India, Bangladesh, Pakistan, Indonesia, and so on. Scientists have long been working on developing new and improved rice species to battle different environmental hindrances and natural calamities. Although numerous research and studies have been conducted on this diverse crop, artificial intelligence, in particular, machine learning has not been practiced in this field with its full potential. The key factors behind this lag include the unavailability of standard and ready-to-use datasets. Intending to mitigate this drawback, this paper proposes an image dataset of paddy species to assist researchers and scientists in classifying, analyzing, and evaluating paddy classes. To the best of our knowledge, this is the first standard and open dataset of paddy varieties in Bangladesh. The rice sample was collected from two places namely – Bangladesh Institute of Nuclear Agriculture (BINA) and the Bangladesh Institute of Rice Research Institute (BRRI) where agrarian scientists work on developing new or improving existing paddy species. The dataset contains 14,000 RGB microscopic images of each paddy kernel. The enormity and inclusivity of the dataset make it useful for global research purposes. The dataset can be a useful resource not only in the area of artificial intelligence, but also in agriculture, botanical, and economic research.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.