{"title":"秋葵病分类与分割数据集:数据集的收集、分析与应用","authors":"K. Sowmiya , M. Thenmozhi","doi":"10.1016/j.dib.2025.111662","DOIUrl":null,"url":null,"abstract":"<div><div>The early diagnosis of okra leaf diseases is crucial for maintaining crop health and ensuring high agricultural productivity. To facilitate the development of robust deep learning models for automated disease detection, we present a comprehensive dataset of 2500 okra leaf images collected from real-time agricultural fields in India. The dataset consists of six classes, including healthy leaves (Class 0) and five diseased categories: Leaf Curly Virus (Class 1), Alternaria Leaf Spot (Class 2), Cercospora Leaf Spot (Class 3), Phyllosticta Leaf Spot (Class 4), and Downy Mildew (Class 5). Each image is resized to 224 × 224 pixels to ensure compatibility with standard deep learning models. The primary objective of this dataset collection is to provide a benchmark resource for researchers working on early-stage plant disease classification, detection and segmentation. This dataset is unique as it is one of the first publicly available Indian okra leaf disease datasets captured in real-world conditions, incorporating natural variations in lighting, leaf positioning, and environmental factors. It serves as a valuable resource for future young researchers in the field of smart agriculture, enabling advancements in machine learning-based disease diagnosis, smart farming applications, and precision agriculture. Future enhancements will focus on expanding the dataset with more images, including different growth stages and environmental conditions, to improve model generalization and real-world applicability.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"61 ","pages":"Article 111662"},"PeriodicalIF":1.0000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Okra disease dataset for classification and segmentation: Dataset collection, analysis and applications\",\"authors\":\"K. Sowmiya , M. Thenmozhi\",\"doi\":\"10.1016/j.dib.2025.111662\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The early diagnosis of okra leaf diseases is crucial for maintaining crop health and ensuring high agricultural productivity. To facilitate the development of robust deep learning models for automated disease detection, we present a comprehensive dataset of 2500 okra leaf images collected from real-time agricultural fields in India. The dataset consists of six classes, including healthy leaves (Class 0) and five diseased categories: Leaf Curly Virus (Class 1), Alternaria Leaf Spot (Class 2), Cercospora Leaf Spot (Class 3), Phyllosticta Leaf Spot (Class 4), and Downy Mildew (Class 5). Each image is resized to 224 × 224 pixels to ensure compatibility with standard deep learning models. The primary objective of this dataset collection is to provide a benchmark resource for researchers working on early-stage plant disease classification, detection and segmentation. This dataset is unique as it is one of the first publicly available Indian okra leaf disease datasets captured in real-world conditions, incorporating natural variations in lighting, leaf positioning, and environmental factors. It serves as a valuable resource for future young researchers in the field of smart agriculture, enabling advancements in machine learning-based disease diagnosis, smart farming applications, and precision agriculture. Future enhancements will focus on expanding the dataset with more images, including different growth stages and environmental conditions, to improve model generalization and real-world applicability.</div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"61 \",\"pages\":\"Article 111662\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2025-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340925003920\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925003920","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Okra disease dataset for classification and segmentation: Dataset collection, analysis and applications
The early diagnosis of okra leaf diseases is crucial for maintaining crop health and ensuring high agricultural productivity. To facilitate the development of robust deep learning models for automated disease detection, we present a comprehensive dataset of 2500 okra leaf images collected from real-time agricultural fields in India. The dataset consists of six classes, including healthy leaves (Class 0) and five diseased categories: Leaf Curly Virus (Class 1), Alternaria Leaf Spot (Class 2), Cercospora Leaf Spot (Class 3), Phyllosticta Leaf Spot (Class 4), and Downy Mildew (Class 5). Each image is resized to 224 × 224 pixels to ensure compatibility with standard deep learning models. The primary objective of this dataset collection is to provide a benchmark resource for researchers working on early-stage plant disease classification, detection and segmentation. This dataset is unique as it is one of the first publicly available Indian okra leaf disease datasets captured in real-world conditions, incorporating natural variations in lighting, leaf positioning, and environmental factors. It serves as a valuable resource for future young researchers in the field of smart agriculture, enabling advancements in machine learning-based disease diagnosis, smart farming applications, and precision agriculture. Future enhancements will focus on expanding the dataset with more images, including different growth stages and environmental conditions, to improve model generalization and real-world applicability.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.