RoseLeafInsight:玫瑰叶病识别的高分辨率图像数据集

IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES
Arnob Das Shacha, Sabbir Hossain Durjoy, Md. Emon Shikder, Md Mostafa Kamal, Md Mehedi Hasan Shoib, Md Hasan Imam Bijoy
{"title":"RoseLeafInsight:玫瑰叶病识别的高分辨率图像数据集","authors":"Arnob Das Shacha,&nbsp;Sabbir Hossain Durjoy,&nbsp;Md. Emon Shikder,&nbsp;Md Mostafa Kamal,&nbsp;Md Mehedi Hasan Shoib,&nbsp;Md Hasan Imam Bijoy","doi":"10.1016/j.dib.2025.111968","DOIUrl":null,"url":null,"abstract":"<div><div>The Rose (genus Rosa) has become a significant factor in the Bangladeshi flower industry, both in terms of exports and local consumption. However, rose farming in this country faces serious challenges due to diseases affecting its leaves, which weaken the plants and result in lower flower yields and financial losses for farmers. Rosa (genus Rosa) is one of the most attractive and commercially valuable flower genera. However, agricultural rose production faces several challenges, such as pesticide resistance, which affects plant growth and results in a reduced quantity and quality of healthy flowers. Several natural factors also cause interference with rose production. Most farmers involved in this industry have limited education, which hinders their ability to identify early-stage rose-leaf disease solely through visual inspection. Furthermore, limited communication with agricultural experts exacerbates the situation, leading to delayed interventions and economic losses. This study presents the rose leaf disease dataset, which would help enhance disease tracking, diagnosis, and research in roses. From October 2024 to January 2025, large-scale field surveys were conducted to capture quality images for each condition class in rose leaves. In this paper, four classes comprise ‘Black Spot,’ ‘Insect Hole,’ ‘Yellow Mosaic Virus,’ and ‘Healthy,’ representing different stages in disease progression. There are 3,228 original images, categorized as follows: Black Spot (409), Insect Hole (453), Yellow Mosaic Virus (680), and Healthy (1,686). During the pre-processing stage, the images are resized to 3000×3000 pixels, and low-quality, duplicate, or irrelevant images are removed to ensure high quality. We have employed various augmentation techniques, including rotation, flipping, contrast adjustment, blurring, shearing, zooming, and noise addition, to increase the dataset size and enhance model generalization. Datasets like this one are in high demand for agricultural research, leading to improved disease management and increased yields. These goals can be achieved through high-accuracy machine-learning models for early disease detection and cause identification. This gives the farmers more time to take necessary actions for disease prevention and pest control. This tech-based system combines the field of agriculture with the cutting edge of computer science and AI, making precision agriculture even more effective and efficient. Our dataset is designed to meet the need for data to train these models and provide a baseline benchmark for disease detection in our specific crop, the Rose. Improvements in different generations of models, as well as numerous other forms of scientific advancements, can lead to further increases in efficiency and ultimately result in better, smarter farms. In our initial testing for categorizing rose leaves, we employed two well-known transfer learning models. Among them, MobileNetV2 performed exceptionally well, achieving an accuracy of 96.79% in image classification. This dataset can be integrated with innovative farming equipment, such as drones and sensors, to monitor large fields in real-time. This dataset serves as a benchmark for training deep learning models, enabling enhanced automated monitoring and decision-making in precision agriculture.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"62 ","pages":"Article 111968"},"PeriodicalIF":1.4000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RoseLeafInsight: A high-resolution image dataset for rose leaf disease recognition\",\"authors\":\"Arnob Das Shacha,&nbsp;Sabbir Hossain Durjoy,&nbsp;Md. Emon Shikder,&nbsp;Md Mostafa Kamal,&nbsp;Md Mehedi Hasan Shoib,&nbsp;Md Hasan Imam Bijoy\",\"doi\":\"10.1016/j.dib.2025.111968\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The Rose (genus Rosa) has become a significant factor in the Bangladeshi flower industry, both in terms of exports and local consumption. However, rose farming in this country faces serious challenges due to diseases affecting its leaves, which weaken the plants and result in lower flower yields and financial losses for farmers. Rosa (genus Rosa) is one of the most attractive and commercially valuable flower genera. However, agricultural rose production faces several challenges, such as pesticide resistance, which affects plant growth and results in a reduced quantity and quality of healthy flowers. Several natural factors also cause interference with rose production. Most farmers involved in this industry have limited education, which hinders their ability to identify early-stage rose-leaf disease solely through visual inspection. Furthermore, limited communication with agricultural experts exacerbates the situation, leading to delayed interventions and economic losses. This study presents the rose leaf disease dataset, which would help enhance disease tracking, diagnosis, and research in roses. From October 2024 to January 2025, large-scale field surveys were conducted to capture quality images for each condition class in rose leaves. In this paper, four classes comprise ‘Black Spot,’ ‘Insect Hole,’ ‘Yellow Mosaic Virus,’ and ‘Healthy,’ representing different stages in disease progression. There are 3,228 original images, categorized as follows: Black Spot (409), Insect Hole (453), Yellow Mosaic Virus (680), and Healthy (1,686). During the pre-processing stage, the images are resized to 3000×3000 pixels, and low-quality, duplicate, or irrelevant images are removed to ensure high quality. We have employed various augmentation techniques, including rotation, flipping, contrast adjustment, blurring, shearing, zooming, and noise addition, to increase the dataset size and enhance model generalization. Datasets like this one are in high demand for agricultural research, leading to improved disease management and increased yields. These goals can be achieved through high-accuracy machine-learning models for early disease detection and cause identification. This gives the farmers more time to take necessary actions for disease prevention and pest control. This tech-based system combines the field of agriculture with the cutting edge of computer science and AI, making precision agriculture even more effective and efficient. Our dataset is designed to meet the need for data to train these models and provide a baseline benchmark for disease detection in our specific crop, the Rose. Improvements in different generations of models, as well as numerous other forms of scientific advancements, can lead to further increases in efficiency and ultimately result in better, smarter farms. In our initial testing for categorizing rose leaves, we employed two well-known transfer learning models. Among them, MobileNetV2 performed exceptionally well, achieving an accuracy of 96.79% in image classification. This dataset can be integrated with innovative farming equipment, such as drones and sensors, to monitor large fields in real-time. This dataset serves as a benchmark for training deep learning models, enabling enhanced automated monitoring and decision-making in precision agriculture.</div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"62 \",\"pages\":\"Article 111968\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340925006924\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925006924","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

在出口和当地消费方面,玫瑰(Rosa属)已成为孟加拉国花卉产业的重要因素。然而,这个国家的玫瑰种植面临着严重的挑战,因为病害影响了它的叶子,使植物变弱,导致开花产量下降,给农民造成经济损失。玫瑰(Rosa属)是最具吸引力和商业价值的花属之一。然而,农业玫瑰生产面临着一些挑战,例如农药抗性,这影响了植物生长,导致健康花朵的数量和质量下降。一些自然因素也会干扰玫瑰的生产。大多数从事这一行业的农民受教育程度有限,这阻碍了他们仅通过目视检查识别早期玫瑰叶病的能力。此外,与农业专家的有限沟通加剧了这种情况,导致干预措施延迟和经济损失。本研究提出了玫瑰叶病数据集,这将有助于加强玫瑰疾病的跟踪、诊断和研究。从2024年10月到2025年1月,进行了大规模的野外调查,捕获了玫瑰叶片中每个条件类别的高质量图像。本文将“黑斑病”、“虫洞病”、“黄花叶病毒”和“健康病”分为4个类别,分别代表疾病发展的不同阶段。共有3228张原始图像,分类如下:黑斑(409张)、虫洞(453张)、黄花叶病毒(680张)和健康(1686张)。在预处理阶段,将图像调整为3000×3000像素,并删除低质量、重复或不相关的图像以确保高质量。我们采用了各种增强技术,包括旋转、翻转、对比度调整、模糊、剪切、缩放和噪声添加,以增加数据集大小并增强模型泛化。像这样的数据集在农业研究中需求量很大,从而改善了疾病管理并提高了产量。这些目标可以通过用于早期疾病检测和病因识别的高精度机器学习模型来实现。这使农民有更多的时间采取必要的措施来预防疾病和控制害虫。这个以技术为基础的系统将农业领域与计算机科学和人工智能的前沿相结合,使精准农业更加有效和高效。我们的数据集旨在满足训练这些模型的数据需求,并为我们的特定作物玫瑰的疾病检测提供基线基准。不同一代模型的改进,以及许多其他形式的科学进步,可以进一步提高效率,最终形成更好、更智能的农场。在我们对玫瑰叶进行分类的初步测试中,我们采用了两个众所周知的迁移学习模型。其中,MobileNetV2表现异常出色,图像分类准确率达到96.79%。该数据集可以与创新的农业设备(如无人机和传感器)集成,以实时监控大片农田。该数据集可作为训练深度学习模型的基准,从而增强精准农业的自动化监测和决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
RoseLeafInsight: A high-resolution image dataset for rose leaf disease recognition
The Rose (genus Rosa) has become a significant factor in the Bangladeshi flower industry, both in terms of exports and local consumption. However, rose farming in this country faces serious challenges due to diseases affecting its leaves, which weaken the plants and result in lower flower yields and financial losses for farmers. Rosa (genus Rosa) is one of the most attractive and commercially valuable flower genera. However, agricultural rose production faces several challenges, such as pesticide resistance, which affects plant growth and results in a reduced quantity and quality of healthy flowers. Several natural factors also cause interference with rose production. Most farmers involved in this industry have limited education, which hinders their ability to identify early-stage rose-leaf disease solely through visual inspection. Furthermore, limited communication with agricultural experts exacerbates the situation, leading to delayed interventions and economic losses. This study presents the rose leaf disease dataset, which would help enhance disease tracking, diagnosis, and research in roses. From October 2024 to January 2025, large-scale field surveys were conducted to capture quality images for each condition class in rose leaves. In this paper, four classes comprise ‘Black Spot,’ ‘Insect Hole,’ ‘Yellow Mosaic Virus,’ and ‘Healthy,’ representing different stages in disease progression. There are 3,228 original images, categorized as follows: Black Spot (409), Insect Hole (453), Yellow Mosaic Virus (680), and Healthy (1,686). During the pre-processing stage, the images are resized to 3000×3000 pixels, and low-quality, duplicate, or irrelevant images are removed to ensure high quality. We have employed various augmentation techniques, including rotation, flipping, contrast adjustment, blurring, shearing, zooming, and noise addition, to increase the dataset size and enhance model generalization. Datasets like this one are in high demand for agricultural research, leading to improved disease management and increased yields. These goals can be achieved through high-accuracy machine-learning models for early disease detection and cause identification. This gives the farmers more time to take necessary actions for disease prevention and pest control. This tech-based system combines the field of agriculture with the cutting edge of computer science and AI, making precision agriculture even more effective and efficient. Our dataset is designed to meet the need for data to train these models and provide a baseline benchmark for disease detection in our specific crop, the Rose. Improvements in different generations of models, as well as numerous other forms of scientific advancements, can lead to further increases in efficiency and ultimately result in better, smarter farms. In our initial testing for categorizing rose leaves, we employed two well-known transfer learning models. Among them, MobileNetV2 performed exceptionally well, achieving an accuracy of 96.79% in image classification. This dataset can be integrated with innovative farming equipment, such as drones and sensors, to monitor large fields in real-time. This dataset serves as a benchmark for training deep learning models, enabling enhanced automated monitoring and decision-making in precision agriculture.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Data in Brief
Data in Brief MULTIDISCIPLINARY SCIENCES-
CiteScore
3.10
自引率
0.00%
发文量
996
审稿时长
70 days
期刊介绍: Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信