Arnob Das Shacha, Sabbir Hossain Durjoy, Md. Emon Shikder, Md Mostafa Kamal, Md Mehedi Hasan Shoib, Md Hasan Imam Bijoy
{"title":"RoseLeafInsight:玫瑰叶病识别的高分辨率图像数据集","authors":"Arnob Das Shacha, Sabbir Hossain Durjoy, Md. Emon Shikder, Md Mostafa Kamal, Md Mehedi Hasan Shoib, Md Hasan Imam Bijoy","doi":"10.1016/j.dib.2025.111968","DOIUrl":null,"url":null,"abstract":"<div><div>The Rose (genus Rosa) has become a significant factor in the Bangladeshi flower industry, both in terms of exports and local consumption. However, rose farming in this country faces serious challenges due to diseases affecting its leaves, which weaken the plants and result in lower flower yields and financial losses for farmers. Rosa (genus Rosa) is one of the most attractive and commercially valuable flower genera. However, agricultural rose production faces several challenges, such as pesticide resistance, which affects plant growth and results in a reduced quantity and quality of healthy flowers. Several natural factors also cause interference with rose production. Most farmers involved in this industry have limited education, which hinders their ability to identify early-stage rose-leaf disease solely through visual inspection. Furthermore, limited communication with agricultural experts exacerbates the situation, leading to delayed interventions and economic losses. This study presents the rose leaf disease dataset, which would help enhance disease tracking, diagnosis, and research in roses. From October 2024 to January 2025, large-scale field surveys were conducted to capture quality images for each condition class in rose leaves. In this paper, four classes comprise ‘Black Spot,’ ‘Insect Hole,’ ‘Yellow Mosaic Virus,’ and ‘Healthy,’ representing different stages in disease progression. There are 3,228 original images, categorized as follows: Black Spot (409), Insect Hole (453), Yellow Mosaic Virus (680), and Healthy (1,686). During the pre-processing stage, the images are resized to 3000×3000 pixels, and low-quality, duplicate, or irrelevant images are removed to ensure high quality. We have employed various augmentation techniques, including rotation, flipping, contrast adjustment, blurring, shearing, zooming, and noise addition, to increase the dataset size and enhance model generalization. Datasets like this one are in high demand for agricultural research, leading to improved disease management and increased yields. These goals can be achieved through high-accuracy machine-learning models for early disease detection and cause identification. This gives the farmers more time to take necessary actions for disease prevention and pest control. This tech-based system combines the field of agriculture with the cutting edge of computer science and AI, making precision agriculture even more effective and efficient. Our dataset is designed to meet the need for data to train these models and provide a baseline benchmark for disease detection in our specific crop, the Rose. Improvements in different generations of models, as well as numerous other forms of scientific advancements, can lead to further increases in efficiency and ultimately result in better, smarter farms. In our initial testing for categorizing rose leaves, we employed two well-known transfer learning models. Among them, MobileNetV2 performed exceptionally well, achieving an accuracy of 96.79% in image classification. This dataset can be integrated with innovative farming equipment, such as drones and sensors, to monitor large fields in real-time. This dataset serves as a benchmark for training deep learning models, enabling enhanced automated monitoring and decision-making in precision agriculture.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"62 ","pages":"Article 111968"},"PeriodicalIF":1.4000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RoseLeafInsight: A high-resolution image dataset for rose leaf disease recognition\",\"authors\":\"Arnob Das Shacha, Sabbir Hossain Durjoy, Md. Emon Shikder, Md Mostafa Kamal, Md Mehedi Hasan Shoib, Md Hasan Imam Bijoy\",\"doi\":\"10.1016/j.dib.2025.111968\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The Rose (genus Rosa) has become a significant factor in the Bangladeshi flower industry, both in terms of exports and local consumption. However, rose farming in this country faces serious challenges due to diseases affecting its leaves, which weaken the plants and result in lower flower yields and financial losses for farmers. Rosa (genus Rosa) is one of the most attractive and commercially valuable flower genera. However, agricultural rose production faces several challenges, such as pesticide resistance, which affects plant growth and results in a reduced quantity and quality of healthy flowers. Several natural factors also cause interference with rose production. Most farmers involved in this industry have limited education, which hinders their ability to identify early-stage rose-leaf disease solely through visual inspection. Furthermore, limited communication with agricultural experts exacerbates the situation, leading to delayed interventions and economic losses. This study presents the rose leaf disease dataset, which would help enhance disease tracking, diagnosis, and research in roses. From October 2024 to January 2025, large-scale field surveys were conducted to capture quality images for each condition class in rose leaves. In this paper, four classes comprise ‘Black Spot,’ ‘Insect Hole,’ ‘Yellow Mosaic Virus,’ and ‘Healthy,’ representing different stages in disease progression. There are 3,228 original images, categorized as follows: Black Spot (409), Insect Hole (453), Yellow Mosaic Virus (680), and Healthy (1,686). During the pre-processing stage, the images are resized to 3000×3000 pixels, and low-quality, duplicate, or irrelevant images are removed to ensure high quality. We have employed various augmentation techniques, including rotation, flipping, contrast adjustment, blurring, shearing, zooming, and noise addition, to increase the dataset size and enhance model generalization. Datasets like this one are in high demand for agricultural research, leading to improved disease management and increased yields. These goals can be achieved through high-accuracy machine-learning models for early disease detection and cause identification. This gives the farmers more time to take necessary actions for disease prevention and pest control. This tech-based system combines the field of agriculture with the cutting edge of computer science and AI, making precision agriculture even more effective and efficient. Our dataset is designed to meet the need for data to train these models and provide a baseline benchmark for disease detection in our specific crop, the Rose. Improvements in different generations of models, as well as numerous other forms of scientific advancements, can lead to further increases in efficiency and ultimately result in better, smarter farms. In our initial testing for categorizing rose leaves, we employed two well-known transfer learning models. Among them, MobileNetV2 performed exceptionally well, achieving an accuracy of 96.79% in image classification. This dataset can be integrated with innovative farming equipment, such as drones and sensors, to monitor large fields in real-time. This dataset serves as a benchmark for training deep learning models, enabling enhanced automated monitoring and decision-making in precision agriculture.</div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"62 \",\"pages\":\"Article 111968\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340925006924\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925006924","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
RoseLeafInsight: A high-resolution image dataset for rose leaf disease recognition
The Rose (genus Rosa) has become a significant factor in the Bangladeshi flower industry, both in terms of exports and local consumption. However, rose farming in this country faces serious challenges due to diseases affecting its leaves, which weaken the plants and result in lower flower yields and financial losses for farmers. Rosa (genus Rosa) is one of the most attractive and commercially valuable flower genera. However, agricultural rose production faces several challenges, such as pesticide resistance, which affects plant growth and results in a reduced quantity and quality of healthy flowers. Several natural factors also cause interference with rose production. Most farmers involved in this industry have limited education, which hinders their ability to identify early-stage rose-leaf disease solely through visual inspection. Furthermore, limited communication with agricultural experts exacerbates the situation, leading to delayed interventions and economic losses. This study presents the rose leaf disease dataset, which would help enhance disease tracking, diagnosis, and research in roses. From October 2024 to January 2025, large-scale field surveys were conducted to capture quality images for each condition class in rose leaves. In this paper, four classes comprise ‘Black Spot,’ ‘Insect Hole,’ ‘Yellow Mosaic Virus,’ and ‘Healthy,’ representing different stages in disease progression. There are 3,228 original images, categorized as follows: Black Spot (409), Insect Hole (453), Yellow Mosaic Virus (680), and Healthy (1,686). During the pre-processing stage, the images are resized to 3000×3000 pixels, and low-quality, duplicate, or irrelevant images are removed to ensure high quality. We have employed various augmentation techniques, including rotation, flipping, contrast adjustment, blurring, shearing, zooming, and noise addition, to increase the dataset size and enhance model generalization. Datasets like this one are in high demand for agricultural research, leading to improved disease management and increased yields. These goals can be achieved through high-accuracy machine-learning models for early disease detection and cause identification. This gives the farmers more time to take necessary actions for disease prevention and pest control. This tech-based system combines the field of agriculture with the cutting edge of computer science and AI, making precision agriculture even more effective and efficient. Our dataset is designed to meet the need for data to train these models and provide a baseline benchmark for disease detection in our specific crop, the Rose. Improvements in different generations of models, as well as numerous other forms of scientific advancements, can lead to further increases in efficiency and ultimately result in better, smarter farms. In our initial testing for categorizing rose leaves, we employed two well-known transfer learning models. Among them, MobileNetV2 performed exceptionally well, achieving an accuracy of 96.79% in image classification. This dataset can be integrated with innovative farming equipment, such as drones and sensors, to monitor large fields in real-time. This dataset serves as a benchmark for training deep learning models, enabling enhanced automated monitoring and decision-making in precision agriculture.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.