{"title":"Instance Segmentation and Classification of Coffee Leaf Plant using Mask RCNN and Transfer Learning","authors":"Ahmed Nashat, Fatma Mazen","doi":"10.21608/fuje.2023.226247.1057","DOIUrl":null,"url":null,"abstract":"Coffee is one of the most consumed beverages in the world. It is crucial in the economy of many industrial companies in developing countries. This study proposes a deep learning algorithm called Mask RCNN to segment coffee leaves from complex real-world backgrounds and classify them as healthy and unhealthy. The RoCole dataset was manually labeled using the VGG Image annotator. The algorithm uses Resnet101 and the FPN architecture for feature extraction. The RPN creates region proposals for each feature map to separate the input image from the background. The system has a high-test accuracy of 97.76% for the binary classifier. If the image is classified as unhealthy, it goes through another segmentation stage based on the HSV color model to highlight the defective areas of the coffee leaf. The instance segmentation results showed that the mAP@50:95 was 100%, the recall@50:95 was 84.5%, and the F1-score was 91.6%.","PeriodicalId":484000,"journal":{"name":"Fayoum University Journal of Engineering","volume":"42 5-6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fayoum University Journal of Engineering","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.21608/fuje.2023.226247.1057","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Coffee is one of the most consumed beverages in the world. It is crucial in the economy of many industrial companies in developing countries. This study proposes a deep learning algorithm called Mask RCNN to segment coffee leaves from complex real-world backgrounds and classify them as healthy and unhealthy. The RoCole dataset was manually labeled using the VGG Image annotator. The algorithm uses Resnet101 and the FPN architecture for feature extraction. The RPN creates region proposals for each feature map to separate the input image from the background. The system has a high-test accuracy of 97.76% for the binary classifier. If the image is classified as unhealthy, it goes through another segmentation stage based on the HSV color model to highlight the defective areas of the coffee leaf. The instance segmentation results showed that the mAP@50:95 was 100%, the recall@50:95 was 84.5%, and the F1-score was 91.6%.