Alifya Febriana, K. Muchtar, R. Dawood, Chih-Yang Lin
{"title":"USK-COFFEE Dataset: A Multi-Class Green Arabica Coffee Bean Dataset for Deep Learning","authors":"Alifya Febriana, K. Muchtar, R. Dawood, Chih-Yang Lin","doi":"10.1109/CyberneticsCom55287.2022.9865489","DOIUrl":null,"url":null,"abstract":"Coffee is one of the plantation commodities that plays a big role in the world economy. According to the classification of coffee, each type of coffee has various shapes and textures. Traditional human visual sorting of coffee beans is time-consuming, labor-intensive, and may result in low-quality coffee due to work stress and exhaustion. The contribution of this paper is twofold. First, a new dataset, called USK-Coffee, which contains a total of 8.000 images and is divided into 4 classes, is created and made publicly available. To the best of our knowledge, the USK-Coffee dataset is currently the most comprehensive green coffee bean dataset. Second, this study aims to offer a lightweight and understandable intelligent coffee bean sort accurately system that uses deep learning (DL) to assist farmers in sorting green bean arabica by variety. To be specific, this paper presents a baseline for classification performance on the dataset using the benchmark deep learning models, MobileNetV2, and ResNet-18. These models achieved an average classification accuracy of 81.31% and 81.12%, respectively. The dataset is available at: http://comvis.unsyiah.ac.id/usk-coffee/","PeriodicalId":178279,"journal":{"name":"2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom)","volume":"214 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CyberneticsCom55287.2022.9865489","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Coffee is one of the plantation commodities that plays a big role in the world economy. According to the classification of coffee, each type of coffee has various shapes and textures. Traditional human visual sorting of coffee beans is time-consuming, labor-intensive, and may result in low-quality coffee due to work stress and exhaustion. The contribution of this paper is twofold. First, a new dataset, called USK-Coffee, which contains a total of 8.000 images and is divided into 4 classes, is created and made publicly available. To the best of our knowledge, the USK-Coffee dataset is currently the most comprehensive green coffee bean dataset. Second, this study aims to offer a lightweight and understandable intelligent coffee bean sort accurately system that uses deep learning (DL) to assist farmers in sorting green bean arabica by variety. To be specific, this paper presents a baseline for classification performance on the dataset using the benchmark deep learning models, MobileNetV2, and ResNet-18. These models achieved an average classification accuracy of 81.31% and 81.12%, respectively. The dataset is available at: http://comvis.unsyiah.ac.id/usk-coffee/